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Abstract 


Interferometric  lithography,  the  interference  of  a  small  number  of  coherent  optical  beams,  is  a 
powerful  technique  for  the  fabrication  of  a  wide  array  of  samples.  The  techniques  and  limits  of 
interferometric  lithography  are  discussed  with  particular  attention  to  the  smallest  scales 
achievable.  With  immersion  techniques,  the  smallest  pattern  size  for  a  single  exposure  is  a  half 
pitch  of  A/4 n  were  A  is  the  optical  wavelength  and  n  is  the  refractive  index  of  the  immersion 
material.  Currently  with  a  193-nm  excimer  laser  source  and  H2O  immersion,  this  limiting 
dimension  is  ~  34  nm.  With  nonlinear  spatial  frequency  multiplication  techniques,  this  limit  is 
extended  by  factors  of  1/2,  1/3,  etc.  -  extending  well  into  the  nanoscale  regime.  Interferometric 
lithography  provides  an  inexpensive,  large-area  capability  as  a  result  of  its  parallelism.  Imaging 
interferometric  lithography  provides  an  approach  to  arbitrary  structures  with  comparable 
resolution.  Imaging  interferometric  microscopy  provides  an  alternative  super-resolution 
technique  with  application  to  lithographic  mask  inspection.  Numerous  application  areas  include: 
nanoscale  epitaxial  growth  for  semiconductor  heterostructures;  nanofluidics  for  biological 
separations;  nanophotonics  including  distributed  feedback  and  distributed  Bragg  reflectors,  2D 
and  3D  photonic  crystals,  metamaterials  and  negative  refractive  index  materials  for  enhanced 
optical  interactions. 
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Deep  Subwavelength  Optical  Nanolithography 


I  INTRODUCTION 

Nanoscience  and  nanotechnology  are  increasingly  important  research  directions  because  of 
the  exciting  new  physics  and  chemistry  that  become  accessible  at  the  nanoscale  and  because  of 
the  accompanying  potential  for  new  products  and  processes  that  will  have  a  major  impact  on  the 
world’s  technological  future.  The  new  science  involves  both  the  increased  importance  of  sur¬ 
faces  (e.g.  increased  surface/volume  ratio)  and  of  the  chemical/steric  effects  that  are  the  basis  of 
much  of  molecular  biology,  as  well  as  the  quantum  phenomena  that  become  dominant  as  the  ma¬ 
terial  scale  becomes  comparable  to  the  wavefunctions  of  elementary  excitations:  electrons,  holes, 
photons,  spin  waves,  magnetic  excitations,  and  others. 

Optical  lithography  is  well  established  as  the  manufacturing  technology  of  choice  for  the 
semiconductor  integrated  circuit  industry  which  is  today  well  into  the  nanoscale  with  ~  65 -nm 
gate  lengths  in  large-volume  manufacturing  production.  With  recent  developments  such  as  im¬ 
mersion  lithography,  it  now  appears  that  the  hegemony  of  optical  lithography  has  a  significant 
duration  yet  to  run,  despite  the  continuing  refrain  that  the  end  of  optical  lithography  as  we  know 
it  is  neigh  upon  us.  In  this  context,  it  is  worth  noting  that  predictions  of  the  end  of  optical  lithog¬ 
raphy  have  held  steady  at  “two  generations  out”  for  roughly  the  past  30  years.  The  dual  purposes 
of  this  article  are  to  discuss  the  limits  to  optical  lithography  on  very  general  grounds  and  to  point 
out  the  opportunities  for  exciting  nanoscience  research  that  are  made  possible  by  its  capabilities. 

Any  discussion  of  tools  for  nanoscience  research  necessarily  involves  consideration  of  the 
associated  costs  -  in  resources  and  in  time.  The  research  community  never  has  resources  for  eve¬ 
rything  and  must  make  choices  based  on  affordability  and  productivity.  Thus,  for  example,  e- 
beam  lithography  can  produce  structures  at  scales  smaller  than  optical  lithography  and  with  an 
almost  complete  pattern  flexibility  (within  the  limits  imposed  by  proximity  effects)  and  is  a  sta¬ 
ple  of  nanoscience  research.  Nonetheless,  there  are  many  applications  that  require  large  areas 
(many  cm2  of  nanopattemed  material)  where  e-beam  lithography  is  not  a  viable  approach  as  a 
result  of  its  serial  point-by-point  writing  modality  and  the  consequent  long  times  involved  in  ad¬ 
dressing  the  large  number  of  pixels  in  a  large-area  nanoscale  image.  A  linear  pixel  resolution  of 
20  nm  corresponds  to  a  density  of  2.5xl0u  cm'2!  This  issue  only  gets  worse  as  the  patterning 
gets  deeper  into  the  nanoscale.  As  an  aside,  this  is  a  powerful  argument  applying  to  any  tech¬ 
nique,  such  as  the  various  approaches  to  maskless  lithography,  that  requires  storage  and  transfer 
of  information  on  an  individual  pixel  basis,  the  data  transfer  demands  are  very  difficult,  even 
with  today’s  advanced  computing  and  communication  technologies. 

Optical  lithography,  in  contrast,  is  a  parallel  writing  technique.  Traditional  optical  lithogra¬ 
phy  uses  a  mask-based  approach  along  with  optical  reduction  to  ameliorate  the  demands  on  both 
the  mask  fabrication  and  on  the  optical  system.  Once  the  mask  is  fabricated,  all  of  the  informa¬ 
tion  on  the  mask  is  transferred  onto  the  wafer  in  the  lithography  step.  The  economics  of  the  inte¬ 
grated  circuit  industry  has  put  a  premium  on  throughput  performance.  Current  lithography  tools 
expose  ~  85  wafers  per  hour.  Each  300-mm  diameter  wafer  contains  ~  125  die  with  an  area  of 
22x36  mm  =  8.2  cm  .  This  corresponds  to  a  sustained  information  data  rate  onto  the  wafer  of  ~ 
1  THz! 


Nonetheless,  the  cost  of  optical  lithography  remains  an  issue  for  nanotechnology  both  at  the 
research  and  the  early-stage  commercialization  phases.  The  impressive  and  very  capable  lithog¬ 
raphy  tools  used  by  the  integrated  circuit  industry  are  very  expensive  both  in  initial  (~  $20  mil¬ 
lion)  and  in  operating  costs  (typical  mask  set  costs  for  a  modem  microprocessor  exceed  $1  mil¬ 
lion).  Clearly  these  numbers  only  make  sense  in  a  high  volume,  high  product- value  manufactur¬ 
ing  context,  and  are  out  of  reach  for  a  typical  research  venue. 

Fortunately,  many  applications  require  only  a  periodic  or  quasi-periodic  pattern  and  a  much 
simpler  laboratory  scale  technology,  interferometric  lithography,  based  on  the  interference  of  a 
small  number  (most  often  two)  coherent  laser  beams  can  produce  useful  patterns  over  large  ar¬ 
eas,  and  volumes,  with  considerable,  but  not  total,  pattern  flexibility,  and  with  dimensions  that 
today  are  approaching  20-nm  scales.  Combining  multiple  interferometric  exposures  and  mix- 
and-match  with  lower-resolution,  laboratory-scale  optical  lithography  and  with  limited  use  of 
higher-resolution  e-beam  lithography  dramatically  expands  the  available  range  of  patterns. 

Nonlinearities  at  all  stages  of  the  patterning  process  can  be  used  to  further  extend  the  spatial 
scales.  In  simple  terms,  the  aerial  intensity  image  in  interferometric  lithography  is  a  simple 
[l+cos(2 kxsind)]  functional  dependence,  where  k  =  lunik  is  the  optical  wavevector  in  the  me¬ 
dium  above  the  resist  and  0  is  the  angle  of  incidence  in  the  same  medium.  After  development  of 
the  photoresist,  a  square  wave  lineshape  is  typically  achieved,  described  by  the  functional  de¬ 
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cos(2/Axsin<9)  where  a  is  the  critical  dimension  or  CD  (width  of 


the  line).  This  expression  clearly  contains  spatial  frequencies  well  above  those  in  the  aerial  im¬ 
age  and  provides  justification  for  the  claim  that  there  is  no  fundamental  limit  to  optical  lithogra¬ 
phy  [1],  These  nonlinear  concepts,  discussed  in  more  detail  in  Section  II.4,  are  likely  to  extend 
the  reach  of  optical  lithography  by  another  factor  of  at  least  two,  to  scales  of  ~  10  nm  with  a  193- 
nm  source.  Further  similar  extensions  by  integral  factors,  3,  4,  ...  are  possible  but  will  be  in¬ 
creasingly  difficult  as  a  result  of  alignment,  process,  uniformity  and  edge  roughness  issues.  An¬ 
other  direction  is  the  use  of  directed  self-assembly  -  using  molecular  moieties  as  rulers  to  pro¬ 
ceed  further  into  the  nanoscale  while  retaining  the  capabilities  of  optical  lithography  to  hierar¬ 
chically  span  the  range  of  dimensions  from  cm’s  down  to  ~  10  nm. 

The  concepts  of  interferometric  lithography  and  examples  of  results  are  presented  in  section 
II  starting  from  the  simplest  case  of  two-beam  interference.  Particular  emphasis  is  placed  on  the 
recent  re-emergence  of  an  old  idea  -  immersion  interferometric  lithography.  This  section  con¬ 
cludes  with  a  discussion  of  the  limits  of  interferometric  lithography.  In  section  III,  a  very  brief 
discussion  of  the  extension  of  these  ideas  to  a  full  imaging  solution  is  presented.  This  is  already 
the  topic  of  many  more  extensive  treatments  and  cannot  be  fully  developed  in  the  available 
space.  The  remainder  of  the  report  contains  a  survey  of  active  research  areas  for  the  application 
of  these  technologies.  Examples  are  drawn  from  nanoscale  epitaxial  crystal  growth,  nanoflu- 
idics/nanobiological  applications,  nanomagnetics,  and  nanophotonics.  Details  are  included  in  the 
attached  articles.  The  treatment  presented  here  is  necessarily  very  high-level.  Nonetheless,  the 
juxtaposition  of  this  wide  range  of  application  areas  serves  to  make  the  major  point  -  optical  and 
especially  interferometric  lithography  are  enabling  technologies  that  allow  research  into,  and 
even  manufacturing  of,  a  wide  variety  of  exciting  nanoscience  and  nanotechnology  -  at  an  af¬ 
fordable  cost  that  is  broadly  accessible. 


II  INTERFEROMETRIC  LITHOGRAPHY  (IL) 

Interferometric  lithography  is  the  use  of  a  small  number  of  coherent  optical  beams  incident  from 
different  directions  on  a  2D  (thin  photoresist  film  «  X)  or  3D  (thick  photosensitive  layer  »  X) 
to  produce  an  interference  pattern  whose  intensity  distribution  is  recorded  in  the  photosensitive 
layer  and  is  later  transferred  (developed)  by  thermal  and  chemical  processes.  In  the  literature, 
this  concept  has  been  variously  referred  to  as  holographic  lithography,  interference  lithography, 
and  interferometric  lithography. 

Interferometric  lithography  (IL)  is  a  well-established  concept  that  has  largely  tracked  the 
application  of  lasers  to  lithography  [2]-[6],  Early  work  was  based  on  cw,  single  mode  lasers  such 
as  the  Ar+  line  at  488  and  the  Cd+  line  at  442  nm.  With  the  commercial  development  of  i-line  li¬ 
thography  (365  nm),  attention  shifted  to  the  cw  Ar++  lines  at  364  and  352  nm  and  to  the  355-nm 
third-harmonic  of  a  YAG  laser  to  take  advantage  of  commercial  resist  developments.  As  leading- 
edge  commercial  lithography  shifted  to  KrF  (248  nm)  wavelengths,  the  intracavity  doubled  Ar- 
ion  line  at  244  nm  was  used.  Today,  the  IC  industry  is  dominantly  using  ArF  excimer  lasers  (193 
nm)  and  work  has  been  reported  both  with  fifth-harmonic  YAG  (213  nm)  and  with  line-narrowed 
ArF  lasers  at  193  nm.  The  very  small  bandwidth  tolerance  of  the  high  resolution  lenses  used  in 
modem  lithography  has  forced  the  development  of  line-narrowed  excimer  lasers  with  sub-pm 
bandwidth.  These  lasers  have  longitudinal  coherence  lengths  of  several  cm  and  are  more  than 
adequate  for  interferometric  application. 


II.  1  Two  beam  interferometric  lithography 

The  basic  concept  of  two-beam  interference  is  shown  schematically  in  Fig.  1 .  Two  coherent  opti¬ 
cal  plane  waves  with  the  same  polarization  are  symmetrically  incident  on  a  photosensitive  layer 
coated  onto  a  substrate  at  angles  of  ±9.  The  intensity  pattern  in  space  is  simply  given  by: 


I(x,z)  =  L +  e2e-ikxsiae\ 


=  |£f[2  +  (e,  •  e2ei2kxsind  +  e{  ■  e2 cT2")] 


=  2\Ef  [l  +  <ypol  cos(2Axsin#)] 
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Fig.  1:  Schematic  representation  of  two  beam  interference. 
Coherent  laser  beams  are  symmetrically  incident  from  the 
left  and  the  right.  The  period  of  the  interference  pattern  is 
Xl[2nsm0\  =  X/2NA.  The  standing  wave  pattern  exists 
throughout  the  overlap  between  the  beams  as  long  as  this 
overlap  distance  is  shorter  than  the  longitudinal  coherence 
length  of  the  laser  beams  and  the  wafer  can  be  placed 
anywhere  inside  this  coherence  volume. 


where  k  =  2nn/A  with  n  the  refractive  in¬ 
dex  of  the  medium  (1  for  air)  and  X  the 
optical  wavelength  and  et  is  the  unit  po¬ 
larization  vector  for  each  wave.  The  period 
of  the  interference  pattern  along  the  x- 
direction  is  T/2nsin<9and  crpoi  =  [1, 
cos(26)]  for  (TE,  TM)  polarization.  By 
Snell’s  law  this  period  is  invariant  across 
any  0  =  tt/2  layer  boundaries  such  as  the 
lithography  stack;  this  is  not  true  for  the 
contrast.  TE  polarization  always  gives  the 
maximum  contrast,  while  the  contrast  is 
reduced  for  TM  polarization  as  a  result  of 
the  7T-phase  shift  between  the  ex  and  ez 
contributions  to  the  interference  intensity. 
For  evaluating  the  polarization  depend¬ 
ence,  it  is  important  to  use  the  propagation 


angles  inside  the  photosensitive  layer.  The 
contrast  reduction  for  TM  polarization  be¬ 
comes  significant  for  steep  angles  in  the 
resist  and  will  be  discussed  in  detail  in  the 
immersion  discussion  below.  For  now  take 
w  =  1;  it  is  relatively  easy  experimentally  to 
reach  sin#  ~  1  (for  example  at  a  75°  angle 
of  incidence,  sin<9=  0.97  so  the  limiting  pe¬ 
riod  of  IL  is  >  2J2  and  extends  below  100 

nm  at  a  wavelength  of  193  nm. 

Two  experimental  arrangements  for  IL 
are  shown  in  Fig.  2.  The  top  set-up  (a)  is  a 
simple  comer  cube  arrangement  (Fresnel 
mirror)  where  the  right  and  left  halves  of 
the  beam  are  folded  onto  each  other  using  a 
90°  geometry.  This  arrangement  has  been 
used  quite  successfully  with  laser  sources 
with  a  high  transverse  coherence  such  as 
single-mode  TEMoo  Ar-ion  laser  beams. 

Expanding  the  beam  allows  quite  large  ar¬ 
eas  -especially  at  periods  corresponding  to 
incidence  angles  #of  ~  45°  where  the  sizes 
of  the  beams  incident  on  the  sample  and  the 
mirror  are  equal.  At  extremely  large  or 
small  periods,  the  available  area  becomes 
limited  by  the  required  size  of  the  folding 
mirror  and  alternative  experimental  con¬ 
figurations  are  preferable. 

Excimer  lasers  exhibit  a  large  number 
of  transverse  modes  and  consequently  a  very  low  transverse  coherence.  The  experimental  ar¬ 
rangement  in  Fig.  2a  is  not  suitable  in  this  case,  since  the  two  halves  of  the  beam  are  not,  in  gen¬ 
eral,  mutually  coherent.  The  interferometer  arrangement  shown  in  Fig.  2b  resolves  this  problem 
by  splitting  the  input  beam  in  half  and  folding  it  onto  itself  as  indicated  by  the  various  line  styles 
in  the  figure;  to  maintain  contrast,  any  offset  has  to  be  within  the  transverse  coherence  length. 
There  remains  the  requirement  of  a  significant  longitudinal  coherence  (narrow  spectral  width) 
since  the  distances  from  the  beamsplitter  to  the  image  plane  vary  for  the  two  beams  across  the 
field.  Fortunately,  today’s  very  narrow  linewidth  lithography-optimized  excimer  lasers  have 
linewidths  «  1  pm  (longitudinal  coherence  length  ~  X  /AX  >  4  cm,  making  large-area  (at  least 
several  cm  )  IL  straightforward. 

Diffractive  beam  splitters  offer  an  achromatic  interferometric  solution  that  has  been  used 
with  larger  linewidth  lasers,  for  example  free-running  excimer  lasers  without  line  narrowing,  to 
provide  large  area  patterns  [7],  [8],  The  basic  idea  is  to  maintain  both  the  angular  relationship  of 
the  two  beams  that  gives  the  interferometric  pattern  and  also  equal  path  lengths  for  the  two 
beams  across  the  pattern  area  to  provide  the  necessary  self-coherence.  A  grating  beam  splitter 
provides  the  necessary  capability  to  meet  both  of  these  requirements.  Generally  a  phase  grating 
is  used  optimized  to  result  in  maximum  coupling  into  the  ±1  diffraction  orders.  These  two  orders 


are  then  recombined  either  with  a  second 
grating  or  with  a  pair  of  mirrors  to  provide 
the  final  aerial  image  pattern. 


II.  2  Immersion  interferometric  lithography 

Recently,  there  has  been  resurgence  of 

interest  in  immersion  lithography,  the  use 

of  a  suitable  fluid  between  the  imaging 

lens  and  the  wafer  [9]-[12],  This  allows  an 

improvement  in  the  optical  resolution  by  a 

T,.  „  .  factor  of  the  refractive  index  of  the  im- 

Fig.  3:  45-nm  halt-pitch  pattern  in  photoresist.  (Resist  sup-  .  ...  .  .  .  ,  .  ,  ,.  . 

r  au  a  v  ci  *  •  */*  •  i  i  mersion  fluid,  provided  that  the  lens  is 
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suitably  modified.  At  193  nm,  water  is  a 
very  attractive  immersion  fluid  with  a  refractive  index  of  1.44  at  193  nm  [13].  The  simple  addi¬ 
tion  of  a  prism  to  the  interferometric  arrangement  of  Fig.  2b  allows  investigation  of  immersion 
interferometric  lithography.  In  the  simplest  configuration,  a  drop  of  water  is  placed  on  the  wafer 
and  the  wafer  and  water  are  raised  till  the  water  contacts  the  uncoated  prism  and  forms  a  high 
quality  optical  component.  More  elaborate  automatic  fill  and  dry  systems  will  be  used  in  manu¬ 
facturing  applications. 

An  example  of  a  45-nm  half-pitch  immersion  IL  pattern  is  shown  in  Fig.  3.  After  the  im¬ 
mersion  exposure  the  sample  was  baked  to  activate  the  chemically  amplified  resist  and  devel¬ 
oped  in  a  standard  basic  solution.  More  details  are  provided  elsewhere  [14],  [15]. 

Polarization  is  a  very  important  parameter  at  the  steep  angles  in  the  resist  enabled  by  im¬ 
mersion  [16].  For  TE  polarization,  as  indicated  by  Eq.  (1),  the  fringe  visibility  [( [Imax  -  Imi„)/(Imax  + 
Imin )]  is  unity  independent  of  the  half  pitch.  However,  for  TM  polarization,  the  fringe  visibility 
decreases  as  the  half-pitch  is  decreased  as  a  result  of  competition  between  the  out-of-phase  Ex2 
and  Ez2  contributions  to  the  intensity.  At  an  interference  angle  of  45°  in  the  resist,  these  two 
terms  are  equal  and  opposite  in  phase  and  the  visibility  goes  to  zero.  For  larger  interference  an¬ 
gles,  the  line-space  pattern  is  inverted.  Fig.  4  shows  the  simple  calculation  of  the  fringe  visibility 
for  the  parameters  indicated.  The  cusp  at  ~  40  nm  corresponds  to  the  inversion  of  the  pattern  - 
the  interchange  of  lines  and  spaces.  The  minimum  half-pitch  available  with  a  193  nm  source  and 
i  water  as  the  immersion  liquid  is  33.5  nm; 

'  1  '  1  '  '  1  '  '  1  '  '  '  if  a  higher  index  liquid  becomes  avail- 

Si°2 limit  able,  the  glass  of  the  lens  sets  the  limit  at 
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Fig.  4:  Aerial  image  contrast  in  photoresist  as  a  function  of 
half-pitch. 


II.  3  Multiple-Exposure  interference  -  Com¬ 
plex  Two  Dimensional  Patterns 

Simple  two-beam  interference  produces 
an  array  of  uniformly  spaced  parallel 
lines  covering  the  exposure  area,  e.  g.  a 


simple  grating.  By  combining  multiple 
such  gratings,  structures  of  significant 
complexity  can  be  constructed  [17]. 

Fourier  analysis  tells  us  that  any  pat¬ 
tern  consists  of  a  summation  of  peri¬ 
odic  patterns.  A  seemingly  small,  but 
important,  difference  between  IL  and  a 
Fourier  expansion  is  the  DC  level  that 
accompanies  each  exposure,  cf.  Eq. 

(1),  since  the  exposure  intensity  must 
be  positive  for  all  positions.  The  con¬ 
sequence  is  that  the  DC  level  increases 
with  the  number  of  exposures  and  for  a 
large  number  of  exposures,  the  result  is 
a  perfect  aerial  image  of  the  desired 
pattern  riding  atop  a  large,  uniform 
exposure  level.  This  loss  of  contrast  in 
the  image  is  an  inevitable  consequence 
of  summing  incoherently  related  im¬ 
ages,  and  is  the  same  reason  that  the 
contrast  is  lower  in  incoherent  imaging  than  it  is  in  coherent  imaging  [18].  Today’s  photoresists 
are  quite  nonlinear,  giving  an  approximately  sigmoidal  response  such  that  every  location  above 
(below)  a  certain  threshold  is  removed  (retained)  on  development.  To  a  point,  this  nonlinear  re¬ 
sponse  can  ameliorate  this  uniform  exposure  issue  and  restore  the  pattern.  However,  eventually 
the  uniform  exposure  becomes  too  large  relative  to  the  image  information  and  pattern  details  are 
lost. 

Notwithstanding  this  complexity  limitation,  there  are  many  useful  structures  readily  avail¬ 
able  with  multiple  exposure  techniques.  The  simplest,  and  perhaps  the  most  useful,  is  the  square 
array  2D  pattern  obtained  by  carrying  out  one  IL  exposure,  rotating  the  sample  by  90°,  and  per¬ 
forming  a  second  exposure.  For  low  exposure  doses  in  a  positive  photoresist,  such  that  neither 
exposure  alone  is  sufficient  to  clear  the  resist  during  developing,  the  result  is  a  2D  array  of  holes; 
at  higher  exposure  doses  where  each  exposure  clears,  the  result  is  a  2D  array  of  posts.  For  a 
negative-tone  resist,  the  occurrence  of  holes  and  posts  are  reversed.  Details  of  the  pattern,  such 
as  the  sidewall  angle,  are  sensitive  to  the  exposure  dose  and  need  to  be  optimized  in  any  applica¬ 
tion  [19]. 

Another  simple,  but  important,  application  of  multiple  exposures  is  to  reduce  the  density  of 
a  pattern  while  retaining  the  small  size  capability  that  is  inherent  in  small-pitch  patterns.  Again, 
the  idea  is  quite  simple,  expose  a  first  pattern  at  a  period  p\  =  p  with,  for  example,  equal 
line:  space  dimensions  and  a  critical  dimension  (CD)  or  pattern  size  of  pi  2,  then  expose  the  same 
area  again  at  a  period  of  po  =  p/2.  For  a  positive  photoresist,  the  result  is  a  pattern  with  a  period 
of  p  and  a  CD  of  p/4.  Two  constraints  that  have  to  be  achieved  for  this  approach  to  be  successful 
are  matching  of  the  periods  and  adjusting  the  alignment  between  the  two  patterns.  The  linear  di¬ 
mension  of  the  final  pattern  L,  perpendicular  to  the  lines  over  which  the  patterns  are  in  phase, 
depends  critically  on  the  deviation  of  the  two  periods  5=  2pi-p\  and  is  given  by  L~  pxp2/ 5  . 
Fortunately,  by  using  diffraction  from  the  latent  image  of  the  first  exposure  (or  from  a  predefined 
grating  pattern  alignment  mark)  it  is  possible  to  derive  a  feedback  signal  to  allow  setting  both  the 
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Fig.  5:  Experimental  configuration  for  projection  moire 
alignment.  The  two  beams  are  incident  on  the  photoresist 
image  produced  in  the  first  exposure,  or  on  a  predefined  grat¬ 
ing  pattern.  The  angles  are  adjusted  so  that  there  is  only  a 
single  fringe  across  the  illuminated  area  (sets  the  period),  and 
the  sample  translation  is  adjusted  so  that  the  fringe  is  at 
maximum  brightness  (sets  the  phase). 


frequency  and  the  phase  of  the  second  exposure  with  the  necessary  precision  to  cover  a  large 
area,  as  is  illustrated  in  Fig.  5  [20].  The  basic  concept  is  to  monitor  the  interference  between  two 
diffracted  beams,  one  from  each  input  beam  over  across  the  exposure  area.  This  can  either  be 
done  at  low  intensity  over  the  entire  area,  or  at  higher  intensity  with  the  sacrifice  of  a  small  por¬ 
tion  of  the  final  pattern  using  large  area  masks.  The  goal  is  to  set  the  input  angles  on  the  second 
exposure  so  that  a  single  fringe  of  the  interference  pattern  extends  over  the  entire  exposure  area 
(this  sets  the  period,  e.  g.  S— >  0)  and  then  to  adjust  the  phase  of  that  pattern  (light  or  dark  -  this 
sets  the  relative  phase  of  the  two  patterns).  As  a  result  of  the  moire  spatial  amplification  of  the 
pitch  differential  (small  pitch  mismatches  translate  into  large,  easily  measured,  distances)  this 
alignment  technique  has  general  applicability  across  many  applications  in  nanoscale  fabrication. 


II.4  Multiple-beam  interference  -  2D  and  3D  Patterns 

Adding  more  input  beams  to  an  IL  exposure  is  another  direction  toward  increasing  the 
flexibility  of  the  technique  and  the  complexity  of  the  patterns  that  can  be  achieved.  The  simplest 
example  is  to  use  four  beams,  two  oriented  at  ±6  in  the  x- direction  and  two  at  ±0  in  the  y- 
direction.  If  orthogonal  polarizations  are  used  for  the  two  directions,  this  is  identically  equivalent 
to  the  process  for  defining  a  square  array  in  the  previous  section,  with  higher  throughput  since 
the  two  exposures  are  simultaneous  rather  than  sequential  and  without  the  need  for  a  rotation  of 
the  wafer. 

If  all  of  the  interfering  beams  are  inclined  at  the  same  angle  to  the  surface  normal,  the  result¬ 
ing  aerial  image  is  again  independent  of  the  position  of  the  wafer,  however,  if  the  beams  are  at 
different  angles,  the  aerial  image  is  no  longer  invariant  along  the  z-direction  and  the  placement  of 
the  wafer  becomes  important.  For  a  small  number  of  beams,  the  pattern  is  periodic  along  z,  as  the 
number  of  beams  increases  to  a  continuum,  the  periodicity  along  z  tends  towards  infinity  and  the 
familiar  result  of  a  single  focal  plane  for  an  imaging  system  results.  A  simple,  but  useful,  exam¬ 
ple  is  the  addition  of  a  normal  incidence  beam  along  with  the  two  offset  beams  of  two-beam  in¬ 
terference.  Assuming  TE  polarization,  the  straightforward  generalization  of  Eq.  1  is: 


l{x,z) 


j  /Tp  jg-^Z  |  g-ikx sin 0  ^^-ikz cos#  ^ 

E\ ’  [3  +  4  cos(kx  sin  9)  cos[fe(l  -  cos  #)]  +  2  cos(2 kx  sin  #)] 


(2) 


For  z-positions  where  the  second  term  vanishes,  the  aerial  image  is  very  similar  to  that  of  two- 
beam  interference  with  an  additional  DC  bias.  However,  at  z-positions,  such  as  z  =  0,  where  the 
second  term  is  at  a  maximum,  only  alternate  lines  survive.  The  result  is  a  set  of  lines  for  a  ID 
exposure  with  a  linewidth  comparable  to  the  dense  pattern,  but  with  a  twice  the  separation,  e.g.  a 
sparse  array.  For  some  applications,  for  example  for  field-emitter  tips  for  displays,  there  are  im¬ 
portant  advantages  to  the  sparse  pattern  [19].  Fig.  6  shows  the  aerial  images  for  (a)  a  two-beam, 
interference  at  a  period  of  2 p  from  Eq.  (1);  (b)  the  three-beam  interference  for  z  =  0  (spaced  at  2 p 
but  with  a  linewidth  comparable  to  p/2),  and  (c)  the  three-beam  interference  where  the  second 
term  vanishes  (period  of  p).  The  smaller  pattern  feature  size  for  the  three  beam  exposure  is  evi¬ 
dent. 


The  variation  in  the  z-direction  also  of¬ 
fers  the  possibility  of  patterning  a  full  3D 
pattern  in  a  single  exposure  or  set  of  expo¬ 
sures.  This  possibility  will  be  discussed  in 
more  detail  in  Sect.  IV. 4.1  as  part  of  the 
discussion  of  photonic  crystals. 

II.  5  Mix-and-Match  Approach  to  Complex 

Structures 

Another  straightforward  strategy  is 
to  use  IL  to  provide  large-area  periodic 
patterns  and  then  to  use  another  litho¬ 
graphic  technique  to  add  either  a  small 
area  customization  (for  example  a  defect 
in  a  photonic  crystal)  or  large  area  aperi¬ 
odic  structures  (for  example  contact  pads 
and  integration  with  standard  microfabri¬ 
cation.  In  a  preliminary  investigation  of 
this  approach,  narrow  lines  (~  200  nm) 
produced  by  IL  were  placed  atop  a  sparse 
array  of  much  larger  lines  (2  pm)  pro¬ 
duced  by  conventional  lithography  for 
electrical  test  structures  [21].  Fig.  7  shows 
the  result.  Fig.  7(a)  shows  a  SEM  of  the 
final  structure,  the  narrow  line  produced 
by  IL  is  shown  atop  the  wider  pad  defined 
by  optical  lithography,  the  next  line  over 
misses  the  pad  and  has  no  effect  on  the 
measurement.  Fig.  7(b)  shows  the  varia¬ 
tion  of  the  alignment  across  the  1-cm  wide 


Position 


Fig.  6:  Aerial  image  intensity:  (a)  for  a  two-beam  exposure  at 
twice  the  period  and  (b)  and  (c)  at  two  positions  in  z  for  a 
three-beam  exposure.  The  50%  linewidths  are  indicated.  The 
three  beam  exposure  offers  a  smaller  linewidth  with  the  same 
period  as  the  two-beam  exposure. 


die.  There  is  a  period  mismatch  along  with  random  errors.  The  period  mismatch,  which  corre¬ 


sponds  to  a  Ap/p 


could  be  eliminated  with  an  adjustment  of  the  IL  pitch,  the  random 


errors,  with  a  3a  of  only  ~  140  nm,  likely  arise  from  placement  errors  in  the  optical  lithography 


mask. 


Fig.  8  shows  another  mix-and-match  combination  of  optical  and  interferometric  litho¬ 
graphies.  This  was  an  alternative  approach  to  address  the  sparse  pattern  issue  discussed  above  in 
the  context  of  field-emitter  displays  [19].  For  this  integration  process,  a  two  lithography- step  se¬ 
quence  was  developed  using  a  sacrificial  silicon  dioxide  layer.  A  first  lithography  step,  either 
optical,  OL,  as  in  (a)  or  interferometric,  IL,  as  in  (b)  was  carried  out  and  the  results  were  etched 
into  the  sacrificial  layer.  Then,  a  second  lithography  step,  using  the  other  lithography  technique 
was  carried  out  to  isolate  every  other  hole  in  the  pattern.  Alignment  was  carried  out  with  a  grat¬ 
ing  surrounding  the  pattern.  In  (a)  a  period  and  phase  adjustment  very  similar  to  that  described 
above  was  carried  out.  In  (b)  the  alignment  was  performed  using  standard,  pre-printed  alignment 
marks. 
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Nonlinear  process  in  lithography 
spatial  frequency  multiplication 

As  noted  in  the  introduction,  the  spa¬ 
tial  frequencies  of  the  developed  pattern 
can  extend  well  beyond  the  2/A,  bandpass 
limits  of  optics  as  a  result  of  nonlinear 
processes  in  the  exposure,  development 
and  processing  -  e.g.  resulting  in  trans¬ 
forming  a  sinusoidal  aerial  image  to  a 
square  wave  developed  resist  pattern. 
Modern  resists  are  engineered  to  be 
highly  nonlinear  -  that  is  to  exhibit  a 
sharp  threshold  between  exposed  and  un¬ 
exposed  regions.  Indeed,  this  is  an  all 
important  characteristic  that  has  allowed 
the  development  of  many  of  the  resolu¬ 
tion  enhancement  techniques  such  as  op¬ 
tical  proximity  correction  and  phase  shift 
masks  that  are  routinely  employed  by  the 
integrated  circuit  industry. 

These  nonlinear  concepts  are  likely 
to  extend  the  reach  of  optical  lithography 
by  another  factor  of  at  least  two,  or  to 
scales  of  ~  20-  to  10-nm  with  a  193-nm 
source.  Further  similar  extensions  by  in¬ 
tegral  factors,  3,  4,  ...  are  possible  but 
will  be  increasingly  difficult  as  a  result  of 
process,  uniformity  and  edge  roughness 
issues. 

In  the  previous  section,  we  showed 
that  the  optical  system  band  pass  limits 
the  one-dimensional  pattern  density  in  a 
single  exposure  to  ~  AINA,  while  the 
nonlinearities  in  the  processing  result  in 
higher  harmonics  extending  in  spatial 
frequency  well  beyond  the  diffraction 
limits  of  Eq.  1 .  These  nonlinearities  can 
be  used  to  extend  the  available  pattern 
density  [22],  leading  to  the  claim  that 
“there  is  no  fundamental  limit  to  optical 
lithography  -  there  are  only  process  con¬ 
trol,  uniformity  and  surface  roughness  issues  [1].”  The  key  is  to  apply  the  nonlinearities  between 
exposure  steps  and  combine  images  in  such  as  way  as  to  reduce  or  eliminate  the  lower  frequency 
components,  leaving  the  higher  density  of  structures.  This  is  variously  know  as  spatial  period 
division  or  spatial  frequency  multiplication  depending  on  whether  the  focus  is  on  real  space  (di¬ 
vision)  or  frequency  space  (multiplication). 
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Fig.  7:  Demonstration  of  the  integration  of  OL  and  IL  for  an 
electrical  test  structure  pattern,  (a)  SEM  of  the  placement  of  a 
narrow  line  defined  by  IL  (200  nm  on  a  2  pm  pitch)  atop  a 
wider  electrical  test  structure  pattern  defined  in  a  separate  OL 
step,  (b)  Variation  of  the  placement  across  a  1-cm  wide  die. 
The  variation  consists  of  a  slight  misadjustment  of  the  period 
(~  6><10"5,  dashed  line  in  the  fig.)  and  a  random  error  shown 
in  the  bottom  part  of  the  figure  with  a  3a  of  140  nm. 


A  one-dimensional  example  of  a 
spatial  period  division  is  shown  in  Fig.  9 
to  illustrate  the  general  concept.  The  top 
panel  Fig.  9a  shows  a  photoresist  pattern 
atop  a  sacrificial  layer  on  a  substrate.  The 
period  of  the  pattern  is  limited  by  the  op¬ 
tics  to  X/2NA;  the  CD,  in  contrast,  is  not 
limited  by  the  optics  and  is  chosen  to  be 
less  than  URN  A.  This  does  not  violate 
any  optical  limits  and  is,  indeed,  common 
practice  in  integrated  circuit  manufactur¬ 
ing  where  the  transistor  gate  length  is  less 
than  '/2  the  pattern  period  (e.g.  50-nm 
gates  for  a  180-nm  period  in  today’s  mi¬ 
croprocessors).  In  Fig.  9b,  the  photoresist 
pattern  has  been  transferred  into  the  sacrificial  layer.  In  Fig.  9c,  a  second  photoresist  layer  has 
been  spun,  exposed  and  developed.  The  period  and  CD  is  the  same  as  in  the  first  case,  but  the 
pattern  has  been  shifted  by  XI ANA  to  interpolate  the  second  pattern  with  the  first.  Finally  this  pat¬ 
tern  is  also  transferred  into  the  sacrificial  layer  (Fig.  9d),  that  layer  is  then  used  as  a  mask  to 
transfer  the  total  pattern  into  the  substrate  layers,  and  finally  the  sacrificial  layer  is  removed  leav¬ 
ing  a  final  pattern  at  a  period  of  XI ANA,  twice  as  dense  as  the  single  exposure  optical  limit. 

The  mathematics  follows  simply: 


Fig.  8:  Sparse  array  of  holes  produced  by  a  combination  of 
optical  and  interferometric  lithographies:  (a)  OL  first,  IL  sec¬ 
ond;  (b)  IL  first,  OL  second.  In  both  cases  the  first  pattern 
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The  result  is  the  combination  of  two  exposures  with  the  nonlinearities  applied  to  each  expo¬ 
sure  independently.  The  symbol  used  to  represent  the  combination  is  left  ambiguous;  in  this  case 
the  combination  is  simple  addition.  In  another  case,  for  example  the  use  of  two  resist  layers  sen¬ 
sitive  at  different  wavelengths,  it  might  be  multiplication.  The  pi  A  phase  shift  of  the  second  ex¬ 
posure  results  in  a  factor  of  (-1  /  in  the  Fourier  coefficients,  with  the  net  result  that  all  of  the  odd 
terms  cancel  leaving  a  sin(x)/x  pattern  at  twice  the  original  pitch  or  half  the  period. 

This  spatial  frequency  multiplication  concept  applies  equally  to  arbitrary  patterns  as  to  the 
periodic  grating  pattern  discussed  here.  The  parsing  of  the  pattern  into  two  masks  becomes  a 
simple  coloring  problem,  with  the  design  rule  that  no  two  features  on  a  mask  can  be  closer  than 
twice  the  final  pitch  (adjusted  for  the  system  magnification).  An  important  difference  between 
the  arbitrary  and  periodic  pattern  cases  is  that  for  a  periodic  pattern  the  aerial  image  contrast  is 
uniform  across  the  pattern  and  all  that  is  necessary  to  ensure  a  uniform  final  pattern  is  to  arrange 
for  a  uniform  illumination;  in  contrast,  for  an  arbitrary  image  the  aerial  image  contrast  inherently 
varies  across  the  pattern  and  additional  measures,  such  as  optical  proximity  correction,  to  create 
a  more  uniform  image  contrast  to  maintain  the  uniformity  of  the  nonlinear  processes  are  neces¬ 
sary. 

In  principle,  this  process  can  be  repeated  multiple  times,  for  integer  spatial  frequency  mul¬ 
tiplications  -  3,  4,  ...  The  alignment  requirements,  the  image  uniformity  requirements,  and  the 


difficulties  of  minimizing  edge  roughness  of  the  resulting  patterns  will  ultimately  provide  a  prac¬ 
tical  limit  [23]. 

This  discussion  has  so  far  omitted  the  important  issue  of  alignment  between  these  multiple 
exposures.  For  the  periodic  pattern  case,  the  image  being  printed  is  a  large  grating,  as  is  well 
known,  the  angular  resolution  of  a  grating  scales  as  the  number  of  lines  in  the  grating  so  that 
alignment  schemes  based  on  moire  interference,  as  discussed  above,  offer  an  approach  to  align¬ 
ment  [24],  Optical  aberrations  and  other  contributions  to  wavefront  distortion  will  introduce  po¬ 
sitional  errors  and  limit  the  ability  to  interpolate  patterns.  These  issues  need  to  be  investigated 
before  the  ultimate  feasibility  of  these  spatial  frequency  multiplication  schemes  can  be  assessed. 

II.  7  Limits  of  interferometric  lithography 

The  semiconductor  industry  maintains  a  15  year  plan,  the  International  Technology  Road¬ 
map  for  Semiconductors  (ITRS)  [25],  that  projects  forward  and  tries  to  match  industry  manufac¬ 
turing  requirements  (scaled  with  a  Moore’s  law  0.7  linear  dimension  shrink  every  three  years) 
with  available  and  projected  technologies.  Table  I  compares  the  industry  requirements  as  defined 
in  the  ITRS  with  the  scales  available  from  interferometric  lithography  and  from  imaging  inter¬ 
ferometric  lithography  (discussed  below). 


Table  1: 11 

fRS  Requirements  and  (Imaging)  Interferometric  Lithography  Capabilii 

ties 

Year  of  Vol. 
Manufac. 

Half-Pitch 

(nm) 

IL  Approach 

IL  Limit 

(nm) 

(periodic) 

IIL  Limit 
(nm) 

(arbitrary) 

2007 

65 

Air  immersion 

48 

64 

2010 

45 

H2O  immersion 

34 

45 

2013 

32 

HI*  immersion 

27 

36 

Air  immersion&  Freq.  Doubling 

24 

32 

2016 

22 

H2O  immersion  &  Freq.  Doubling 

17 

23 

2019 

16 

HI*  immersion  &  Freq.  Doubling 

12 

16 

H2O  immersion  &  Freq.  Tripling 

12 

16 

Gray  squares  are  ITRS  roadmap  requirements. 

*  HI  -  High  Index  immersion  (assuming  both  an  immersion  liquid  index  and  a  glass  index  of  1.8  @  193  nm). 

For  periodic  patterns,  as  used  in  development  of  many  Si  manufacturing  processes,  the  65- 
nm  hp  (half-pitch)  requirements  are  met  with  IL  without  the  need  for  immersion  techniques.  For 
the  45-nm  hp  pitch  node,  water  immersion  is  suitable  as  demonstrated  in  Sect.  II.2  above.  Two 
alternatives  are  indicated  for  reaching  the  32-nm  hp  node:  use  of  a  higher-index  matching  fluid 
(and  a  higher- index  final  glass  element  as  well),  here  assumed  as  an  index  of  1.8,  or  a  frequency¬ 
doubling  step  from  the  65-nm  node.  Of  course,  frequency  doubling  from  the  45-nm  H2O  immer¬ 
sion  result  is  possible  as  well.  Spatial  frequency  doubling  from  the  45-nm  hp  node  will  access 
the  22  nm  node.  Finally,  the  16  nm  half-pitch  node  can  be  accessed  either  by  frequency  doubling 
from  the  high  index  version  of  the  32-nm  node,  or  by  frequency  tripling  from  the  45-nm  hp  H2O 
immersion  node.  This  table  gives  the  very  simply  derived  optical  limits.  As  discussed  above,  is¬ 
sues  of  line-edge  roughness,  alignment,  and  wavefront  distortion  are  significant  and  must  be 
carefully  investigated  to  determine  ultimate  feasibility.  Nonetheless,  this  simple  analysis  is  suffi¬ 
cient  to  justify  the  claim  that  there  are  no  fundamental  limits  to  optical  lithography !  Imaging  is¬ 
sues  (the  right  most  column  in  Table  I)  will  be  briefly  discussed  in  Sect.  III. 
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Fig.  9:  Illustration  of  process  flow  for  spatial  period  division, 
a)  Expose  a  develop  a  line:space  pattern  with  the  line  width  < 
p/4  with  p  the  pitch,  b)  Transfer  this  pattern  into  a  sacrificial 
layer,  c)  Expose  and  develop  a  second  photoresist  layer  with 
the  same  pattern,  but  shifted  by  pt 2.  d)  Transfer  this  pattern 
into  the  sacrificial  layer,  and  e)  Transfer  the  composite  pattern 
into  the  wafer  resulting  in  a  spatial  frequency  doubled  pattern 
at  pt. 2. 


II.  8  Directed  self-assembly 
While  the  above  statement  about  the  op¬ 
tical  limits  of  lithography  is  correct,  the 
issues  associated  with  line-edge  rough¬ 
ness,  photoresist  mechanical  properties, 
alignment,  and  wavefront  flatness  will 
get  continuously  more  difficult.  Self- 
assembly  is  a  very  attractive  alternative 
for  extending  the  spatial  scales  to  sub-nm 
dimensions.  Directed  self-assembly  using 
the  properties  of  nanoscale  particles  and 
of  molecular  moieties  is  an  attractive  al¬ 
ternative  as  the  lithographic  scale  de¬ 
creases  below  ~  20-  to  10-nm  that  both 
addresses  the  need  for  smaller  features 
(extending  to  the  nm  regime  and  below) 
and  retains  the  exquisitely  wide  scale  - 
from  cm  to  nm  of  existing  optical  lithog¬ 
raphy  [26], 

One  example  of  a  directed  self- 
assembly  process  is  shown  in  Fig.  10 
which  shows  ~  50-nm  silica  particles  as¬ 
sembled  into  trenches  etched  into  a  Si 
substrate  [27],  [28].  This  is  a  very  flexi¬ 
ble  process  that  can  be  used  to  produce 
nanochannels  with  controlled  porosity 
and  even  multilevel  structures  as  shown 
in  Fig.  11. 

Ill  IMAGING-INTERFEROMETRIC 
LITHOGRAPHY 


Optical  lithography  is  the  only  technol¬ 
ogy  that  has  been  used  for  volume  manu¬ 
facturing  throughout  the  history  of  the 
integrated  circuit.  The  parallel  nature  of 
lithography  in  which  an  entire  image  is 
transferred  in  a  single  exposure  is  the  primary  reason  for  this  dominance.  It  now  seems  apparent 
that  optical  lithography  -  with  the  extension  of  water  immersion  -  will  continue  as  the  primary 
manufacturing  technology  at  least  through  the  45 -nm  half-pitch  node. 

Arbitrary  images,  in  principle  containing  as  many  spatial  frequencies  as  there  are  pixels  in 
the  image,  are  significantly  more  difficult  to  produce  than  the  simple,  single  spatial  frequency 
periodic  images  of  IL.  The  scale  of  features  produced  is  smaller  than  the  resolution  limits  of  tra¬ 
ditional  imaging,  and  many  very  sophisticated  techniques  have  been  introduced  to  extend  the 
range  of  feature  sizes  that  can  be  produced.  Optical  proximity  correction,  adding  additional  sub¬ 
resolution  features  to  the  mask  to  impact  the  diffraction  limited  image,  off-axis  illumination,  and 
phase  shift  masks  have  been  extensively  discussed  in  the  literature  and  are  not  covered  here  [29]- 

[31]- 


Imaging  interferometric  lithography 
(IIL)[32]-[34]  is  an  approach  to  resolu¬ 
tion  enhancement  that  includes  both  ex¬ 
treme  off-axis  illumination  (tilt  of  a  co¬ 
herent  illumination  source  to  the  edge  of 
the  image  pupil  plane)  with  spatial  filters 
to  assure  uniform  coverage  of  frequency 
space.  This  ensures  that  the  maximum 
spatial  frequency  available  to  the  optical 
system  is  accessed  (corresponding  to  the 
interference  between  coherent  beams 
originating  across  the  diameter  of  the  pu¬ 
pil).  In  order  to  print  these  frequencies  in 
both  the  x-  and  y-spatial  directions  of  the 
image,  both  the  offset  direction  and  the 
pupil  filter  have  to  be  rotated  by  90°.  Fi¬ 
nally,  to  restore  a  telecentricity  to  the  im¬ 
age  so  that  deviations  from  precise  focus 
can  be  tolerated,  additional  exposures 
with  illumination  from  the  opposite  sides 
of  the  pupil  are  required.  This  can  all  be 
automated  within  the  tool  so  that  expo¬ 
sure  times  are  not  impacted  by  these  mul¬ 
tiple  exposures.  An  important  advantage 
of  IIL  for  hyper-AT  immersion  exposures 
is  that  the  polarization  of  the  source  can 
be  adjusted  at  each  exposure  to  ensure 
that  the  optimal  (TE)  polarization  is  used 
for  both  the  x-  and  y-offsets.  This  is  more  difficult  to  accommodate  with  phase-shift  mask  ap¬ 
proaches  to  accessing  the  limits  of  image  frequency  space. 

Both  OPC  and  PSM  techniques  significantly  complicate  the  mask  and  increase  mask  costs 
that  are  becoming  comparable  to  or  even  larger  than  tool  costs  in  a  cost-of-ownership  calcula¬ 
tion.  IIL  uses  simpler  binary  masks  and  minimizes  the  need  for  OPC,  this  can  be  an  important 
cost  savings  -  particularly  for  small  lots  as  in  ASIC  production  or  during  initial  product  devel¬ 
opment  cycles  with  frequent  mask  replacements. 

The  resolution  limits  for  arbitrary  patterns  are  not  as  small  as  for  periodic  patterns  as  a  result 
of  the  need  to  capture  the  pattern  information  in  the  sidebands  around  the  fundamental  half-pitch 
frequency  [35].  This  reduction  in  resolution  is  pattern  dependent,  a  factor  of  13%  reduction  was 
assumed  for  the  final  column  of  Table  I.  Fig.  12  shows  simulations  for  some  of  the  nodes  in  Ta¬ 
ble  I.  For  the  65-  and  45-nm  hp  node,  the  fidelity  of  the  image  is  quite  good  [36].  For  the  nodes 
involving  spatial  frequency  multiplication,  the  variation  of  the  contrast  across  the  image  causes 
more  distortion  of  the  pattern.  This  could  be  addressed  with  additional  OPC  features;  some  rela¬ 
tively  minor  OPC  has  already  been  applied  for  the  modeling  of  the  22-nm  hp  node  in  Fig.  12 
[37].  The  important  result  is  that  there  is  a  path  for  extending  193-nm  based  optical  lithography 
all  the  way  to  the  16  nm  node  with  relatively  small  changes  to  current  optical  lithography.  Inves¬ 
tigations  are  currently  underway  towards  a  number  of  less  evolutionary  alternatives:  EUV;  mask- 


Fig.  10:  Examples  of  directed  self-assembly  of  ~  50-nm  di¬ 
ameter  silica  particles  into  patterns  defined  in  an  Si02  layer. 
The  particles  are  deposited  by  spin-coating  a  colloidal  sus¬ 
pension  of  particles  onto  an  IL  patterned  wafer.  Both  ID  and 
2D  patterns  are  shown.  The  perfection  of  the  deposition  is 
impacted  by  the  particle  size  distribution. 


less;  and  nanoimprint  lithographies.  Ul¬ 
timately,  the  manufacturing  decision  will 
be  based  on  economics  -  the  most  cost 
effective  solution  -  including  all  factors 
such  as  throughput  and  yield  -  will  be 
adopted  by  the  industry. 


IV  APPLICATIONS  OF  INTERFER¬ 
OMETRIC  LITHOGRAPHY 

Periodic  patterns  are  suitable  -  and  even 
desirable  -  for  many  experimental 
nanoscience  investigations.  The  obvious 
example  is  a  photonic  crystal  for  which 
periodicity  is  the  primary  attribute.  In 
many  cases  an  important  requirement  is 
on  the  overall  sample  area.  For  example, 
for  the  study  of  nanoscale  epitaxial  crys¬ 
tal  growth,  it  is  critical  to  have  large  area 
samples  for  detailed  characterization, 
since  many  semiconductor  material 
probes  require  large-area  samples. 

Interferometric  lithography  provides 
a  unique  capability  for  the  inexpensive 
fabrication  of  large-area  samples  with 
nanoscale  features  for  a  wide  variety  of 
nanoscience  investigations.  Because  IL  is 
maskless,  it  is  a  simple  matter  to  change 
periodicities,  pattern  symmetries, 
line:space  ratios  and  other  pattern  details. 
These  periodic  patterns  find  extensive 
applicability  in  nanoscale  epitaxial 
growth,  in  investigations  of  nanofluidics 
and  of  interaction  of  biomolecular  spe¬ 
cies  at  the  nanoscale,  in  nanomagnetics, 
and  in  nanophotonics  among  others. 
Space  does  not  permit  a  full  recitation  of 
each  of  these  areas.  A  brief  overview  of 
each  of  these  areas  is  presented  to  illustrate  the  broad  range  of  investigations  enabled  by  interfer¬ 
ometric  lithography. 


Fig.  1 1 :  Multilevel  directed  self-assembly  of  silica  nanoparti¬ 
cles  to  form  multilevel  nanofluidic  channels. 


IV.  1  Nanoscale  Epitaxial  Growth 

Epitaxial  growth  of  semiconductors  continues  to  have  an  enormous  impact  on  the  development 
of  information  technology.  The  traditional  paradigm  of  epitaxial  crystal  growth  begins  with  a 
nearly  perfect  crystalline  template  (the  substrate)  upon  which  new  materials  are  deposited  in 
atomic  registry  with  exquisite  atomic  level  control  in  the  growth  direction,  and  with  no  control 
whatsoever  in  the  transverse  dimensions.  When  materials  with  a  lattice  mismatch  of  more  than 
~0.1%  are  combined,  a  large  strain  energy  builds  up  and  a  prohibitively  high  defect  density  re- 
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suits.  The  situation  is  entirely  different  if 
the  links  between  the  two  materials  are  in 
the  form  of  an  array  of  “bridges”  of  nano¬ 
scale  dimensions  [38],  [39].  In  this  case 
3D  strain  mechanisms  are  enabled  that  are 
forbidden  in  traditional,  large-area  epitaxy 
and  the  lattice  mismatch  strain  can  be  lo¬ 
calized  to  the  growth  interface.  Further¬ 
more,  during  nanoscale  growth  there  are 
significant  modifications  to  fundamental 
kinetics  and  energetics  of  the  crystal 
growth  that  lead  to  new  growth  modalities 
that  are  not  available  in  large  area  growth, 
and  to  the  nucleation  and  propagation  of 
defects  that  allow  defects  to  be  localized  at 
the  growth  interface.  The  dimensionless 
parameters  that  operate  in  nanoscale 
growth  are  the  ratio  of  the  linear  pattern 
dimension  to  an  adatom  diffusion  length 
and  to  the  critical  thickness  for  the  genera¬ 
tion  of  defects  in  heterostructure  growth. 

Patterning  can  involve  masking  of 
the  growth  surface  with  a  second  usually 
amorphous  material  such  as  Si02,  or  the 
exposing  of  different  crystal  faces  on  the 
substrate  with  selectivity  coming  from 
preferential  growth  on  one  class  of  facets. 
An  example  of  growth  through  a  Si02 
mask  is  shown  in  Fig.  13  that  shows  a  se¬ 
quence  of  results  for  the  growth  of  InGaAs 
on  GaAs  [40],  Panel  (a)  shows  the  starting 
substrate  and  45-nm  thick  SiCE  growth 
mask  patterned  at  a  pitch  of  355  nm.  Panel 
(b)  is  a  SEM  surface  after  growth  of  a  50- 
nm  GaAs  buffer  layer  and  a  200-nm 
In0.06Ga0.94As  layer.  Except  for  the  incom¬ 
pletely  filled  hole,  which  was  a  rare  occur¬ 
rence  across  the  2  cm2  sample,  the  surface 
is  remarkably  smooth  and  featureless 
showing  good  coalescence  between  the 
isolated  growth  areas.  Panel  (c)  shows  a 
low  resolution  cross  section  TEM.  The 
SiC>2  mask,  the  GaAs  buffer  layer  growth 
and  the  InGaAs  growth  are  clearly  seen. 
There  are  no  evident  long  range  defects  such  as  threading  dislocations  that  would  be  expected  for 
blanket  growth,  even  though  the  InGaAs  layer  thickness  is  three  times  the  critical  thickness  for 
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Fig.  12:  Simulation  of  the  upcoming  ITRS  nodes  with  193- 
nm  based  optical  lithography. 


Before  growth  (Si02  pattern) 


this  lattice  mismatch.  Finally,  panel  (d)  is 
a  high  resolution  TEM  of  the  area  within 
the  small  square  in  the  center  of  (c).  The 
various  films  are  clearly  identified  and  no 
extended  defect  is  evident.  Both  x-ray  dif¬ 
fraction  and  photoluminescence  measure¬ 
ments  indicate  that  the  InGaAs  film  is  al¬ 
most  fully  relaxed.  This  confirms  the  ini¬ 
tial  premise  that  nanoscale  3D  growth  pro¬ 
vides  strain-relief  mechanisms  that  are  not 
available  for  traditional  blanket-film 
growth. 

An  example  of  selective  growth  us¬ 
ing  crystal  facets  is  shown  in  Fig.  14 
which  shows  the  growth  of  GaN  on  a 
nanoscale  textured  v-groove  Si  substrate 
that  exhibits  both  (100)  and  {111}  faces 
[41],  [42],  Fig.  14a  shows  the  starting  sub¬ 
strate  with  a  pitch  of  355  nm.  Within  a  pe¬ 
riod,  two  (100)  faces  and  two  equivalent 
{111}  faces  are  exposed.  Fig.  14b  shows 
the  result  of  ~  75  nm  of  GaN  growth.  The 
GaN  has  nucleated  in  the  hexagonal  phase 
on  the  two  {111}  faces  as  show  by  the  opposing  white  arrows  in  the  figure.  There  are  voids 
where  no  material  grows  at  the  centers  of  both  (100)  faces,  clearly  evincing  the  selective  nuclea- 
tion  on  the  {111}  surfaces.  There  is  a  large  density  of  stacking  faults  (fluctuations  between  hex¬ 
agonal  and  cubic  phases)  that  provide  strain  relief  of  the  large  lattice  mismatch  between  Si  and 
GaN.  After  the  growth  proceeds  some  small  amount,  defect-free  hexagonal  GaN  is  obtained  in 
two  separated  nanocrystals  located  on  each  Si  {111}  face.  As  the  growth  proceeds,  these 
nanocrystals  coalesce,  cutting  of  the  material  flow  to  the  bottom  facet  and  leaving  the  void.  Sym¬ 
metry  dictates  that  the  GaN  crystal  cannot  maintain  the  two  opposed  c-axes  and  as  a  result  a  cu¬ 
bic  symmetry  region  is  established,  initially  again  with  a  significant  density  of  localized  stacking 
faults.  Fig.  14c  shows  a  mapping  of  these  regions.  Fig.  14d  is  a  high  resolution  TEM  in  the  cen¬ 
tral  region  clearly  showing  the  stacking  fault  and  cubic  GaN  regions.  This  spatial  phase  separa¬ 
tion  is  a  demonstration  of  the  radically  new  and  unanticipated  growth  modalities  available  with 
nanoscale  growth. 

IV.  2  Nanofluidics 

Microfluidics  has  become  an  important  research  area,  particularly  for  biological  separations  and 
other  investigations.  Very  complex  systems  are  being  investigated  [43],  Nanofluidic  systems  of¬ 
fer  an  exciting  new  domain.  This  results  because  the  Debye  screening  length  in  biologically  rele¬ 
vant  fluids  is  ~  5-  to  50-nm,  thus  in  a  nanofluidic  system  the  screening  from  the  walls  impacts 
the  entire  fluid  volume.  In  contrast  to  the  situation  in  microfluidic  systems  where  the  bulk  of  the 
fluid  volume  is  neutral,  in  nanofluidic  channels  the  entire  fluid  is  charged.  This  makes  it  possible 
to  vary  the  fluid  conditions  locally  along  the  channel  and  also  strongly  modifies  electrokinetic 
transport  since  the  electroosmotic  component  extends  fully  across  the  channel.  In  this  condition 
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Fig.  13:  Nanoscale  epitaxial  growth  of  InGaAs  on  GaAs 
over  a  thin  Si02  mask.  The  top  two  panels  are  SEM  pictures 
showing  the  intial  Si02  mask  (a)  and  the  smooth  featureless 
overgrowth  c)  Low  resolution  TEM  showing  the  overgrown 
mask  and  the  GaAs  buffer  and  InGaAs  overgrowth,  d)  High 
resolution  TEM  showing  the  coherent  interface  between  the 
growths  and  the  absence  of  any  long  range  defects.  This 
InGaAs  layer  is  three  times  the  critical  thickness  for  blanket 
epitaxy. 
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Fig.  14:  GaN  growth  (MOCVD)  on  a  nanoscale  patterned 
{111  }-(100)  textured  Si  surface,  (a)  Shows  the  patterned 
substrate  with  two  opposing  {111}  faces  and  two  (100) 
faces  in  each  355-nm  period,  (b)  Shows  the  result  of  a 
growth  of  ~75  nm  of  GaN.  The  GaN  nucleates  in  the  hex¬ 
agonal  phase  on  the  two  {111}  faces  with  c-axes  directed 
perpendicular  to  the  Si  face,  (c)  Map  showing  the  material 
evolution.  As  the  two  GaN  nanocrystals  come  together, 
symmetry  dictates  that  a  cubic  region  is  formed.  There  are 
regions  of  stacking  faults  both  at  the  Si:GaN  interfaces  and 
again  at  the  hexagonalxubic  interfaces,  (d)  High-resolution 
TEM  of  the  cubic  region. 


the  fluid  flow  is  modified  from  the  usual 
plug  flow  conditions  and  new  phenomena 
are  observed  [44],  [45], 

Fig.  15a  shows  an  integrated  nan¬ 
ofluidic  flow  system  [46],  Integrated  refers 
to  the  multiple  hierarchical  length  scales 
from  mm  for  the  pipette  interfaces  to  ~  100 
pm  for  the  microchannels  that  run  between 
the  pipette  ports  and  the  ~  100-nm  wide 
nanochannels  which  are  ~  1  cm  long.  Thus 
the  fabrication  scales  for  this  chip  extend 
over  a  range  of  1 05 !  The  nanochannels  were 
defined  with  interferometric  lithography, 
etched  into  a  Si  wafer  using  RIE  and  then 
oxidized  to  isolate  the  fluid  from  the  Si  and 
to  further  narrow  the  nanochannel  dimen¬ 
sions  (SiC>2  has  a  ~  40%  larger  volume  than 
the  Si  it  replaces).  The  remainder  of  the 
structure  was  fabricated  with  traditional  op¬ 
tical  lithography  since  there  are  no  small 
features  nor  is  there  a  critical  alignment  be¬ 
tween  the  two  phases  of  the  fabrication. 
This  is  a  good  example  of  the  mix-and- 
match  fabrication  discussed  above. 

IV.  3  Nanoscale  Magnetic  Effects 

There  has  been  substantial  interest  in 
the  magnetic  properties  of  individual  single¬ 
domain  nanomagnetic  structures  as  a  result 
of  their  potential  for  increased  storage  den¬ 
sity  as  compared  with  traditional  multi- 
domain  microstructures  [47],  [48].  Interfer¬ 
ometric  lithography  is  a  natural  technology 
for  the  definition  of  large  areas  of  these  pat¬ 
terned  magnetic  materials  [49]-[52],  Interest 
in  this  area  has  been  stimulated  by  the  de¬ 
velopment  of  lithographically  patterned 
multilayer  elements  including  magnetic 
tunnel  junction  (MTJ)  [53]  and  pseudo¬ 
spin-valve  (PSV)  [54],  [55]  elements  that 
are  key  components  in  high-density  mag¬ 
netic  random  access  memory  (MRAM)  ar¬ 
chitectures. 

Superconducting  films  also  show  inter¬ 
esting  phenomena  in  the  presence  of  a  peri¬ 
odic  perturbation  such  as  an  array  of  holes. 


The  holes  act  as  pinning  sites  for  the  quan¬ 
tized  magnetic  flux  lines  associated  with  a 
superconducting  film.  At  low  magnetic 
fields,  the  flux  lines  are  all  associated  with 
these  pinning  sites  and  the  magnetization 
shows  distinctive  structure  when  the  flux 
density  is  commensurate  with  the  density 
of  the  hole  array  or  is  an  integral  multiple 
of  the  hole  density  so  that  the  flux  lattice 
forms  a  superstructure  locked  to  the  hole 
array  [56],  For  a  large-area  sample,  mis¬ 
matches  of  the  crystal  structure  between 
grains  nucleated  in  different  regions  of  the 
sample,  quite  analogous  to  grain  bounda¬ 
ries  in  crystal  growth,  are  observed  [57], 

II  .4  Nano-optics  and  Nanophotonics 
Nanophotonics  is  an  obvious  application 
area  for  IL.  Often  the  requirement  is  for 
large  areas  of  periodic  or  quasi-periodic 
structures.  Several  immediately  evident 
areas  include  ID,  2D  and  3D  photonic 
crystals  as  well  as  more  complex  periodic 
structures  for  metamaterial  applications 
such  as  enhanced  transmission  and  field 
concentration  in  arrays  of  nanoscale  metal¬ 
lic  structures  (holes  and  toroids)  and  nega¬ 
tive  index  metamaterials. 

IV.  4.1  Distributed  Feedback/Distributed 
Bragg  Reflectors/Photonic  Crystals 

Distributed  feedback  (DFB)  and  distrib¬ 


uted  Bragg  reflector  (DBR)  lasers  require 
simple  ID  gratings.  The  typical  periodicity 
requirement  for  a  first  order  grating  is  7J2 n 
where  X  is  the  laser  wavelength  and  n  is 
the  modal  index.  For  a  GaAs  laser,  A  ~ 
860  nm  and  n  ~  3.5,  so  p  ~  125  nm.  As 
demonstrated  earlier  in  this  paper,  immer¬ 
sion  IL  at  193  nm  can  extend  to  as  small 
as  p  ~  67  nm,  covering  the  full  range  of 


Fig.  15:  (a)  Optical  micrograph  of  an  integrated  nanofluidic 
chip  consisting  of  pipette  ports,  microchannels  in  a  T  con¬ 
figuration,  and  ~  2000  parallel  nanochannels  (~  1  cm  long 
running  from  top  to  bottom  in  the  figure),  (b)  SEM  micro¬ 
graph  of  a  cross  section  of  the  nanochannel  array.  The  chan¬ 
nels  are  etched  ~  1  pm  deep  into  Si,  oxidized  to  provide 
Si02  isolation  for  the  fluid,  and  capped  with  a  pyrex  cover 
slip  using  anodic  bonding,  (c)  Optical  micrograph  showing 
electrokinetic  separation  of  two  dyes  with  different  charge 
states. 


periods  of  interest.  Fig.  16  shows  a  typical 


result  for  a  mid-IR  DFB  laser  [58].  Over  30  dB  of  side-mode  suppression  was  achieved  in  this 
initial,  unoptimized  experiment. 


Two-dimensional  photonic  crystals  in  slab  geometries  have  many  applications  from 
waveguiding  [59]  and  photonic-crystal  lasers  [60]  to  photonic  crystal  light  extraction  [61].  The 
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required  pitch  for  each  of  these  structures 
is  similar  to  that  for  the  DFB  lasers  dis¬ 
cussed  above,  since  the  same  wave¬ 
lengths  and  materials  are  of  interest.  For 
photonic  crystal  LED  light  extraction,  the 
wavelengths  are  shorter  (~  460  nm  for 
blue  LEDs)  and  the  refractive  index  is 
somewhat  lower  (~  2.5)  so  the  required 
pitch  is  as  small  as  92  nm;  again  well 
within  the  capabilities  of  immersion  IL. 

There  are  several  approaches  to 
the  fabrication  of  2D  structures  using  IL, 
perhaps  the  easiest  is  to  use  multiple  2- 
beam  exposures  with  appropriate  rotation 
of  the  wafer  between  exposures  as  dis¬ 
cussed  above  in  Sect.  II.3.  Fig.  17  shows 
an  example  of  a  2D  hexagonal  pattern 
mask  defined  in  a  Ni  etch  mask  deposited 
onto  a  GaN  wafer  with  a  hexagonal  lat¬ 
tice  parameter  of  405  nm.  Pattern  transfer 
becomes  a  significant  issue  as  the  scale 
of  the  pattern  is  reduced.  Typically,  the 
thickness  of  the  photoresist  layer  has  to 
be  reduced  along  with  the  pattern  size  to 
avoid  photoresist  collapse  issues  on  de¬ 
veloping  that  result  from  surface  tension 
forces  as  the  resist  layer  dries.  Ulti¬ 
mately,  this  may  force  the  adoption  of  a 
dry  developing  scheme  or  a  new  class  of  resist  with  more  structural  rigidity  (higher  Young’s 
modulus).  Various  multi-level  resist  schemes  incorporating  a  hard  mask  layer  are  also  possible 
approaches  to  this  issue. 

There  are  at  least  two  approaches  to  3D  structures.  Structures  can  be  built  up  in  a  layer- 
by-layer  fashion  very  analogous  to  traditional  semiconductor  manufacturing.  In  particular,  there 

has  been  quite  a  bit  of  attention  devoted 
to  a  woodpile  structure,  with  alternating 
layers  of  long  bars  oriented  in  the  x-  and 
y-  directions  with  each  sequential  layer  in 
the  same  direction  offset  by  V2  the  pitch 
[62],  Most  often  the  fabrication  has  used 
standard  semiconductor  processing  tech¬ 
nologies  [63]-[64],  Alternatively,  using 
multiple-beam,  noncoplanar  interfer¬ 
ometric  lithography  the  entire  3D 
photonic  crystal  pattern  can  be  produced 
in  a  single  series  of  exposures  in  a  thick 
Fig.  17:  2D  patterned  Ni  etch  mask  at  a  hexagonal  lattice  con-  photosensitive  material  [65]-[67],  All  14 

stant  of  405  nm. 


Fig.  16:  (a)  465-nm  period  distributed  feedback  grating 
etched  into  a  interband  cascade  InAs/GaSb  laser  structure,  (b) 
High  resolution  output  spectrum  showing  30  dB  side-mode 
suppression. 


Bravais  lattices  and  many  other  space  groups  are  accessible  with  adjusting  the  number,  intensity 
and  polarizations  of  the  various  beams  [68],  [69].  This  interferometric  technique  has  been  ex¬ 
tended  to  the  fabrication  of  compound  (inter-penetrating)  lattices  such  as  the  woodpile  structure 
and  the  diamond  lattice  [66],  [70]. 


IV.4.2  Metamaterials  and  Negative  Refractive  Index 

Metals  naturally  provide  a  negative  permittivity  at  frequencies  below  the  plasma  frequency  cop  as 
a  consequence  of  the  free  electron  response  function  given  by 
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where  y  is  the  electronic  scattering  frequency.  There  is  no  comparable  naturally  occurring  nega¬ 
tive  magnetic  susceptibility.  However,  LC  resonant  (tank)  circuits  provide  a  region  of  negative 
permeability  around  their  resonance  frequency.  If  an  ensemble  of  such  resonant  structures  with 
dimensions  much  less  than  the  wavelength  is  aggregated  into  a  material  with  inter-structure  dis¬ 
tances  of  much  less  than  a  wavelength,  then  an  effective  medium  results  with  a  negative  perme¬ 
ability  over  a  limited  frequency  range.  The  permeability  in  this  case  is  dependent  upon  the  prop¬ 
erties  of  the  individual  resonances  and  is  independent  of  the  details,  such  as  periodicity,  of  their 
arrangement  into  a  material.  Such  a  material  is  classified  as  a  metamaterial.  Its  permittivity  ( s ) 
and  permeability  (//)  result  from  the  fabricated  resonant  structures. 

Fig.  18  shows  an  example  of 
magnetic  metamaterial  structures  fabri¬ 
cated  using  IL  [71].  The  major  feature  is 
a  loop  formed  by  shadow  evaporation 
over  a  ID  patterned  Si3N4  sacrificial 
layer  with  a  600-nm  pitch.  In  the  top 
panel,  this  has  been  segmented  into  an 
array  of  “staples”  and  the  nitride  has  been 
selectively  removed.  The  bottom  panel 
shows  a  ID  array  with  the  nitride  still  in 
place.  Fig.  19  shows  the  measured  (top) 
and  simulated  (middle)  responses  for  a 
number  of  structures  with  different  struc¬ 
tural  dimensions,  but  with  the  same  pitch. 
The  response  is  clearly  dependent  on  the 
individual  structural  parameters  satisfy¬ 
ing  the  definition  of  a  metamaterial.  Fi¬ 
nally,  the  bottom  panel  shows  the  perme¬ 
ability  derived  from  the  model  for  the 
highest  frequency  resonance.  A  region  of 
negative  Re(//)  is  clearly  established. 

Other  reports  of  negative  permeability  at 

10  cc„.  .  t  f„M  .  ...  infrared  frequencies  [72]  have  relied  on 

Fig.  18:  SEM  pictures  of  the  arrays  of  2D  (a)  and  ID  (b)  n  L  J 

staple-shaped  nanostructures.  The  ID  samples  are  different  e-beam  lithography  to  define  the  smallest 
from  the  2D  sample  in  that  the  gold  “staples”  are  continuous  feature,  and  consequently  have  been  re¬ 
in  the  y-direction  and  the  loops  are  filled  with  Si3N4.  stricted  to  small  samples.  In  contrast,  this 


Wavelength  [pm] 


process  relies  on  deposition  (of  a  dielec¬ 
tric  spacer  and  of  the  gold  fdm)  to  define 
the  smallest  features  and  samples  are 
several  cm2  in  size,  limited  by  conven¬ 
ience  of  handling  rather  than  by  cost  and 
time  of  pattern  definition.  This  illustrates 
one  of  the  major  advantages  of  a  parallel 
optical  lithography  approach  to  nano¬ 
photonic  structures. 

There  is  great  interest  in  the  fabrica¬ 
tion  of  negative  index  materials  [73]  - 
combining  a  negative  permeability,  as 
shown  above,  with  a  negative  permittiv¬ 
ity  usually  from  electron  plasma  response 
of  a  metal  film.  The  interest  in  this  topic 
has  increased  dramatically  as  a  result  of 
the  prediction  of  important  new  effects, 
including  diffractionless  imaging,  in  such 
a  material  [74],  A  particularly  simple  re¬ 
alization  of  such  a  material,  shown  in 
Fig.  20,  is  a 

metal/dielectric/metal/substrate  multi¬ 
layer  stack  perforated  by  a  2D  hole  array 
[75].  For  the  incident  polarization  indi¬ 
cated  in  the  cartoon,  the  magnetic  re¬ 
sponse  -  very  similar  to  that  discussed 
above  -  results  from  the  distributed  in¬ 
ductance/capacitance  associated  with  the 
two  separated  metal  films.  Fig.  21  shows 
the  measurement  and  modeling  of  the 
refractive  index,  clearly  showing  a  nega¬ 
tive  index  over  a  limited  wavelength 
range.  Importantly,  in  this  experiment, 
the  refractive  index  was  obtained 
uniquely  from  measurements  of  both  the 
amplitude  and  the  phase  of  both  the  film 
reflectance  and  transmission.  For  the 
simulation  the  only  adjustable  parameter 
was  the  electronic  scattering  frequency  in  the  metal  that  was  taken  as  approximately  3x  the  lit¬ 
erature  value  for  bulk  Au  based  on  fitting  the  transmission  measurements.  There  is  remarkably 
good  agreement  between  the  measurement  and  the  modeling.  There  remains  significant  loss  in 
this  material  with  the  Im(«)  >  Re(«)  across  the  resonance.  Detailed  investigation  shows  that  for 
this  structure  while  the  permeability  shows  a  resonance  response,  its  real  part  does  not  become 
negative  and  the  negative  index  is  a  consequence  of  the  combination  of  negative  real  part  of  the 
permittivity  and  a  positive  imaginary  part  of  the  permeability  at  the  resonance.  More  optimal 
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Fig  19:  Experimental  (top)  and  simulation  (middle)  results  for 
the  reflectivity  as  a  function  of  frequency  for  three  different 
structures.  The  resonance  frequency  is  a  function  of  the 
nanostructure  parameters  and  is  independent  of  array  period, 
which  is  the  same  for  all  three  structures.  The  resonance  is 
observed  only  for  TM  polarization,  i.e.  incident  magnetic 
field  coupled  into  the  inductive  structure.  Only  one  TE  polari¬ 
zation  result  is  shown.  The  bottom  panel  shows  the  effective 
permeability  extracted  from  the  model  for  the  highest  fre¬ 
quency  resonance. 


structures  have  been  designed  that  promise  a  significant  improvement  in  the  quality  of  the  nega¬ 
tive  index  material  (|Re(n)/Im(«)|). 

Interferometric  lithography  is  especially  attractive  for  the  fabrication  of  this  class  of  negative 
index  materials.  As  has  been  amply  demonstrated  earlier,  IL  is  ideally  suited  to  the  fabrication  of 
large  arrays  of  2D  structures  such  as  the  pattern  of  holes.  The  fabrication  in  this  case  was  simply 
to  create  a  2D  post  array,  etch  through  the  underlying  ARC  layer,  and  then  to  deposit  the  three 
films  sequentially  and  dissolve  away  the  photoresist  to  leave  the  final  structure.  Sample  dimen¬ 
sions  were  ~  2x2  cm2.  This  simplicity  of  fabrication  of  large  areas  of  materials  offers  promise 
both  for  the  detailed  study  and  optimization  of  these  effects  and  also  for  manufacturing  of  large 
areas  of  nanoscale  structures  for  practical  applications. 
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V  CONCLUSIONS 

Nanoscale  fabrication  is  a  critical  aspect  of  any  nanotechnology.  Optical  lithography  is  the  only 
large-scale  manufacturing  technique  that  the  integrated  circuit  industry  has  ever  used  for  pattern 
definition.  Despite  continuing  predictions  of  its  demise,  optical  lithography  continues  to  be  ex¬ 
tended,  most  recently  with  immersion  techniques  that  gain  by  the  refractive  index  of  the  material 
between  the  lens  and  the  photoresist  -  initially  H20  with  an  index  of  1.44  at  193  nm.  It  now  ap¬ 
pears  likely  that  a  hyper- NA  ( NA  >  1)  optical  lithography  will  address  the  industry  needs  for  the 
45-nm  half-pitch  node.  Further  extensions  are  possible  and  are  being  actively  investigated  for 
even  finer  lithographic  patterns. 

While  this  is  good  news  for  the  semi¬ 
conductor  industry,  the  cost  of  these  optical 
lithography  tools,  ~  $20  to  $30  million, 
makes  them  inaccessible  to  a  typical  re¬ 
search  environment.  Much  of  this  cost  is 
associated  with  the  need  of  the  industry  to 
produce  fully  arbitrary  patterns  at  very  high 
throughputs.  Current  steppers  also  require 
quite  expensive  masks  (~  $  1-  to  2-million 
for  a  microprocessor  mask  set)  to  define  the 
patterns  and  are  capable  of  a  throughput  of 
~  85  wafers  per  hour,  each  with  ~  125  dies 
(22x33  mm  )  across  a  300-mm  diameter 
wafer;  this  corresponds  to  an  average  sus- 
tained  data  throughput  rate  of  ~  10  bit/s. 

Fortunately,  nanoscience  research  often 
requires  less  flexibility  -  for  example  peri¬ 
odic  patterns,  and  always  requires  less 
throughput.  The  needs  of  much  of 
nanoscience  research  can  be  met  at  a  sub¬ 
stantially  lower  cost.  Interferometric  lithog¬ 
raphy,  the  interference  of  a  small  number  of 
coherent  laser  beams,  readily  produces 
large-area  patterns  (several  cm2)  at  the  same 
and  even  smaller  scales  than  are  available 
from  commercial  steppers.  With  an  ArF  la¬ 
ser  source  at  193  nm  and  H20  immersion, 


Fig.  20:  a)  Negative  index  material  at  2  (am  consisting  of  a 
Au/A1203/Au  film  stack  on  a  glass  substrate  perforated  with 
a  2D  array  of  holes  at  a  pitch  of  0.8  pm.  The  green  shaded 
regions  contribute  to  the  permeability  while  the  blue  shaded 
regions  are  responsible  for  the  negative  permittivity,  b) 
SEM  micrograph  of  the  structure. 


the  smallest  half-pitch  (equal  line: space) 
pattern  accessible  extends  to  34  nm.  Spa¬ 
tial  frequency  multiplication,  taking  ad¬ 
vantage  of  nonlinearities  in  the  process¬ 
ing,  allows  extensions  by  a  factor  of  two 
or  more,  extending  this  capability  ulti¬ 
mately  to  below  10  nm.  Importantly,  IL 
is  a  maskless  approach  to  lithography 
that  greatly  extends  flexibility  without 
requiring  expensive  masks. 

Several  different  techniques  were 
discussed  that  extend  the  flexibility  of  IL. 
Multiple  IL  exposures,  multiple-beam  IL, 
and  mix-and-match  with  both  high  reso¬ 
lution  (e-beam)  and  with  lower  resolution 
(proximity  optical  lithography)  all  offer 
complimentary  capabilities.  Taken  to¬ 
gether,  these  techniques  provide  a  facile 
suite  of  techniques  applicable  to  a  broad 
range  of  nanoscience  applications,  with  a 
resolution  capability  that  rivals  standard 
nanofabrication  techniques  such  as  elec¬ 
tron-beam  lithography,  but  with  a  large- 
area  capability  that  often  allows  addi¬ 
tional  diagnostic  techniques  that  are  not 
available  for  very  small  samples  -  and  of 
course  can  sustain  volume  manufactur¬ 
ing. 

Directed  self-assembly,  the  combination  of  self-assembly  to  define  features  extending  to 
molecular  dimensions  while  retaining  the  organization  across  multiple  length  scales  from  nm  to 
cm  offered  by  optical  lithography,  is  a  promising  future  direction.  Ultimately,  this  provides  a 
pathway  to  a  sub-nm  patterning  regime. 

IL  provides  the  limiting  resolution  of  optical  lithography,  but  the  periodicity  poses  a  prob¬ 
lem  for  some  applications,  especially  for  the  integrated  circuit  industry.  Imaging  interferometric 
lithography  (IIL)  was  introduced  as  an  extension  of  IL  to  arbitrary  patterns.  Essentially,  IIL  is  a 
combination  of  extreme  off-axis  illumination,  to  reach  the  highest  spatial  frequencies  permitted 
by  a  given  optical  system,  along  with  imaging  pupil  plane  filters,  to  avoid  multiple  counting  of 
low  frequency  information  and  thereby  to  balance  the  image  frequency  response.  With  the  addi¬ 
tion  of  the  concept  of  spatial  frequency  multiplication,  IIL  offers  a  route  for  optical  lithography 
to  the  16-nm  half-pitch  limit  of  the  current  industry  manufacturing  roadmap  extending  to  2019. 

Numerous  application  areas  for  nanoscale  patterns  were  discussed.  Nanoscale  heteroepitaxy 
provides  a  new  paradigm  for  heterostructure  crystal  growth.  The  use  of  nanoscale  seeds  for  nu- 
cleation  of  many  small  crystals  provides  new  capabilities  for  the  relief  of  lattice-mismatch  strain. 
The  important  dimensionless  parameters  are  (1)  the  ratio  of  the  pattern  linear  dimension  to  the 
adatom  mean- free  path  so  that  an  adsorbed  atom  can  sample  the  entire  growing  crystal  for  a  pre¬ 
ferred,  low-energy  incorporation  site,  and  (2)  the  ratio  of  the  pattern  linear  dimension  to  the  criti- 


Fig.  21:  Measured  (a)  and  simulated  (b)  real  and  imaginary 
parts  of  the  refractive  index  for  the  structure  of  Fig.  15.  There 
is  a  region  of  negative  index  around  2  pm.  The  maximal  real 
part  of  the  negative  index  is  around  -2  while  the  imaginary 
part  is  3  to  4,  indicating  a  significant  loss. 


cal  thickness  before  the  generation  of  defects  in  large-area  heterostructure  growth.  These  tech¬ 
niques  are  applicable  to  both  molecular-beam  epitaxy  (MBE)  and  metal-organic  vapor-phase  epi¬ 
taxy  (MOVPE),  the  dominant  growth  techniques  for  advanced  heterostructure  devices.  In  the 
case  of  MBE,  selective  area  growth,  faceting,  and  lateral  overgrowth  and  coalescence  -  all  of 
which  are  rarely,  if  ever,  observed  for  large  area  (>  1  -pm  scale)  patterning  -  were  clearly  ob¬ 
served,  illustrating  the  fundamental  difference  afforded  by  nanoscale  patterning.  Two  examples 
were  given:  (1)  growth  of  a  relaxed  InGaAs  layer  on  a  GaAs  substrate  using  a  2D  SiC>2  masking 
layer;  and  (2)  spatial  phase  separation  of  the  cubic  and  hexagonal  phases  of  GaN  grown  on  a 
(100)/  {111}  patterned  Si  wafer. 

Nanofluidic  transport  and  separation,  particularly  of  biomolecules,  is  another  exciting  appli¬ 
cation  for  nanoscale  patterning.  Here  the  important  dimensionless  parameter  is  the  ratio  of  the 
linear  pattern  dimension  to  the  Debye  screening  length  in  the  ionic  fluids  important  for  biology. 
A  double  layer  of  opposite  charges  forms  at  the  wall  of  a  nanochannel  filled  with  an  ionic  fluid. 
Electrokinetic  (electroosmosis  and  electrophoresis)  transport  is  impacted  by  the  extent  of  this 
double  layer.  This  Debye  length  is  of  the  order  of  10-  to  100-nm  for  most  situations  relevant  to 
biology,  which  sets  the  scale  differentiating  nanochannels  from  the  much  more  widely  studied 
microchannels  where  the  screening  length  is  a  small  fraction  of  the  Debye  length.  An  integrated 
chip  was  demonstrated  that  combines  features  ranging  from  50  nm  to  1  cm  -  a  five  order  of 
magnitude  range!  Initial  flow  measurements  were  presented  demonstrating  the  separation  of  two 
dyes  with  different  charge  states. 

Magnetic  storage  has  long  been  migrating  towards  smaller  features  and  higher  densities  and 
is  now  entering  an  era  of  single-domain,  nanoscale,  multilayer  structures.  Again,  interferometric 
lithography  is  a  powerful  fabrication  technique  for  the  production  of  large  areas  and  for  ultimate 
manufacturing  of  devices  and  circuits  such  as  magnetic  random-access  memories  (MRAM). 

Nanooptics  and  nanophotonics  are  natural  applications  for  IL.  Distributed  feedback  and  dis¬ 
tributed  Bragg  reflector  lasers,  requiring  nanoscale  one-dimensional  patterning  have  long  been  a 
mainstay  for  the  spectroscopic  control  of  diode  lasers.  Recently,  there  has  been  extended  interest 
in  the  use  of  2D  patterning  for  photonic  crystals  to  control  optical  properties.  This  extends  from 
photonic  crystal  waveguides  with  missing  or  modified  rows  of  holes  (posts)  to  guide  optical 
beams  to  the  use  of  photonic  crystals  for  light  extraction  from  high-index,  semiconductor-based 
LEDs.  Three-dimensional  photonic  crystals  can  be  fabricated  both  with  layer-by-layer  tech¬ 
niques  and  by  using  multiple  IL  beams  with  varying  azimuthal  angles.  Metamaterials  and  plas- 
monics  are  another  area  of  great  current  interest.  Using  simple  IL  procedures,  large  areas  of 
resonant  magnetically  coupled  resonators  (tank  circuits)  with  resonances  throughout  the  mid- 
infrared  spectral  region  were  demonstrated.  Very  recently,  a  large-area  negative  index  material  at 
2  pm  was  demonstrated  using  a  relatively  simple  IL  implementation. 

Hopefully  by  now  the  message  is  clear.  Interferometric  lithography  provides  a  readily 
available,  inexpensive,  large-area  route  to  nanoscale  patterning  of  great  importance  to  many  ar¬ 
eas  in  nanotechnology.  Combining  IL  with  self-assembly  techniques  in  “directed  self-assembly” 
extends  the  reach  of  patterning  capabilities  truly  to  molecular  scales.  There  is  a  vast  range  of 
possibilities  across  many  different  fields  and  much  to  be  learned  from  the  application  and  exten¬ 
sion  of  the  techniques  presented  here. 
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Demonstration  of  Two-Photon  Lithography 

Ch.  J.  Schwarzab,  A.V.V.  Nampoothiriac,  J.C.  Jasaparad,  W.  Rudolph3, and  S.R.J.  Brueckae 
Center  for  High  Technology  Materials,  University  of  New  Mexico,  Albuquerque,  NM  87106 

Two-photon  exposure  is  demonstrated  using  ~12  fs  duration  pulses  at  a  center  wave¬ 
length  of  800  nm  and  a  standard  I-line  photoresist.  Using  direct-writing  with  a  focused 
Gaussian  beam,  developed  resist  features  as  small  as  300  nm,  compared  with  the  ~  1  pm 
focal  spot  diameter,  have  been  achieved.  The  experiments  reveal  substantial  differences 
(~  104)  in  the  two-photon  absorption  coefficient  for  multiple  pulse  trains  (100  MHz)  and 
for  single  pulses.  A  simple  model  involving  photoexcitation  of  an  intermediate  confor¬ 
mation  of  the  photoactive  compound  and  relaxation  back  to  the  ground  state  can  explain 

this  observation.  The  resolution  enhancement  of  l/ yfl  resulting  from  the  two-photon  ab¬ 
sorption  is  demonstrated  by  writing  two  lines  as  close  as  possible  to  each  other  with  both 
one-  and  two-photon  absorption  mechanisms. 
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Introduction 

The  continuing  demands  for  smaller  feature  sizes  in  mi¬ 
croelectronics  are  driving  optical  lithography  ever  further 
into  the  sub-wavelength  regime.  Many  different  resolution 
enhancement  techniques,  including  off-axis  illumination, 
optical  proximity  correction,  phase-shift  masks  and  imaging 
interferometric  lithography  have  been  demonstrated.  These 
advances,  along  with  the  continuing  migration  to  shorter 
wavelength  sources  and  to  higher  NA  lenses,  have  been  suc¬ 
cessful  in  extending  the  manufacturing  use  of  optical  lithog¬ 
raphy  to  dimensions  well  beyond  the  expectations  of  even  a 
few  years  ago. 

It  has  long  been  known1"3  that  nonlinear  processes,  such 
as  two-photon  absorption  in  which  the  photoresist  responds 
to  the  square  of  the  intensity,  offer  possibilities  of  higher 
resolution.  This  can  be  understood  from  elementary  argu¬ 
ments.  Consider  an  interferometric  lithography  arrangement 
where  two  coherent  laser  beams  are  incident  on  a  photoresist 
layer  at  angles  of  ±6.  Then  the  aerial  image  intensity  is 
/(x)  oc  cos2(Axsin#)oc  [l  +  cos(2£xsin#)]/2  ,  where  k=  2jt/A 
with  A  the  optical  wavelength,  which  is  the  appropriate  ex¬ 
pression  for  one-photon  absorption.  For  two-photon  absorp- 
tion,  the  resist  responds  to 
/2(x)oc  cos4(Axsin0)oc  [3  +  4cos(2Axsin0)+  cos  (4 Ax  sin  0)^8 

.  The  higher  harmonic  content  is  evident  in  the  last  term  of 
this  expression.  Plotting  these  expressions  shows  that  the 
density  of  peaks  in  the  response  is  the  identical  in  both 
cases,  but  that  the  linewidth  is  reduced  in  the  two-photon 
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case.  Yablonovitch  et  al.  have  proposed2  a  scheme  to  elimi¬ 
nate  the  lower  spatial  frequency  term  and  realize  the  full 
factor  of  two  enhancement  in  resolution  inherent  in  these 
equations.  The  photoresist  exposure  and  development  proc¬ 
ess  is,  in  itself,  highly  nonlinear,  thus  higher  harmonics  of 
the  fundamental  frequency  are  routinely  achieved  in  one- 
photon  lithography  (e.  g.  vertical  sidewalls).  Exploitation  of 
all  of  these  nonlinear  processes  offers  significant  potential 
for  extending  the  resolution  range  of  optical  lithography  yet 
further  into  the  subwavelength  domain. 

In  this  paper  we  present  results  of  direct  writing  on  a 
photoresist  using  focused  Gaussian  beams.  The  possible 
enhancement  in  the  area  density  of  structures  using  an  n- 
photon  absorption  process  is  n.  This  results  from  the  reduc¬ 
tion  of  the  FWHM  of  a  Gaussian  by  a  factor  of  yfn  when  it 
is  raised  to  the  n- th  power.  In  the  first  part  of  the  paper,  we 
investigate  two-photon  absorption  of  a  photoresist  and  com¬ 
pare  exposures  with  single  pulses  and  multiple  pulse  trains. 
In  the  second  part  we  demonstrate  the  resolution  enhance¬ 
ment  by  comparing  the  minimum  achievable  separation  of 
two  lines  written  with  single  and  two-photon  exposure. 

Two-photon  exposure  of  isolated  lines 

We  assume  that  a  certain  number  density  of  pho¬ 
toactive  compound  (PAC),  N±,  has  to  be  excited  in  order  to 
expose  the  photoresist,  independent  of  whether  one-  or  two- 
photon  absorption  occurs.  This  can  be  expressed  as 

Nlh  =  V  =  7^  f /, (t)dt  =  N2=  f  I22 (t)dt  (1) 

hvj  2  hv2J 

where  a  (/ 3 )  is  the  one-  (two-)  photon  absorption  coefficient 
in  units  of  cm'1  (cm/W),  I  is  the  laser  intensity,  v  is  the  laser 
frequency,  and  the  index  1  (2)  describes  the  one  (two-) 
photon  excitation.  Note  that  V\  =  2v2  since  both  absorption 
processes  excite  the  same  transition.  Typical  values  for  the 
one-photon  absorption  coefficient  for  conventional  photore- 
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sist  are  engineered  to  be  on  the  order  of  a  «104  cm-1,  while 
for  the  two-photon  absorption  coefficient  b  «10-9  cm/W6,7. 

Thus  an  intensity  on  the  order  of  I  ~  a/ /3  ~  1013  W/cm2  is 

required  for  two-photon  lithography.  In  order  to  keep  the 
total  energy  realistic  and  to  avoid  thermal  damage  to  the 
resist  and  the  underlying  materials,  this  intensity  must  be 
deivered  using  a  short-pulse  (femtosecond)  source  to  take 
advantage  of  the  very  high  peak  powers.  It  is  emphasized 
that  it  is  the  peak  intensity,  not  the  exposure  dose  that  is  in¬ 
creased  by  this  large  factor.  As  is  discussed  below,  with  fs 
pulse  illumination,  the  exposure  energy  remains  comparable 
to  that  for  a  one-photon  case. 

A  schematic  diagram  of  the  experimental  setup  is  shown 
in  Fig.l.  A  Kerr-lens  modelocked  Ti:sapphire  laser  with  a 
repetition  rate  of  100  MHz  provides  pulses  with  a  duration 
of  10  fs  at  a  center  wavelength  of  800  nm.  The  pulses  are 
focused  with  a  microscope  objective  onto  the  photoresist. 
Assuming  Gaussian  beam  characteristics  the  focal  spot  di¬ 
ameter  was  ~  1  mm.  In  order  to  avoid  temporal  broadening 
in  the  objective,  the  pulses  were  dispersion-precompensated 
with  a  prism  pair  and  a  third-order  diffraction  mirror4.  The 
autocorrelation  of  the  pulses  obtained  in  the  focal  plane  of 
the  objective  is  shown  in  inset  a)  of  the  Fig.l  and  indicates  a 
pulse  width  of  12  fs.  We  used  a  Shipley  510  I-line  photore¬ 
sist  in  a  diluted  form  (1:1),  which  was  measured  to  have  a 
film  thickness  of  about  250  nm  after  spinning.  The  absorp¬ 
tion  coefficient  is  a  «0.5xl04cm-l  at  400  nm  according  to 
the  manufacturers.  There  is  essentially  no  single  photon  ab¬ 
sorption  at  the  fundamental  of  our  laser  (800  nm).  Conse¬ 
quently,  no  exposure  of  the  resist  was  observed  with  our 
laser  operating  in  continuous  wave  (cw)  mode  to  total  flu- 
ences  well  above  those  required  for  two-photon  exposure. 

To  study  the  two-photon  absorption  quantitatively  we 
proceeded  as  follows.  In  order  to  write  a  line,  the  sample 
was  moved  perpendicular  to  the  beam  using  a  piezo  driven 
translation  stage.  A  small  longitudinal  translation  component 


a)Autocorrelation  measure 


Fig.  1:  Experimental  arrangement  for  fs-pulse  train,  two-photon 
exposure  of  photoresist.  Inset  a)  shows  the  pulse  autocorrelation 
measurement  of  a  12-fs  pulse  at  the  focal  point  of  the  lens.  Inset  b) 
shows  the  geometry  of  sample  motion  during  the  experimental 
scans. 


characterized  by  the  angle  cp  (see  Fig.l,  inset  b))  was  intro¬ 
duced  to  assure  that  the  sample  passed  through  the  focal 
plane  during  the  scan.  Consequently  the  intensity  at  the 
sample  varies  along  each  line.  Several  lines  were  written 
with  different  scan  speeds,  varying  the  total  dose  (see  Fig.2). 

Each  line  was  exposed  to  a  different  dose,  that  is  to  a  dif¬ 
ferent  number,  m,  of  pulses  per  line  element.  As  expected 
longer  lines  were  observed  with  higher  doses.  The  length,  1, 
of  the  developed  line  as  a  function  of  m  is  shown  in  the 
lower  part  of  Fig.2.  This  data  was  used  to  determine  the  two- 
photon  absorption  coefficient.  The  number  density  of  ex¬ 
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Fig.  2:  Top  panel  shows  the  developed  sample.  The  verti¬ 
cally  separated  traces  correspond  to  different  developed  lengths 
(/)  due  to  different  scan  speeds.  The  bottom  panel  shows  the  fit 
of  these  exposed  length  results  to  a  simple  model  for  extracting 
the  two-photon  absorption  coefficient. 
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cited  molecules  is  related  to  the  intensities  according  to  Eq. 
(1).  A  permanent  structure  can  be  expected  in  a  region  (r,z) 
characterized  by 


Nth  -  2 \hv~ mf  /2(z’r’^dt  =  N2(r’z)  (2) 

The  length  of  the  observed  lines,  /,  can  be  obtained  from 
the  requirement  N2  (z,r  =  0)>  Nth .  For  a  Gaussian  pulse 
profile  we  find 


l(m)  =  z0  cot  (cp) 


P 


JjtT 


nV2 


2 hv  N 


th 


y[&\n2 


1/2 


-1 


(3) 


where  z0  is  the  Rayleigh  range  of  the  beam,  I0  =  I(r  =  0,  t  = 
0)  ,  and  r  is  the  pulse  width.  From  a  fit  of  Eq.  (3)  to  the  data 
we  find  /3  «  2.6x10" 14  cm/W.  This  deviates  considerably 
from  typical  literature6  values,  which  are  on  the  order  of  - 
10"9  -10"10cm/W. 

To  investigate  this  further  the  photoresist  was  exposed 
by  single  pulses  from  a  Ti:  sapphire  amplifier.  The  pulse  du¬ 
ration  was  -30  fs.  For  a  spot  diameter  of  330  |im,  an  expo¬ 
sure  threshold  energy  of  25  qJ  was  determined.  From  Eq.  (2) 
and  m  =1  we  obtain  (3  -  6x1  O'9  cm/W  assuming  again  a 
Gaussian  spatial  and  temporal  profile.  This  value  is  consis¬ 
tent  with  that  found  for  other  organic  materials?.  This 
threshold  corresponds  to  a  dose  of  -  25  mJ/cm2  comparable 
to  the  single  photon  (I-line)  dose-to-clear  for  this  resist. 

For  a  tentative  explanation  of  the  difference  between 
single  and  multiple  pulse  exposure  we  look  at  the  photo¬ 
chemical  processes  following  the  excitation.  With  single 
photon  absorption  experiments  it  was  found  that  the  pho¬ 
toactive  compound  undergoes  a  rapid  transition  to  an  inter¬ 
mediate  conformation  (IC)  with  a  lifetime  extending  to  as 
much  as  several  hundred  ms  that  can  absorb  the  incident 
radiation8.  Qualitatively,  we  can  explain  the  observed  dis¬ 
crepancy  between  exposures  using  single  pulses  and  pulse 
trains  if  we  assume  that  the  excited  IC  can  decay  back  into 
the  ground  state  of  the  initial  compound,  see  Fig.  3.  Based 
on  a  simple  rate  equation  analysis,  we  calculated  N2  as  a 
function  of  m  for  this  scenario  and  for  the  case  when  every 
absorption  event  leads  immediately  to  the  formation  of  the 
final  compound.  From  the  graphs  in  Fig. 3  it  is  clear  that  pa¬ 
rameters  can  be  found  that  vary  the  excitation  efficiency  by 
several  orders  of  magnitude.  More  detailed  experiments  and 
modeling  are  necessary  to  confirm  this  hypothesis. 

The  minimum  observed  linewidth  was  -  300  nm,  much 
less  than  the  laser  spot  size.  Because  of  other  nonlinearities 
in  the  resist  response,  similar  sub-optical  resolution  results 
can  be  achieved  with  one-photon  excitation.  A  multiple  line 
exposure,  showing  an  increase  in  the  available  packing  den¬ 
sity  of  the  lines  and  providing  clear  evidence  of  the  im¬ 
proved  resolution  associated  with  two-photon  absorption  is 
described  below. 


Resolution  enhancement  using  two-photon  lithography 

In  order  to  determine  the  minimum  spacing  possible  and 
to  demonstrate  the  improved  resolution  of  two-photon,  di- 


excited  state  intermediate 


(m)  #  of  pulses 

Fig.  3:  Possible  mechanism  for  the  observed  differences  in 
two-photon  absorption  coefficient  between  the  multiple-  and 
single-pulse  experiments.  Top  level  diagram  shows  reabsorp¬ 
tion  in  an  intermediate  conformation  and  relaxation  back  to  the 
ground  state  of  the  PAC.  The  bottom  plot  shows  the  results  of  a 
simple  rate  equation  analysis  of  this  model  that  can  qualita¬ 
tively  explain  the  observations. 


rect-write  optical  lithography,  pairs  of  lines  were  written. 
We  compared  a  two-photon  exposure  at  800  nm  with  a  one- 
photon  exposure  at  400  nm.  The  latter  wavelength  was  ob¬ 
tained  by  frequency  doubling  the  output  of  the  fs  Ti:  sapphire 
laser.  Because  different  focusing  lenses  were  used,  we  first 
determined  the  spot  sizes  with  a  knife-edge  method.  The 
results  were  2oo0  ~  17.6  |im  (800  nm)  and  -13.2  |im  (400 
nm).  It  should  be  noted  that  even  though  the  focal  spot  size 
was  smaller  for  the  400  nm  one-  photon  exposure,  the 
achievable  developed  linewidth  was  smaller  for  the  800  nm, 
two-photon  exposure,  as  shown  in  Fig. 4.  Also,  in  the  two- 
photon  case  the  line-edge  slope  is  higher  than  in  the  one- 
photon  case  -  as  expected. 

In  the  next  step,  two  lines  were  written  as  close  to  each 
other  as  possible  in  this  setup  without  having  them  merge 
into  a  single,  wide  line.  In  order  to  compare  both  exposure 
types  near  threshold  we  varied  the  scan  speed  (exposure 
dose)  during  one  scan  to  assure  that  the  threshold  was 
reached.  Pairs  of  lines  with  different  separation  were  written. 
A  smaller  distance  between  the  lines  was  achieved  in  the 
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Two-photon  lines  One-photon  line 

800-nm  exposure  400-nm  exposui 


Fig.  4:  Comparison  of  one-photon  (400  nm)  and  two- 
photon  (800  nm)  exposures  of  both  single  and  double  lines.  The 
two-photon  results  show:  a  smaller  achievable  linewidth  (even 
with  a  larger  spot  size);  an  improved  edge  definition;  and  the 

expected  l/ yfl  improvement  in  achievable  linear  packing  den¬ 
sity. 

two-photon  case  (10.2  qm)  compared  to  the  one-photon  case 
(12  pm).  The  corresponding  pairs  with  minimum  spacing  are 
shown  in  Fig.4.  Normalizing  the  minimum  achievable  sepa¬ 
ration  of  the  line  pairs  to  the  corresponding  spot  diameters, 
gives  a  ratio  of  the  distance  between  the  two-photon  exposed 
and  the  one-photon  exposed  line  pairs  of  0.64,  which  is  close 

to  the  expected  ratio  of  l/  V 2  . 

For  these  first  experiments,  we  have  investigated  the 
resolution  for  serial  writing  of  individual  structures.  While 
this  is  of  intrinsic  interest,  extending  these  results  to  parallel 
lithographic  processes  will  be  necessary  for  application  to 
manufacturing.  There  are  at  least  two  approaches  to  this  re¬ 
quirement.  Yablonovitch  et  al2  have  shown,  using  coher¬ 
ently  related,  frequency  shifted  sources,  that  the  ultimate 
doubled  resolution  of  periods  of  X/4  can  be  achieved  in  a 
single  exposure,  at  least  for  grating  structures.  Alternatively, 
two  mask  exposure  patterns  can  be  interpolated,  achieving 
the  same  two-photon-enhanced  resolution  for  arbitrary  pat¬ 
terns,  but  at  the  expense  of  a  second  alignment  and  exposure 
step.  This  interpolation  is  essentially  equivalent  to  the  serial 
process  we  have  demonstrated. 

Conclusion 

The  two-photon  absorption  coefficient  of  the  photoresist 
Shipley  510  was  measured  to  be  /3  ~  6x1 0"9  cm/W.  A  dra¬ 
matic  drop  in  the  exposure  efficiency  was  observed  with 


multiple  pulse  excitation.  This  is  tentatively  explained  by  the 
existence  of  a  photoactive  intermediate  state  whose  excita¬ 
tion  leads  to  re-population  of  the  initial  ground  state. 

We  have  demonstrated  that,  in  direct  writing  using  Gaus¬ 
sian  beams,  two-photon  exposure  enhances  the  resolution  by 

a  factor  of  ~  y[2  providing  a  factor  of  two  increase  in  the 
achievable  writing  density  in  two  dimensions.  This  is  possi¬ 
ble,  since  the  two-photon  process  involves  higher  harmonic 
spatial  frequencies  due  to  the  intensity  squared  resist  re¬ 
sponse.  In  addition,  an  improvement  of  the  contrast  for  a 
given  resist  chemistry  is  reported.  As  lithography  progresses 
to  vacuum  uv  wavelengths,  the  advantage  of  improved 
resolution  at  a  wavelength  for  which  lenses  are  still  transpar¬ 
ent  may  be  significant.  With  currently  available  fs  laser 
technology,  three-photon  absorption  at  the  fundamental  or 
two-photon  absorption  of  the  second  harmonic  could  effec¬ 
tively  extend  this  technique  to  shorter  wavelengths  and 
higher  resolution.  Alternatively,  photoresists  optimized  for 
nonlinear  absorption  could  possibly  adapting  existing  nar¬ 
row-bandwidth  UV  laser  sources  and  high-NA  optics  to 
nonlinear  exposure  schemes. 
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5.1  Introduction 

Moore’s  law,  the  continued  doubling  of  semiconductor  integrated  circuit  capabil¬ 
ity  on  an  -18-month  time  scale,  has  become  ingrained  not  only  in  technology  but 
also  in  popular  culture.  Not  only  electronics,  but  also  communications  (in  band¬ 
width  capability  of  a  fiber  and  in  total  installed  bandwidth)  and  storage  technolo¬ 
gies  (in  magnetic  storage  bits/cm2)  have  their  own  “Moore’s  law”  describing  ex¬ 
ponential  increases  in  capability.  Together,  these  technologies  define  the  “infor¬ 
mation  age”  and  are  dramatically  impacting  society. 

Advances  in  optical  lithography  have  been  responsible  for  much  of  this  im¬ 
provement,  by  some  estimates  as  much  as  50%  of  the  advance  in  scaling  of  sili¬ 
con  circuits.  Descriptions  of  the  evolution  of  optical  lithography  from  the  begin¬ 
ning  of  silicon  integrated  circuits  [wavelength  of  432  nm  (G-line  of  Hg);  prox¬ 
imity  printing]  to  today’s  advanced  scanners  [wavelength  of  193  nm  (ArF  laser); 
4x  reduction  optics;  NA  -  0.75,  phase  shift  masks  with  optical  proximity  correc¬ 
tion]  have  been  presented  elsewhere  by  the  pioneers  of  the  field.1 

Many  corollaries  to  Moore’s  law  have  been  advanced,  aimed  at  pointing  out 
impending  limits  or  attendant  consequences.  One  familiar  one  is  that  the  cost  of  a 
fabrication  facility  is  also  exponentially  increasing,  is  today  at  several  billion  dol¬ 
lars,  and  simple  extrapolation  will  tell  you  when  it  will  exceed  the  gross  world 
product.  Another,  more  pertinent  to  this  discussion,  is  that  the  “end”  of  optical 
lithography  is  always  two  IC  generations  away.  The  purpose  of  this  article  is  to 
examine  the  fundamental  issues  that  determine  the  limits  of  optical  lithography. 
While  some  comments  will  be  made  concerning  practical  limitations,  our  pur¬ 
pose  is  to  outline  the  realm  of  the  possible  in  contrast  to  the  probable.  There  are 
many  practical  issues  that  will  ultimately  limit  the  ability  to  continue  the  down¬ 
ward  scaling  of  lithography.  And  new  technologies  such  as  nanoimprint  and  self- 
assembly  are  being  advanced  that  may  make  the  limits  of  optical  lithography  a 
moot  point.  My  own  bias  is  that  combinations  of  these  techniques  will  enable 
inexpensive  access  to  the  nanoscale  without  requiring  radically  new  approaches. 
Ultimately,  the  manufacturing  decision  will  be  made  by  economics,  not  by  theo- 
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retical  discussions  of  possibilities.  Nevertheless,  there  is  some  utility  in  asking 
what  physics  and  chemistry  allow,  and  how  possibly  we  might  get  there. 

A  useful  starting  point  is  the  International  Technology  Roadmap  for  Semi¬ 
conductors  (ITRS).2  This  document  is  continually  updated  by  the  semiconductor 
industry;  the  latest  version,  published  in  December  2001,  provides  the  best  un¬ 
derstanding  of  the  semiconductor  industry  both  as  to  what  is  needed  and  how 
they  are  going  to  get  it.  The  table  relevant  to  lithography  is  reproduced  here  to  set 
the  stage  on  the  scales  that  will  be  required  by  this  industry. 

This  table  is  divided  into  near-term,  with  one-year  increments,  and  long-term 
(shaded)  sections,  with  three-year  increments.  Memory  requirements  are  still 
driving  lithography,  although  the  distinction  between  memory  and  microproces¬ 
sors  is  lost  as  the  roadmap  progresses. 

While  much  of  the  present  discussion  is  framed  in  the  context  of  silicon  inte¬ 
grated  circuits,  it  is  worth  noting  that  lithography  has  a  much  broader  impact. 
The  National  Nanotechnology  Initiative  in  the  United  States3,  and  similar  initia¬ 
tives  elsewhere  in  the  world,  is  recognition  of  the  emerging  importance  of  proc¬ 
esses  characterized  by  nanometer  scale  lengths,  at  which  macroscopic  scaling 
breaks  down  and  quantum  and  interface  effects  dominate  material  behavior.  The 
nanometer  scale  is  also  the  scale  of  many  biological  processes.  Continued  ad¬ 
vances  in  our  abilities  to  manipulate  material  at  these  scales  will  likely  lead  to 
many  new  technologies.  A  characteristic  of  nature  is  the  integration  of  many 
length  scales  to  produce  complexity  and  enhance  functionality.  Manifestly,  litho¬ 
graphic  techniques,  spanning  many  length  scales,  will  play  a  critical  role  in  the 
development  of  nanotechnology. 

5.2  Optical  Lithography 

The  starting  point  for  a  discussion  of  optical  lithography  is  typically  framed  in 
terms  of  the  Rayleigh  resolution  criteria  that  describe  the  resolution  and  depth-of- 
focus  of  an  optical  system.  These  are: 


CD  =  ATi 


A 

NA 


(5.1) 


Table  5.1  ITRS  key  lithography-related  characteristics  by  product  type 


Year  of  Production 

2001 

2002 

2003 

2004 

2005 

2006 

2007 

2010 

2013 

2016 

DRAM  '/2  Pitch 
(nm) 

130 

115 

100 

90 

80 

70 

65 

45 

32 

22 

MPU  V2  Pitch  (nm) 

150 

130 

107 

90 

80 

70 

65 

45 

32 

22 

MPU  Printed  Gate 
Length  (nm) 

90 

75 

65 

53 

45 

40 

35 

25 

18 

13 

MPU  Physical  Gate 
Length  (nm) 

65 

53 

45 

37 

32 

28 

25 

18 

13 
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and 


DOF  =  k 


2  2[l-cos(NA)]  K2NA2 


(5.2) 


where  CD  is  the  critical  dimension  or  smallest  printed  feature  (line  or  space), 
DOF  is  the  depth-of- focus,  X  is  the  optical  wavelength,  NA  is  the  numerical  aper¬ 
ture  of  the  optical  system  and  Ki  and  k2  are  constants,  typically  of  order  unity. 
These  equations  describe  the  aerial  image  resolution  and  depth-of-focus  of  a  con¬ 
ventional,  incoherently  illuminated,  optical  system.  Innovations  in  both  the  mask 
and  the  illumination  and  imaging  systems  allow  some  improvement,  usually 
denoted  as  a  smaller  Ki  in  the  context  of  Eq.  (5.1).  These  will  be  described  below. 
As  we  will  see,  this  is  only  a  part  of  the  story.  Equally  important  is  the  interaction 
of  the  actinic  radiation  with  the  photoresist.  Nonlinearities  in  this  interaction  can 
be,  and  are  being,  used  to  further  extend  the  accessible  range  of  optical  li¬ 
thography. 

Rayleigh  first  obtained  these  equations  in  connection  with  the  resolving 
power  of  telescopes  used  for  astronomical  observations.  For  what  follows,  it  is 
worth  noting  that  the  resolution  limit  [Eq.  (5.1)]  was  intended  to  refer  not  to  the 
width  of  a  single  object,  but  to  the  ability  to  distinguish  two  separate  objects. 
“According  to  this  (Rayleigh’s)  criterion,  two  images  are  just  resolved  when  the 
principal  maximum  of  one  coincides  with  the  first  minimum  of  the  other.”4  The 
physical  basis  of  these  equations  is,  of  course,  the  diffraction  of  light.  For  a  point 
source  at  infinity,  the  radial  dependence  of  the  intensity  distribution  in  the  focal 
plane  of  an  unaberrated  optical  system  is  given  by 


m= 2 


JjjlxHAr/X) 

2^-NA  r/A 


(5.3) 


where  J\  is  the  Bessel  function  of  the  first  order.  The  Rayleigh  resolution  crite¬ 
rion  is  that  the  second  peak  should  be  at  the  first  zero  of  /(r)  or  r0  =  0.61A/NA 
which  corresponds  to  Eq.  (5.1)  with  Ki  =  0.61.  The  important  point  is  that  the 
Rayleigh  resolution  equation  refers  to  the  ability  to  distinguish  that  there  are  two 
spots,  not  to  the  size  of  a  single  spot.  This  has  critical  implications  for  the  limits 
of  lithography  as  we  shall  see  below. 

Three  approaches  to  reducing  the  CD  are  suggested  by  Eq.  (5.1):  reduce  the 
wavelength,  increase  the  optical  system  NA,  and  reduce  Ki.  Remarkable  progress 
has  been  made  in  each  of  these  directions.  As  noted  above,  the  wavelength  has 
progressed  from  432  nm  (G-line  of  Hg)  to  193  nm  (ArF  excimer  laser).  There  are 
substantial  efforts  underway  to  further  reduce  the  wavelength  to  157  nm  (F2  ex¬ 
cimer  laser);  as  well  as  preliminary  investigations  of  even  shorter  wavelengths 
such  as  126  nm  (Ar2  excimer)  and  121  nm  (hydrogen  Lyman-a).  Each  of  these 
wavelength  shifts  must  be  accompanied  by  an  extensive,  and  expensive,  materi¬ 
als  effort  to  establish  both  optical  and  photoresist  materials  suitable  for  the  new 
wavelength.  All  of  the  above  wavelengths  are  in  spectral  regions  where  both 
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transmissive  and  reflective  materials  are  available,  although  good  optical  proper¬ 
ties  and  material  durability  under  actinic  radiation  clearly  get  more  difficult  to 
achieve  with  each  generation.  There  is  also  a  large,  worldwide  effort  underway 
on  extreme  ultraviolet  lithography  (EUV)  using  a  13-nm  wavelength  laser- 
produced  plasma  source.  At  these  short  wavelengths,  a  fully  reflective  system 
must  be  used.  The  main  point  of  this  chapter  is  to  discuss  the  limits  of  optical 
lithography,  so  EUV  will  not  be  further  discussed  except  to  note  that  EUV  is  an 
optical  imaging  technology  and  all  of  the  concepts  introduced  herein  apply  in  this 
short  wavelength  regime  as  well. 

The  next  parameter  is  the  numerical  aperture  (NA)  of  the  optical  system. 
Again,  remarkable  progress  has  been  made  with  NA’s  increasing  from  -0.1  in 
the  first  optical  steppers  to  about  0.7  today,  and  probably  over  0.8  in  the  near  fu¬ 
ture.  The  usual  approximation  that  the  DOF  is  inversely  proportional  to  NA2 
breaks  down  in  this  high  NA  regime  and  the  DOF  saturates  at  about  A/2.  Eq.  5.2 
is  somewhat  oversimplified  as  the  NA  is  image  dependent,  and  Eq.  5.2  is  a  worst 
case.  Each  new  wavelength  has  to  be  introduced  at  an  NA  comparable  to  the  last 
generation  if  any  resolution  enhancement  is  to  be  achieved. 

Finally,  there  is  Ki.  To  some  extent,  K\  is  a  process  control  parameter.  Figure 
1  shows  the  progression  of  the  intensity  profile  across  two  spots  as  Ki  (or  equiva¬ 
lently  the  distance  between  the  spots)  is  varied.  For  Ki  =  1,  the  two  spots  are 
clearly  resolved  with  the  intensity  dip  between  the  spots  reaching  almost  to  zero, 
for  the  Rayleigh  limit,  Ki  =  0.6,  the  intensity  dip  is  just  visible.  If  the  nonlineari- 
ties  in  the  resist  allow  the  region  between  the  spots  to  be  unexposed  and  at  the 
peaks  to  be  exposed,  two  lines  can  be  printed.  This  is  clearly  a  process  control 
issue  as  well  as  a  strictly  imaging  issue.  The  intensity  varies  from  spot  to  spot 
and  from  exposure  to  exposure;  tighter  control  is  required  to  reliably  print  two 
features  for  the  lower  K\.  As  Kx  is  further  decreased,  the  two  spots  merge  into  a 
single  spot  and  there  is  no  way  to  resolve  them.  As  we  will  see  below,  it  is  possi¬ 
ble  to  reduce  Ki  further  by  modifying  the  optical  system.  Several  different  ap¬ 
proaches  to  this  end  have  been  investigated.  They  involve  variations  to  the  mask, 


Position  - =>• 


Figure  5.1  Intensity  profiles  of  two  spots  in  a  imaging  system  as  a  function  of  Ki. 
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to  the  illumination  system,  or  to  the  imaging  optics.  Figure  5.2  shows  a  generic 
lithography  system.  The  various  “knobs”  that  have  been  explored  to  impact  the 
resolution  are  indicated  in  the  brackets.  These  will  be  discussed  below  after  some 
additional  background. 

5.3  Fundamental  limits  of  optical  lithography. 

Fourier  optics5  provides  a  good  pathway  to  understanding  the  ultimate  limits 
of  lithography.  The  highest  spatial  frequency  that  can  be  supported  by  optics  in 
free  space  corresponds  to  two  coherent  counterpropagating  plane  waves  (or  more 
accurately  Gaussian  beams),  which  interfere  with  a  period  of  A/2.  It  is  possible  to 
approach  this  limit  very  closely  with  a  relatively  simple  interferometric  lithogra¬ 
phy  (IL)  set-up  where  the  two  coherent  optical  beams  are  incident  on  a  wafer  at 
angles  of  ±6  equally  disposed  about  the  surface  normal  giving  a  period  of 
A/2sin#6"9  As  an  example,  Fig.  5.3  shows  a  photoresist  grating  at  a  period  of  108 
nm  written  using  a  213-nm  coherent  laser  source  (fifth  harmonic  of  a  1.06-pm 
YAG  laser).  This  corresponds  to  Q~  80°  or  NA  =  sin#~  0.985.  For  IL  the  aerial 


Source  Illumination  Mask  Imaging  Wafer 

System  Lens 


[A,]  [OMUL]  [OPC,  PSM]  [Pupil  plane  [Photoresist 

filters,  ex  process 
Higher  NA]  nonlinearities] 

Figure  5.2  Generic  lithography  system  consisting  of  a  source,  an  illumination  system, 
a  mask,  an  imaging  optical  system  and  a  photoresist  coated  wafer.  Techniques  for  en¬ 
hancing  the  resolution  are  indicated  in  the  brackets. 
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Figure  5.3  108-nm  pitch  photoresist  grating  written  at  A  =  213  nm  and  NA  -  0.985. 
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image  is  simply 


I(x)  =  I  c 


2;zNA  2;zNA 

+i - x  —i - x 

?  /I  +  e  2 


=  2Ir 


1  +  COS 


47iNA 


V 


A 


(5.4) 


The  corresponding  frequency  space  picture  is  shown  in  Fig.  4.  The  aerial  im¬ 
age  contains  a  dc  term  and  the  positive  and  negative  frequencies  of  the  cosine. 
The  very  high  NA  puts  these  frequencies  very  close  to  the  limiting  frequencies  of 
±2/A  represented  by  the  darker  (inner)  circle.  The  outer  circle  represents  the  in¬ 
creased  spatial  frequency  space  that  is  available  for  optical  propagation  in  a  me¬ 
dium  of  refractive  index  n.  For  IL  exposures  such  as  shown  in  Fig.  5.3,  immer¬ 
sion  is  a  relatively  simple  optics  issue  and  indices  of  -  1.5  are  feasible,  even  at 
the  very  shortest  current  DUV  wavelength  of  157  nm. 10,11  For  an  imaging  system, 
immersion  will  be  much  more  problematic. 

Also  shown  in  Fig.  5.4  outside  of  both  optics  bandpass  circles  are  two  addi¬ 
tional  frequency  components  at  ±4NA/A  that  arise  because  of  nonlinearities  in  the 
exposure  and  develop  process.  As  is  seen  in  Fig.  5.3,  the  final  resist  profiles  ex¬ 
hibit  nearly  vertical  sidewalls,  corresponding  to  this  higher  frequency  content. 
Thus  the  result  of  the  total  photoresist  process  -  application,  prebake,  exposure, 
postbake,  develop  -  is  to  add  substantial  higher  harmonic  content  into  the  final 
image  compared  with  the  strictly  band-limited  aerial  image.  These  nonlinearities 
allow  extension  of  optics  beyond  the  single-exposure  linear  systems  limits  as  will 
be  discussed  in  Sect.  5.8. 

Resist  processing  is  a  complex,  3D  chemical  and  physical  process  with 
nonlocal  effects  due  to  diffusion  and  dissolution  kinetics.  A  simplified  model12, 
that  neglects  these  nonlocal  effects,  is  sufficient  for  the  purposes  of  this  presenta- 


Figure  5.4  Frequency  space  representation  of  the  grating  in  Fig.  5.3.  There  are  aerial 
image  components  at  dc  (0)  and  at  ±2NA/A.  The  maximum  frequency  permitted  by 
the  optical  system  is  2/A,  or  2n/A  for  a  transmission  medium  of  index  n.  Process 
nonlinearities  add  higher  harmonics. 
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tion.  In  this  model  the  final  resist  thickness,  T(x ),  is  given  by 


T(x)  =  max 


r  l-  eF(x)/Fo  ^ 

n 

'T0 

1- 

,  \-e~x  . 

,o> 

_ 

V  J 

_ 

> 

(5.5) 


where  F(x)  is  the  local  optical  fluence  (J/cm2);  F0  is  the  fluence  to  clear  the  resist, 
and  n  is  a  coordination  number  that  describes  the  resist  nonlinearity.  Figure  5 
shows  this  function  for  various  values  of  n.  Novolak-based13  I-line  (365  nm)  re¬ 
sists  typically  have  n’s  of  -  5  to  6.  Today’s  high-speed  chemically-amplified  re¬ 
sists,  as  a  consequence  of  the  large  thermal  amplification  inherent  in  their  opera¬ 
tion,  have  much  higher  effective  n’ s.  To  a  first  approximation,  it  is  sufficient  to 
model  their  response  as  the  sigmoidal  threshold  function  shown  for  n  —>  <*>. 

The  result  of  applying  this  sigmoid  response  function  to  the  sine-wave  aerial 
image  is  the  square  wave  response  observed  in  Fig.  5.3  with  the  frequency  con¬ 
tent  show  in  Eq.  (5.5)  where  £#is  the  nonlinear  operator  associated  with  the  proc¬ 
essing.  Note  that  this  frequency  content  extends  well  beyond  the  second- 
harmonic  shown  in  Fig.  5.4,  decaying  only  as  1//  where  /  is  the  harmonic  number. 


r(x)  =  ^[/(x)]  =  r0^^X 
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This  expression  clearly  highlights  the  higher  harmonics  added  by  the  processing. 
Other  post-develop  processing  also  adds  higher  harmonic  content  to  the  final  pat¬ 
tern.  Examples  are  an  isotropic  oxygen  plasma  ash  to  thin  the  resist  lines,  or  pat¬ 
tern  transfer  into  the  underlying  materials.  These  and  other  fabrication  steps  are 
routinely  used  in  the  semiconductor  industry  as  standard  manufacturing  proc¬ 
esses.  All  of  them  modify  the  frequency  content  of  the  pattern,  most  often  adding 
higher  harmonics. 

These  higher  harmonics  offer  the  possibility  of  extending  optical  lithography 
beyond  the  linear  systems  limits  imposed  by  diffraction.  The  difficulty  is  that  all 


Figure  5.5:  Resist  response  for  different  coordination  numbers. 
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of  the  harmonics  are  in  phase  with  the  fundamental  frequency  component  and  are 
generally  smaller  in  amplitude,  so  the  impact  of  the  nonlinearities  is  to  sharpen 
the  image,  but  not  to  increase  the  pattern  density  beyond  the  Rayleigh  limit.  In 
the  next  section  we  will  describe  a  strategy  for  increasing  the  pattern  density,  still 
with  IL  and  periodic  patterns  as  the  example.  The  application  to  arbitrary  patterns 
will  be  discussed  later  in  the  chapter. 


5.4  Nonlinear  Lithographic  Processes  to  Extend  the  Pattern  Density 


In  the  previous  section,  we  showed  that  diffraction  limits  the  one-dimensional 
pattern  density  in  a  single  exposure  to  -  A/NA,  while  the  nonlinearities  in  the 
processing  results  in  many  higher  harmonics  extending  in  frequency  well  beyond 
the  diffraction  limits  of  Eq.  (5.1).  These  nonlinearities  can  be  used  to  extend  the 
available  pattern  density,14  leading  to  the  title  of  this  chapter.  The  key  is  to  apply 
the  nonlinearities  between  exposure  steps  and  combine  images  in  such  as  way  as 
to  reduce  or  eliminate  the  lower  frequency  components,  leaving  the  higher  den¬ 
sity  of  structures. 

A  one-dimensional  example  of  a  spatial  period  division  is  shown  in  Fig.  5.6 
to  illustrate  the  general  concept.  The  top  panel  Fig.  5.6(a)  shows  a  photoresist 
pattern  atop  a  sacrificial  layer  on  a  substrate.  The  period  of  the  pattern  is  limited 
by  the  optics  to  A/2NA;  the  CD,  in  contrast,  is  not  limited  by  the  optics  and  is 
chosen  to  be  less  than  A/8NA.  As  discussed  above,  this  does  not  violate  any  opti¬ 
cal  limits.  In  Fig.  5.6(b),  the  photoresist  pattern  has  been  transferred  into  the  sac¬ 
rificial  layer,  in  the  experimental  case  shown  below  the  sacrificial  layer  was 
Si3N4  and  lift-off  and  etching  was  used  to  transfer  the  pattern  into  this  layer.  In 
Fig.  5.6(c),  a  second  photoresist  layer  has  been  spun,  exposed  and  developed. 
The  period  and  CD  is  the  same  as  in  the  first  case,  but  the  pattern  has  been 
shifted  by  A/4NA  to  interpolate  the  second  pattern  with  the  first.  Finally  this  pat¬ 
tern  is  also  transferred  into  the  sacrificial  layer  [Fig.  5.6(d)],  that  layer  is  then 
used  as  a  mask  to  transfer  the  total  pattern  into  the  substrate  layers,  and  finally 
the  sacrificial  layer  is  removed  leaving  a  final  pattern  at  a  period  of  A/4NA,  twice 
as  dense  as  the  single  exposure  optical  limit. 

The  mathematics  follows  simply, 
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Figure  5.6  Sequence  of  steps  to  double  the  ID  pattern  density  by  spatial  period  division. 


The  result  is  the  combination  of  two  exposures  with  the  nonlinearities  applied  to 
each  exposure  independently.  The  symbol  used  to  represent  the  combination  is 
left  ambiguous;  in  this  case  the  combination  is  simple  addition.  In  another  case, 
for  example  the  use  of  two  resist  layers  sensitive  at  different  wavelengths,  it 
might  be  multiplication.  The  pi  A  phase  shift  in  the  second  exposure  results  in  a 
factor  of  (-1/  in  the  Fourier  coefficients,  with  the  net  result  that  all  of  the  odd 
terms  cancel  leaving  a  sin(x)/x  pattern  at  twice  the  original  period  or  half  the 
pitch. 

A  demonstration  of  this  process  is  shown  in  Fig.  5.7.  The  top  panel,  Fig. 
5.7(a),  shows  the  etched  grooves  in  a  sacrificial  nitride  layer  with  a  period  p  of 
365  nm  and  a  CD  of  -  50  nm.  These  grooves  were  made  by  the  process  outlined 
in  Fig.  5.6:  using  positive  photoresist,  inverting  the  pattern  with  a  Cr  liftoff  step, 
using  the  Cr  as  a  hard  mask  to  etch  the  groove  into  the  nitride  layer,  and  finally 
stripping  the  Cr.  The  line-edge  roughness  is  a  result  of  the  Cr  grains  and  is  not 
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(a)  (b)  (c) 


Figure  5.7  Demonstration  of  spatial  period  division  using  multiple  exposures  and 
nonlinearities,  a)  50-nm  wide,  360-nm  pitch  grooves  etched  into  nitride  mask  layer;  b) 
180-nm  pitch  nitride  grooves  after  interpolating  second  360-nm  pitch  grating;  c)  final 
~  1 80-nm  pitch  grating  etched  into  Si. 


inherent  in  the  lithography  process.  Figure  5.7(b)  shows  the  nitride  film  after  the 
second  pattern  transfer,  the  period  has  been  halved  from  380  nm  to  180  nm.  Fi¬ 
nally  the  bottom  panel,  Fig.  5.7(c),  shows  the  result  of  a  pattern  transfer  into  the 
<110>  Si  using  anisotropic  KOH  etching  and  stripping  of  the  sacrificial  nitride 
layer.  The  alternations  in  both  spacing  between  lines  and  linewidths  are  indica¬ 
tions  of  imperfections  in  the  pi  A  shift  and  the  CD  of  the  second  exposure,  none¬ 
theless,  this  result  proves  the  concept. 

5.5  Arbitrary  patterns  -  Single-Exposure  Limits  of  Optics 

The  discussion  so  far  has  been  presented  for  IL,  using  periodic  patterns  whose 
Fourier  content  is  restricted  to  a  small  number  of  points  in  spatial  frequency 
space.  Realistic  integrated  circuit  patterns  are  much  more  complex,  containing 
regions  of  isolated  features,  regions  of  dense  patterns,  shapes  such  as  ell’s  and 
tee’s,  etc.  The  consequence  in  spatial  frequency  space  is  that  there  are  a  great 
many  spatial  frequencies  in  a  typical  pattern,  theoretically  as  many  as  there  are 
pixels  in  the  image  or  ~4xl010  for  a  100-nm  CD  and  a  2x2  cm2  field.  Realisti¬ 
cally,  there  are  fewer  frequencies  in  typical  integrated  circuit  patterns,  which  are 
more  regular  than  a  totally  random  Rorschach  pattern,  but  still  many  more  than 
could  addressed  by  any  conceivable  summation  of  individual  IL  exposures. 

Imaging  addresses  this  issue  by  capturing  a  large  number  of  spatial  frequen¬ 
cies  in  a  single  exposure.  The  lens  is  a  low-pass  filter  that  restricts  the  spatial  fre¬ 
quency  components  collected  to  a  smaller  range  than  that  available  from  the 
transmission  medium.  The  photoresist  is  a  square  law  detector,  responding  to  the 
intensity  not  the  electric  field,  and  adds  additional  nonlinearities  as  discussed 
above.  For  coherent  imaging  the  low-pass  constraint  is  NAA,  for  incoherent  im¬ 
aging  the  limit  is  2NA/A,.  Numerous  techniques  have  been  introduced  to  capture 
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more  of  the  spatial  frequencies,  and  thus  produce  a  better  image  at  higher  resolu¬ 
tion;  these  are  discussed  in  Sections  5.5  and  5.6  for  conventional  imaging  and 
resolution  enhancement  techniques,  respectively.  In  this  section,  the  ultimate  lim¬ 
its  of  optics  are  presented,  i.  e.,  the  arbitrary  image  corollary  to  the  A/4  resolution 
limit  of  IL  for  dense  patterns. 

Figure  5.8  shows  an  example  of  a  test  pattern,  consisting  of  a  series  of  nested 
“ells”  with  isolated  lines  along  with  a  large,  low-frequency  box  that  is  often  used 
to  evaluate  lithographic  system  limits.  The  calculation  was  carried  out  simply  by 
Fourier  transforming  the  pattern,  applying  a  low-pass  filter  at  the  optical  limit  of 
2/A,  transforming  back  into  real  space,  and  applying  a  sigmoid  nonlinearity  to 
mimic  the  resist  response.  The  main  impact  of  this  processing  for  the  largest  pat¬ 
tern,  CD  =  A/3,  is  a  slight  rounding  of  the  comers  of  the  “ell”  patterns.  For  the 
CD  =  A/3.5  pattern,  the  rounding  is  more  pronounced,  the  corners  of  the  “ell’s” 
are  less  well  defined,  and  some  edge  roughness  is  apparent.  For  the  CD  =  A/4 
pattern,  the  edge  roughness  is  pronounced  and  a  hole  has  appeared  in  the  large 
box 

The  reason  for  this  progressive  deterioration  of  the  image  is  evident  from 
Fig.  5.9  which  shows  the  amplitudes  of  the  Fourier  coefficients  for  this  pattern, 
scaled  to  the  optical  frequency  space  for  the  CD  =  A/3  case.  There  is  spectral  in¬ 
tensity  at  low  frequency,  corresponding  to  the  overall  shape  of  the  test  structure, 
and  peaks  in  the  fx-  and  ^-directions  corresponding  to  the  repetitive  pattern  of  the 
nested  ‘ells.’  For  this  Manhattan  geometry  integrated  circuit  pattern,  which  is  laid 
out  on  a  rectangular  grid,  the  majority  of  the  spectral  intensity  is  in  the  vertical 
and  horizontal  directions.  The  spectral  content  in  each  of  these  directions  is  not  a 
simple  8-function  as  it  would  be  for  a  perfectly  repetitive  stmcture,  but  rather 
shows  some  breadth  associated  with  the  information  that  there  are  a  limited  num¬ 
ber  of  repeating  lines  with  a  limited  extent.  The  spatial  frequency  scale  of  this 
stmcture  is  on  the  order  of  1/(^CDL),  where  n  is  the  number  of  nested  lines  in  the 
radial  direction  and  L  is  the  length  of  the  lines  in  the  perpendicular  direction.  For 
CD  =  A/3,  all  of  this  information  is  contained  within  the  bandpass  limits  of  op- 


CD  =  A/3  CD  =  A/3 .4  CD=A/4 


Figure  5.8  Calculated  real  space  images  for  a  dense  (half-pitch)  lithography  test  pat¬ 
tern  with  equal  lines  and  spaces  for  three  values  of  the  CD. 
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Figure  5.9  Spatial  frequency  space  plot  of  the  Fourier  coefficient  amplitudes  for  the 
pattern  of  Fig.  5.8,  scaled  to  the  optical  wavelength  for  the  A/3  case.  The  solid  outer 
circle  is  the  optical  limit  of  2/A  the  inner  dotted  circles  are  at  2NA/A  and  NA/A,  respec¬ 
tively  for  a  NA  of  0.7. 


tics,  and  all  of  it  is  retained  in  the  final  image.  It  is  important  to  note  that  the  ae¬ 
rial  image  for  this  case  is  sinusoidal,  only  the  fundamental  frequency  defining  the 
period  of  the  nested  'ells’  is  transmitted  by  the  optics.  The  nonlinearity  of  the 
resist,  as  described  above,  restores  the  higher  harmonics  and  allows  a  developed, 
photoresist  pattern  with  vertical  sidewalls.  For  smaller  CDs,  the  Fourier  space 
plot  expands  accordingly  and  the  higher  frequencies  approach  the  optical  cutoff. 
For  CD  =  A/4,  the  center  frequency  of  each  of  the  high  frequency  spectral  areas  is 
just  at  the  cutoff,  and  the  higher  frequencies  are  not  included  in  the  image.  The 
loss  of  these  high  frequency  terms  is  reflected  in  the  deterioration  of  the  image. 

From  this  discussion,  the  ultimate  single-exposure  resolution  of  optics  is  lim¬ 
ited  to  a  pattern  dependent  dense  (half-pitch)  CD  of  -  A/3.  The  techniques  to  ap¬ 
proach  this  limit  are  discussed  below.  Given  the  enormous  effort  being  applied 
and  the  strong  record  of  advances  made  to  date,  it  seems  reasonable  to  postulate 
that  this  limit  will  be  closely  approached  as  the  field  progresses.  To  provide  some 
definiteness,  this  limit  is  a  dense  CD  of  64  nm  for  a  wavelength  of  193  nm  (ArF) 
and  of  52  nm  for  a  wavelength  of  157  nm  (F2).  Further  reductions  are  possible 
with  the  use  of  immersion  and  of  multiple  exposures  with  nonlinear  mixing,  cov¬ 
ering  essentially  all  of  the  range  of  feature  sizes  demanded  by  the  present  road¬ 
map. 

5.6  Conventional  Imaging 

The  illumination  scheme  has  a  strong  impact  on  conventional  imaging.5  The  lim¬ 
iting  cases  are  coherent  and  incoherent  illumination.  In  coherent  illumination,  a 
single  plane  wave  (or  more  generally  a  Gaussian  beam  with  a  radius  of  curvature 
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matched  to  the  optical  system)  is  incident  on  the  mask,  most  often  at  normal  in¬ 
cidence.  The  optical  system  then  acts  as  a  low  pass  filter  transmitting  all  of  the 
Fourier  components  scattered  by  the  mask  up  to  a  frequency  of  NA/A  at  unity 
amplitude,  and  completely  eliminating  all  higher  frequencies.  The  square  of  the 
sum  of  these  fields  then  describes  the  aerial  image  at  the  wafer  plane.  Finally  the 
resist  nonlinearity  again  sharpens  the  profile. 

For  incoherent  illumination,  a  large  number  of  incoherently  related  waves  il¬ 
luminate  the  mask  with  a  uniform  distribution  of  incident  angles,  the  field- 
squared  image  is  obtained  for  each  of  these  plane  waves  and  the  photoresist  re¬ 
sponds  to  the  sum  of  these  images.  Higher  frequencies  are  included  in  the  image, 
up  to  2NA/A,  as  a  result  of  the  off-axis  illumination.  However,  the  amplitude  of 
these  high  frequency  terms  is  reduced  since  they  are  present  in  only  a  small  sub¬ 
set  of  the  images.  This  gives  rise  to  the  well-known  modulation  transfer  function, 
MTF,  of  an  optical  system  that  is  -  0.5  at  NA/A  and  goes  to  zero  at  2NA/A.  Be¬ 
cause  each  of  the  individual  images  must  be  positive  everywhere,  there  is  a 
higher  baseline  exposure  over  the  entire  image.  Figure  10  shows  the  resulting 
images  for  Ki=  0.6  at  an  NA  of  0.7. 

The  coherent  image  contains  all  frequencies  up  to  NA/A  with  the  same  inten¬ 
sity  ratios  as  in  the  Fourier  transform  of  the  mask  as  a  result  of  the  interference  of 
the  zero-order  transmission  of  the  mask  with  the  scattered  fields  transmitted 
through  the  lens.  In  addition,  the  square-law  intensity  detection  results  in  addi¬ 
tional  frequency  components  ranging  from  zero  to  2NA/A.  These  components 
arise  from  the  autocorrelation  of  the  imaging  fields  and  do  not  necessarily  corre¬ 
spond  to  image  spatial  frequencies.  An  advantage  of  coherent  imaging  is  that  all 
of  the  Fourier  components  interfere  coherently,  the  requirement  is  only  that,  at 
any  point  on  the  image,  the  total  intensity  must  be  positive.  Coherent  imaging 
tends  therefore  to  have  the  lowest  baselines,  or  unwanted  intensity  across  the 
unexposed  regions  of  the  image.  However,  the  hard  cutoff  of  the  lens  tends  to 
result  in  ringing  of  the  intensity  leading  to  line-edge  roughness,  as  can  be  seen  in 
Fig.  5.10.  For  incoherent  illumination,  each  image  spatial  frequency  is  multiplied 
bv  a  modulation  transfer  function  (MTF1  that  is  unitv  at  zero  freauencv.  is  -0.4  at 


Figure  5.10  Coherent  (left)  and  incoherent  images  at  Ki  =  0.6  and  NA  =  0.7  (160  nm 
at  A  =  193  nmt. 
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modulation  transfer  function  (MTF)  that  is  unity  at  zero  frequency,  is  -0.4  at 
NA /A  and  goes  to  zero  at  2NA/A.  The  net  result  is  that  incoherent  illumination 
contains  higher  image  frequencies,  but  with  a  lower  transfer  function  at  mid- 
spatial  frequencies  than  coherent  illumination.  The  ringing  associated  with  the 
frequency  cutoff  of  coherent  illumination  is  eliminated  by  the  gradual  reduction 
of  the  frequency  response  for  incoherent  illumination.  Partial  coherence  provides 
a  different  balance  of  these  frequency  responses.  In  any  case,  conventional  imag¬ 
ing  does  not  provide  any  spatial  frequencies  above  2NA/A  in  the  aerial  image. 
For  the  cases  of  coherent  and  partially  incoherent  illumination,  it  is  not  possible 
to  simply  define  an  image  intensity  MTF,  rather  the  result  depends  on  the  details 
of  the  image  because  of  the  autocorrelation  contributions  discussed  above.  More 
detailed  discussion  of  these  issues  is  provided  in  Refs.  1,  4,  and  5.  For  simplicity, 
all  of  the  results  presented  in  this  paper  were  calculated  with  a  simple,  scalar  im¬ 
aging  theory.  Particularly  at  high  NAs  it  will  be  important  to  extend  these  results 
to  a  full  vector  imaging  code. 

5.7  Resolution  Enhancement  Techniques  to  Reduce  K\. 

A  number  of  techniques  have  been  introduced  to  reduce  Ki  below  the  Rayleigh 
limit  of  -  0.6.  These  are  optical  proximity  correction  (OPC),  off-axis  illumination 
(OAI),  phase-shift  masks  (PSM),  and  imaging  interferometric  lithography  (IIL). 
The  basic  idea  of  each  of  these  techniques  is  to  provide  a  higher  frequency  con¬ 
tent  in  the  final  image.  A  more  complete  discussion  of  the  relationship  between 
these  techniques,  and  of  their  Fourier  optic  interpretation,  has  been  presented 
elsewhere.15,16  Only  a  brief  overview  is  included  here. 

5.7.1  Optical  Proximity  Correction  (OPC) 

With  OPC,  sub-resolution  features  (corresponding  to  high  spatial  frequencies)  are 
added  to  the  mask,  significantly  increasing  its  complexity,  to  boost  the  magni¬ 
tudes  of  the  high  spatial  frequencies  in  the  Fourier  transform  beyond  those  of  the 
image.  For  example  these  can  be  sub-resolution  lines  that  do  not  print  but  make  a 
isolated  line  appear  more  like  a  dense  array  of  lines  to  address  dense-isolated  fea¬ 
ture  biases.  These  magnitudes  are  attenuated  by  the  optical  system  but  are  still 
larger  than  they  would  have  been  without  the  OPC,  resulting  in  an  image  closer 
to  the  desired  pattern,  but  no  longer  simply  related  to  the  mask  pattern.  This  ap¬ 
proach  is  appropriate  for  partially  coherent  illumination.  For  coherent  illumina¬ 
tion,  the  electric  field  transfer  function  is  already  unity  up  to  NA /A  and  there  is 
nothing  to  be  gained.  It  is  intuitively  clear  that  this  approach  can  deal  with  fre¬ 
quencies  near  NA/A  where  the  incoherent  MTF  is  -  0.5  but  is  inadequate  to  add 
substantial  frequency  content  out  towards  2NA/A  where  the  MTF  is  very  low. 
Because  of  these  restrictions  in  frequency  space,  OPC  extends  Ki  to  -  0.5  but  not 
below.  As  a  result  of  diffraction,  these  changes  are  nonlocal;  e.g.  the  features  im¬ 
pact  an  area  of  -  A2  that  may  encompass  more  than  the  single  desired  feature.  As 
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a  result,  OPC  modeling  is  complex  and  numerically  intensive.  The  pattern  is  no 
longer  the  “what  you  see  is  what  you  get”  paradigm  that  was  sufficient  for  lower 
resolutions.  The  complexity  of  OPC  masks  can  be  as  much  as  lOx  greater  (more 
features)  than  conventional  masks,  increasing  the  burden  and  cost  of  fabricating 
the  masks. 

5.7.2  Off-Axis  Illumination  (OAI) 

In  OAI,  the  (coherent  or  partially  coherent)  illumination  beam  is  tilted  away  from 
the  optical  system  normal  to  allow  higher  spatial  frequencies  to  pass  through  the 
optical  system.  These  tilts  are  restricted  to  a  fraction  r|  <  1  of  NA/A  in  order  to 
transmit  the  zero-order  beam  through  the  optical  system  and  the  maximum  spa¬ 
tial  frequency  is  given  by  (l+r|)NA/A  <  2NA/A.  Both  annular  and  quadrupole 
illumination  have  been  explored.  As  is  the  case  with  any  partial  coherence,  the 
baseline  is  always  higher  than  for  coherent  illumination  since  each  image  can 
only  have  positive  intensities.  Figure  5.11  shows  the  basic  concept  of  OAI  for 
illumination  of  a  simple  grating.  The  grating  pitch  is  greater  than  NA/A,  but  less 
than  (l+r|)NA/A.  At  normal  incidence  both  first  order  diffracted  beams  are  out¬ 
side  of  the  lens  pupil,  but  for  off-axis  illumination,  one  of  the  diffracted  orders  as 
well  as  the  zero-order  transmitted  beam  are  collected  by  the  lens  and  an  image  of 
the  grating  is  formed  by  their  interference  in  the  photoresist. 

An  issue  with  OAI  is  that  the  multiple  exposures  result  in  placing  greater 
weight  on  the  lower  frequencies  as  illustrated  in  Fig.  5.12  which  shows  the  fre¬ 
quency  space  coverage  for  a  quadrupole  illumination  with  an  offset  of  r|  =  0.75. 
The  axis  of  the  quadrupole  is  usually  shifted  45°  relative  to  the  Manhattan  ge¬ 
ometry  axes  since  this  gives  a  higher  response  to  the  dominant  frequency  compo¬ 
nents.  The  numbers  and  the  shading  indicate  the  relative  weighting  of  the  spatial 
frequencies  within  each  area.  These  spectral  overlaps  restrict  Ki  to  ~  0.43.  Pupil 
plane  filters  provide  an  additional  degree  of  freedom  for  OAI.15'18  If  these  over¬ 
laps  are  eliminated  with  pupil  plane  filters,  OAI  can  be  extended  to  Ki  ~  0.3. 


Normal  Incidence  Off-Axis  Illumination 


Figure  5.11  Illustration  of  the  higher  spatial  frequencies  possible  with  off  axis  illumi¬ 
nation. 
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5.7.3  Phase-Shift  Masks  (PSM) 

For  PSM,  the  mask  is  drastically  modified  (to  a  3D  object)  to  enhance  the  ampli¬ 
tudes  of  the  fundamental  frequency  components  (out  to  NA/A)  while  reducing  the 
amplitude  of  the  de-component  of  the  electric-field.  Because  of  the  square-law 
resist  response,  this  dramatically  boosts  the  quadratic  image  intensity  compo¬ 
nents  between  NA/A  and  2NA/A  allowing  Ki’s  of  -  0.35.  As  was  the  case  for 
OPC,  the  simple  relationship  between  the  mask  and  image  pattern  is  lost  requir¬ 
ing  extensive  modeling,  design  and  mask  fabrication  efforts.  Figure  5.13  shows 
the  general  concept  of  phase  shift  for  a  chromeless,  strong  phase-shift  case.  The 
lefthand  column  shows  a  conventional  coherent  imaging  of  a  grating  of  period  p 
that  is  not  captured  by  the  imaging  system.  The  right  column  shows  a  chromeless 
phase-shift  mask  with  a  period  of  2 p,  the  depth  of  the  etching  is  (?z-l)A/2  so  that 
there  is  a  A/2  phase  shift  for  the  transmitted  fields.  The  result  is  that  the  zero- 
order  transmission  is  identically  zero  and  the  diffracted  orders  interfere  as  in  IL 
to  produce  a  grating  at  p.  The  maximum  frequency  that  can  be  printed  is  2NA/A. 
An  issue  with  PSM  is  that  the  mask  is  transformed  dramatically  from  the  conven¬ 
tional  imaging  paradigm  requiring  significant  modeling  efforts.  Strong  phase 


Figure  5.12  Frequency  space  coverage  for  quadmpole  OAI  with  rj  =  0.75.  The  over¬ 
laps  give  greater  weight  to  the  low  frequency  components. 


Binary  Crome  Mask  Cromeless  Phase 

Shift  Mask 


Figure  5.13  Principle  of  phase  shift  masks.  The  maximum  spatial  frequency  is  2NA/A. 
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shift  masks  produce  extra  lines  that  need  to  be  compensated  for  with  a  second 
mask  and  a  second  pass  through  the  lithography  tool. 

5.7.4  Imaging  Interferometric  Lithography  (IIL) 

IIL  is  a  newly  introduced19,20  imaging  concept  that  is  closely  related  to  OAI,  with 
the  major  difference  that,  for  relatively  low  NA  optics,  the  offsets  are  chosen  be¬ 
yond  NA/A  requiring  a  separate  optical  system  to  arrange  for  the  zero-order  ref¬ 
erence  beam  at  the  wafer.  For  continuous  coverage  along  the  principal  (fx,fy )  fre¬ 
quency  axes,  the  maximum  frequency  at  the  wafer  is  as  high  as  3NA/A  allowing 
Ki’s  of  as  low  as  0.2.  With  the  advent  of  very  high  NA  lenses,  IIL  is  essentially 
OAI  with  the  addition  of  pupil  plane  filters  to  eliminate  the  multiple  coverage  of 
frequency  space  that  favors  the  low  frequencies  as  discussed  above.  As  with 
OAI,  the  mask  and  image  patterns  are  identical  (except  for  magnification  scaling) 
and  traditional  binary  chrome-on-glass  masks  are  used. 

Significant  benefits  of  IIL,  especially  for  high-NA  optics,  include  the  ability 
to  optimize  the  polarization  for  the  highest  spatial  frequencies,  and  to  vary  the 
relative  intensities  of  the  multiple  exposures  to  optimize  the  image  contrast.  In 
addition,  the  ability  to  vary  the  pupil  aperture  for  different  exposures  gives  some 
control  over  the  unwanted  autocorrelation  (dark-field  imaging)  contributions  to 
the  image.  Figure  5.14  shows  the  limiting  results  for  an  NA  of  0.9  and  a  CD  of 
A/3  (Ki  =  0.3)  for  incoherent  illumination,  coherent  illumination,  and  imaging 
interferometric  lithography.  The  top  panels  show  the  calculated  image  results  for 
the  test  pattern,  the  bottom  panels  show  the  corresponding  frequency  space  cov¬ 
erages.  Incoherent  illumination  resolves  the  individual  nested  ‘ells’  but  clearly 
suffers  from  a  serious  loss  of  linewidth  and  foreshortening  of  the  isolated  lines. 
The  outermost  6 ell’  is  also  significantly  narrower  than  the  others.  The  frequency 
space  coverage  is  indicated  by  the  shaded  circle  with  an  NA  of  0.9.  For  coherent 
illumination,  the  pattern  is  totally  unresolved.  The  spatial  frequency  correspond¬ 
ing  to  the  line/space  pattern  is  1.5/A  >  NA/A  and  is  not  imaged.  Finally,  IIL  with 
pupil  plane  filters  does  a  very  good  job  of  imaging  the  pattern.  The  set  of  filters 
used  for  the  calculation  is  shown  in  the  frequency  space  diagram.  The  central 
exposure  was  at  an  NA  of  0.45  while  the  offset  NAs  were  at  0.9  with  r|  =  1  and 
with  additional  filters  to  eliminate  overlaps.  In  this  case,  all  of  the  beams  are 
transmitted  through  the  lens  pupil,  there  is  no  need  for  the  interferometers  around 
the  outside  of  the  lens. 

5.7.5  Combinations  of  RETs 

Application  of  these  techniques  is  not  mutually  exclusive.  Clearly,  OPC  will  be 
combined  with  the  other  techniques  to  address  remaining  image-specific  issues. 
OAI  and  IIL  are  closely  related.  In  the  context  of  lower  NA  lenses,  IIL  is  OAI 
with  a  sufficiently  large  offset  that  the  zero-order  beam  is  not  captured  by  the 
lens  but  rather  is  reintroduced  after  the  lens.  With  the  advent  of  very  high  NA 
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Figure  5.14  Comparison  of  incoherent  illumination,  coherent  illumination  and  imag¬ 
ing  interferometric  lithography  for  a  CD  of  A/3  and  an  NA  of  0.9  =  0.3). 

lens,  IIL  effectively  reduces  to  OAI  with  the  use  of  pupil  plane  filters  to  elimi¬ 
nate  overlaps  in  the  spatial  frequency  coverage  between  different  exposures  and 
assure  a  uniform  electric  field  transfer  function  across  the  entire  frequency  space. 
All  of  the  beams  now  pass  through  the  lens  so  the  need  for  interferometric  length 
control  is  reduced  to  that  for  the  imaging  system. 


5.8  Nonlinear  Processes  Applied  to  Arbitrary  Patterns 

The  nonlinear  concepts  introduced  in  Sect.  5.2  apply  equally  well  to  the  case  of 
arbitrary  patterns.  The  general  concept  is  to  print  as  much  as  diffraction  allows  in 
a  single  exposure  (Sect.  5.7),  transfer  the  information  to  a  temporary  storage 
layer  (often  a  sacrificial  layer  that  will  be  removed  later),  and  interpolate  subse¬ 
quent  exposures  that  are  also  transferred  into  the  storage  layer.  Finally  use  the 
storage  layer  combining  the  information  from  all  of  the  exposures  to  transfer  the 
complete  pattern  into  the  underlying  layers.  An  example  of  this  process  for  the 
test  pattern  introduced  above  for  a  dense  CD  of  32  nm  (A/6)  with  an  exposure 
wavelength  of  193  nm  is  shown  in  Fig.  5.15.  As  discussed  in  Sect.  5.2,  it  is  criti- 
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Figure  5.15  Model  calculation  for  interpolating  two  images,  each  near  the  single¬ 
exposure  limit  of  optics  to  provide  a  final  pattern  at  twice  the  usual  optics  limits.  The 
calculation  is  for  a  32-nm  dense  CD  at  a  wavelength  of  193  nm.  The  top  panels  show  the 
results  of  the  two  exposures  and  their  combination  for  a  simple  mask  pattern.  The  second 
row  of  panels  shows  similar  results  with  a  small  amount  of  OPC  to  counteract  some  of 
the  edge  ripple  in  the  top  images.  The  third  row  shows  the  Fourier  content  of  the  aerial 
image,  bandpass  limited  at  2/A,  and  the  bottom  panel  shows  the  Fourier  content  of  the 
image  out  to  4/A,  after  the  resist  nonlinearity  is  applied. 

cal  to  apply  the  nonlinearities  separately  to  each  exposure.  If  the  multiple  expo¬ 
sures  are  accumulated  in  a  single  resist  layer,  the  patterns  merge  and  enhanced 
resolution  is  not  realized. 

The  calculation  for  Fig.  5.15  was  based  on  the  full  transmission  bandpass  of 
optics  and  is  similar  to  the  calculations  of  Fig.  5.8.  The  top  panels  show  the  re- 
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suits  of  the  two  exposures  and  their  combination  using  a  straightforward  mask 
without  any  attempt  at  OPC.  Line  edge  variations  are  clearly  evident  for  this 
level  of  exposure.  The  left-most  panels  show  the  result  of  the  first  exposure.  Al¬ 
ternate  lines  of  the  nested  “ell”  are  printed,  so  that  the  pitch  is  the  same  as  in  each 
of  the  single  exposure  examples,  but  the  exposure  dose  is  adjusted  (increased)  to 
get  the  required  CD.  This  is  similar  to  industry  practice  for  logic  chips  that  have  a 
lower  density  than  memory  devices.  The  left-hand  panel  in  the  third  row  shows 
the  aerial  image  frequency  content,  limited  by  the  2/A  by  the  transmission  limit  of 
optics.  The  bottom  left-hand  panel  shows  shows  the  Fourier  content  of  the  final 
image  after  the  resist  nonlinearity  has  been  imposd.  There  is  now  evident  fre¬ 
quency  content  at  both  the  fundamental  (passed  by  the  optics)  and  the  second 
harmonic  (beyond  the  optics  cutoff).  There  are  higher  frequency  components  as 
well  that  are  not  shown  in  the  figure.  The  inner  shaded  circle  denotes  the  2/A  op¬ 
tics  frequency  limit. 

The  middle  panels  show  similar  results  for  the  second  exposure  in  which  two 
of  the  “ells”  and  the  large  box  are  printed.  Because  of  the  low  frequency  content 
of  the  large  box,  the  lower  frequencies  are  emphasized  more  in  the  aerial  image. 
Finally,  the  right-most  panels  show  the  result  of  combining  the  two  images  in  a 
sacrificial  layer.  The  final  image  is  the  desired  dense  CD  image.  The  fundamental 
frequency  components  partially  cancell,  resulting  in  a  relative  emphasis  of  the 
second  harmonic  frequency  content.  It  is  instructive  to  compare  the  frequency 
content  of  Fig.  5.15  with  that  of  Fig.  5.9. 

As  the  linewidth  is  decreased,  e.g.  as  the  resist  threshold  is  shifted  to  lower 
intensities,  the  variations  in  exposure  at  different  positions  on  different  lines— the 
linewidth  ripple— becomes  more  severe.  This  is  evident  by  comparing  Fig.  5.15 
for  CD  =  A/6,  as  compared  with  Fig.  5.8  for  CD  =  A/3.  These  linewidth  variations 
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Figure  5.16  Aerial  image  optical  intensity  along  the  cuts  indicated  by  the  dotted  lines 
in  Fig.  5.15.  The  bottom  trace  is  in  the  absence  of  any  OPC  correction.  The  top  trace  is 
with  the  OPC  illustrated  in  Fig.  5.17.  The  resist  threshold  intensity  (dashed  lines)  is 
also  shown  for  imaging  at  A/3  and  A/6. 
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are  related  to  intrinsic  variations  in  the  aerial  image  rather  than  to  any  process 
variability  in  this  idealized  calculation.  This  result  also  implies  that  process  con¬ 
trol  issues  will  be  more  significant  at  these  high  exposure  levels  as  the  linewidth 
variation  for  a  small  change  in  intensity  becomes  more  severe. 

This  is  illustrated  more  clearly  in  Fig.  5.16  which  shows  the  aerial  image 
intensity  profiles  along  the  cuts  indicated  by  dotted  lines  in  the  left-hand  images 
of  Fig.  5.15,  passing  through  two  vertical  bars  of  the  “ells.”  Because  the 
amplitudes  of  the  two  lines  are  not  precisely  equal,  there  is  some  narrowing  of 
the  second  line  in  the  final  image.  This  can  be  addressed  by  widening  of  the  mask 
pattern  at  this  point  as  shown  in  the  OPC  mask  (Fig.  5.17).  The  frequency  of  the 
intensity  oscillation  is  set  by  the  bandpass  limits  of  optics  and  is  the  same  for 
both  lines,  so  the  intensity  (depth  of  the  null)  is  the  only  possible  adjustment; 
each  line  is  the  same  width.  If  the  threshold  is  moved  up  to  the  mid-intensity 
levels,  as  it  was  for  the  results  of  Fig.  5.8  (e.  g.  for  a  CD  of  A/3)  and  as  shown  by 
the  dash-dot  lines  labeled  “threshold  (CD  -  A/3)”  the  impact  of  these  variations 
on  the  final  printed  image  is  much  less  pronounced.  For  this  calculation,  the 
definition  of  this  OPC  was  an  ad  hoc  process  of  adding  and  subtracting  bars  in 
the  areas  that  showed  the  most  ripple.  An  extensive  investigation  was  not  carried 
out  and  better  results  are  certainly  available;  these  results  are  sufficient  to  define 
the  concent.  The  message  is  that  mask  correction,  mask  CD  control,  and  nrocess 


Figure  5.17  OPC  additions  (black)/subtractions  (white)  to  the  simple  mask  (shown  in 

grey) 
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is  that  mask  correction,  mask  CD  control,  and  process  control  all  will  become 
more  demanding  as  increasing  orders  of  nonlinear  interpolations  are  attempted. 

The  division  of  the  pattern  into  two  (or  more)  exposures  is  essentially  a  map 
coloring  problem,  where  the  rule  is  never  have  two  objects  with  the  same  color 
closer  than  the  Rayleigh  resolution  limit  for  a  single  exposure.  Another  example 
is  shown  in  Fig.  5.18  for  a  typical  SRAM  pattern  level.  In  principle,  the  tech¬ 
nique  can  be  extended  to  multiple  exposures,  and  therefore  to  CDs  much  smaller 
than  the  Rayleigh  criterion.  In  practice,  nonuniformities  and  process  control  is¬ 
sues  will  be  more  significant  the  larger  the  number  of  exposures,  and  hence  the 
thinner  the  lines  required  in  each  exposure  in  comparison  with  the  limiting  pitch 
of  -  A/3  (A/4)  for  arbitrary  (periodic)  patterns. 


Figure  5.18  Example  of  interpolation  of  two  exposures  to  increase  the  final  pattern  den¬ 
sity  for  an  arbitrary  pattern  (a  typical  SRAM  level). 


5.9  Summary  and  Conclusions. 

The  fundamental  limits  of  optical  imaging,  assuming  aberration-free  lenses  have 
been  discussed.  For  a  conventional  illumination  single  exposure,  diffraction  lim¬ 
its  the  ability  to  resolve  two  objects  to  a  of  -0.6  giving  a  dense  (half-pitch)  CD 
of  0.9A  for  an  NA  of  0.7.  However,  in  the  absence  of  an  optical  system,  optics 
can  support  much  higher  spatial  frequencies  of  up  to  2/A  providing  dense  (half¬ 
pitch)  lines  and  spaces  at  a  CD  of  A/4  for  a  fully  periodic  pattern.  For  maskless, 
interferometric  lithography  this  limit  is  support  by  experimental  results.  For  non- 
repetitive  patterns  the  smallest  CD  supported  by  optics  has  to  be  relaxed  some¬ 
what  to  -  A/3  (pattern  dependent)  to  allow  access  to  all  of  the  necessary  spatial 
frequency  information.  Numerous  techniques  have  been  introduced  to  extend  the 
results  of  practical  imaging  systems  to  these  limits.  Both  phase  shift  masks  and 
imaging  interferometric  lithography  with  very  high  NA  lenses  are  promising  al¬ 
ternatives  that  extend  the  spatial  frequency  coverage  of  conventional  imaging  and 
closely  approach  this  single-exposure  optics  resolution  limit. 
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All  of  these  results  scale  as  the  refractive  index  of  the  transmission  medium. 
This  allows  an  additional  reduction  of  line  width  of-  1.5,  but  at  the  expense  of  a 
much  more  complex  optical  system. 

Imaging,  exposure,  development,  and  pattern  transfer  are  all  highly  nonlinear 
processes.  The  optics  limit  is  on  the  closest  resolvable  distance  between  two  ob¬ 
jects,  not  on  the  linewidth  of  an  individual  object. 

Table  5.2  shows  the  limits  of  optical  lithography,  assuming  a  nominal  200- 
nm  photon.  ArF  lasers  at  193  nm  are  coming  into  widespread  use  in  the  silicon 
manufacturing  industry.  Coherent  sources  at  213  nm  (fifth  harmonic  of  YAG)  are 
commercially  available.  The  photoresist  infrastructure  at  these  wavelengths  is 
already  largely  in  place.  There  are  ongoing  efforts  to  further  reduce  the  linewidth. 
All  of  these  results  scale  accordingly.  As  mentioned  in  the  introduction,  EUV  is 
an  electromagnetic  imaging  technology  and  the  same  arguments  apply,  with  the 
difference  that  very  high-NA  lenses  are  precluded  by  geometric  considerations. 
Table  5.2a  presents  the  single  exposure  limits  for  both  interferometric  (periodic 
patterns)  and  optical  imaging  (arbitrary  patterns)  lithographies.  Table  5.2b  shows 
the  corresponding  limits  with  a  single  interpolation  step.  Larger  numbers  of  in¬ 
terpolation  steps  are  possible  at  the  price  of  increased  complexity  and  reduced 
process  latitude. 

Nonlinearities  typically  introduce  higher  harmonics  of  the  spatial  frequencies 
of  the  image.  Combining  images  after  applying  these  nonlinearities  independ¬ 
ently  to  each  image  is  shown  to  provide  sufficient  flexibility  to  extend  optical 
resolutions;  a  factor  of  two  seems  readily  accessible.  In  principle  higher  multipli¬ 
ers  are  possible,  but  process  control  and  alignment  issues  will  become  increas¬ 
ingly  difficult.  Combining  nonlinearities  requires  transfer  into  a  sacrificial  layer 
or  multiple  layers  of  resist.  Multiple  exposures  in  the  same  level  of  resist  are  not 
able  to  achieve  the  higher  resolution. 

It  is  useful  to  compare  these  tables  with  Table  5.1  that  sets  out  the  ITRS  li¬ 
thography  requirements.  The  ultimate  decisions  as  to  lithographic  technique  will 
be  made  on  economic  grounds.  While  optics  can  be  extended  well  beyond  to¬ 
day’s  requirements  based  on  the  present  analysis,  there  are  many  other  issues  that 
have  not  been  discussed  in  this  chapter.  These  have  to  do  with  depth-of-field,  and 
in  general  with  process  latitude  issues.  Impacts  of  imperfection  in  the  mask,  the 
optical  system,  the  focal  plane,  etc.  have  been  ignored  and  deserve  detailed 
analysis. 


Table  5.2a:  Single  Exposure  Limits  of  Optical  Lithography  (A  =  200  nm  and  n  =  1.5) 


Technique 

Half-Pitch  CD 

Pitch 

Interferometric  lithography  in  air  (or  vacuum) 

A/4  (50  nm) 

All  (100  nm) 

Immersion  interferometric  lithography  (n  =  1.5) 

A/4n  (33  nm) 

A/2n  (66nm) 

Nonrepetitive  patterns  (imaging)  in  air  (or  vacuum) 

A/3  (67  nm) 

A/ 1.5  (133  nm) 

Single-exposure  imaging  with  immersion 

A/3  n  (44  nm) 

A/1.5 n  (88  nm) 
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Table  5.2b  Double  Exposure  Limits  of  Optical  Lithography  (A  =  200  nm  and  n  =  1.5) 


Technique 

Half-Pitch  CD 

Pitch 

Multiple  exposure  IL  in  air* 

A/8  (25  nm) 

AJ 4  (50  nm) 

Multiple  exposure  IL  with  immersion* 

A/8n  (17  nm) 

A/6  (34  nm) 

Multiple  exposure  imaging  in  air* 

A/6  (33  nm) 

A/3  (67  nm) 

Multiple  exposure  imaging  with  immersion* 

AJ6n  (22  nm) 

A/3n  (44  nm) 

*In  principle,  the  frequency  multiplication  can  be  repeated  multiple  times,  a  single  dou¬ 
bling  of  the  spatial  frequency  in  both  the  x-  and  y-  directions  is  assumed,  as  illustrated  in 
Figs.  5.15  and  5.16. 


Nanotechnology  is  emerging  as  a  major  research  field.  One  of  the  require¬ 
ments  for  many  nanotechnology  directions  is  a  facile  and  inexpensive  patterning 
technique  to  reach  the  nanoscale.  Parallel  writing  is  an  important  advantage  of 
optical  lithography,  macroscopic  areas  are  printed  in  a  single  exposure  with  high 
uniformity  and  placement  accuracy.  Based  on  the  analysis  and  results  presented 
herein,  optical  lithography,  and  especially  interferometric  lithography,  will  play  a 
significant  role  in  the  development  of  this  new  field. 
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Abstract 

Two-photon  absorption  of  photosensitive  media  can  produce 
interference  fringes  with  double  spatial  frequency.  This  requires  the 
employment  of  multiple-frequency  beams,  which  interfere  with  one 
another  to  produce  a  stationary  image  with  double  spatial 
resolution.  The  required  beams  were  produced  by  frequency 
filtering  of  broadband  radiation  from  a  cw  mode-locked 
femtosecond  Ti:sapphire  laser  (lambda  =  790  nm)  in  a  dispersion- 
free  pulse  shaper.  Then  the  two  multifrequency  rays  converged 
from  opposite  edges  of  a  lens,  focusing  on  Kodak  commercial  film. 
The  laser  intensity  was  high  enough  to  produce  a  two-photon 
exposure.  The  doubling  of  the  spatial  frequency  of  the  interference 
pattern  has  been  observed,  but  the  contrast  ratio  of  the  pattern 
was  limited  by  competition  from  the  more  usual  one-photon 
absorption.  Laser  pulse  parameters  for  a  single-pulse  two-photon 
exposure  have  been  estimated.  (Copyright)2002  Society  of  Photo- 
Optical  Instrumentation  Engineers. 

©2005  SPIE—The  International  Society  for  Optical  Engineering. 
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ABSTRACT 

The  limit  of  optical  lithography  is  the  minimum  pitch  between  features.  This  pitch  limit  is  given  by  A  =  A/lnNA  ,  where 

A  is  the  optical  wavelength,  n  is  the  refractive  index  of  the  final  medium  of  the  optical  system  which  is  typically  air  (n  = 
1),  and  NA  is  the  numerical  aperture  of  the  exposure  device.  A  great  deal  of  work  has  been  done  to  decrease  exposure 
wavelengths  and  increase  the  NA  of  exposure  tools,  however,  until  recently  very  little  effort  has  been  applied  towards  an 
immersion  medium  with  n  >  1.  This  paper  examines  extending  minimum  pitches  through  the  use  of  such  media. 
Exposures  are  at  a  wavelength  of  213  nm,  close  to  the  current  state-of-the-art  193-nm  lithography  node.  The  possible 
limits  of  lithography  are  examined  using  193-nm  resists  exposed  in  air  and  comparing  these  limits  to  those  possible 
when  implementing  liquid  immersion  lithography  (LIL)  exposures.  Two  immersion  liquids  were  examined:  deionized 
water,  and  Krytox®  a  Perfluoropolyether  (PFPE)  oil.  These  liquids  were  compatible  with  193-nm  resist.  A  resolution 
enhancement  factor  of  28%  for  Krytox®  and  of  41%  for  DI  water  was  demonstrated.  Images  of  good  dense  lines  with  a 
half  pitch  of  54  nm  are  presented. 

Keywords:  liquid  immersion  lithography,  interferometric  lithography,  DI  H20,  193  nm  photoresist,  Krytox,  PFPE,  sub 
100-nm  half-pitch 


1.  INTRODUCTION 

As  the  microelectronic  industry  pushes  towards  smaller  feature  sizes,  greater  demands  are  placed  on  lithography. 
Modern  lithography  now  commonly  produces  features  smaller  than  the  optical  wavelength.  Through  the  use  of  optical 
enhancement  techniques  such  as  off-axis  illumination  (OAI),  optical-proximity  correction  (OPC),  phase-shift  masks 
(PSM)  and  imaging  interferometric  lithography  (IIL),  industry  can  extend  the  current  limits  of  state  of  the  art  lithography 
tools.  These  techniques  are  quickly  being  pushed  to  their  limits  while  next  generation  lithography  technologies  are  still 
under  development.  Industry  needs  to  find  new  ways  to  extend  their  lithography  capabilities  toward  the  theoretical 
limits.  The  ultimate  minimum  feature  optical  lithography  can  produce  is  a  line  and  space  whose  pitch  is  determined  by 
the  well-known  formula  A  =  A/ 2 nNA  ,  where  A  is  the  wavelength  of  light,  n  is  the  refractive  index  of  the  exposure 
medium,  and  NA  =  sin(6/)  is  the  sine  of  the  half  angle  of  the  light  at  the  exit  face  of  the  lens. 

Historically,  the  exposure  wavelength  A  has  been  steadily  decreased  and  the  numerical  aperture  NA  has  been  steadily 
increased  to  achieve  to  higher  resolution.  Current  leading  edge  exposure  tools  operate  at  a  wavelength  of  193  nm  with 
NA  ~  0.7  with  projections  to  NA  ~  0.85  in  the  near  future.  Currently  there  is  no  optical  node  beyond  157  nm  but  research 
is  underway  for  even  shorter  usable  wavelengths  sources.  There  is  still  a  limit  on  traditional  refractive-optics-based 
lithography  as  there  are  no  known  transparent  lens  materials  below  ~  105  nm.1  Interest  has  recently  grown  dramatically 
in  exploring  the  use  of  immersion  media,  between  the  exit  face  of  the  lens  and  the  photoresist  surface,  to  extend  optical 
lithography.2, 3,4,5  Currently  air  is  the  only  exposure  medium  being  used  for  commercial  exposure  tools,  with  n  =  1.  The 
use  of  immersion  media  to  extend  optical  resolution  is  well  known  from  optical  microscopy.  Through  the  use  of  other 
media,  such  as  de-ionized  water  with  a  value  of  n  ~  1.44  at  193  nm,  much  smaller  minimum  feature  sizes  can  be 
achieved.  From  the  pitch  equation  this  can  be  thought  of  either  as  an  effective  wavelength  Ae^  = ~  134  nm  ;  or 

alternatively  as  an  increase  to  an  effective  NAe^  -  ft  sin#  <  1.44  .  Thus,  the  use  of  water  immersion  lithography  could 

potentially  extend  193-nm  wavelength  optical  lithography  by  44%  or  almost  two  generations,  twice  the  resolution 
enhancement  offered  by  the  transition  to  157  nm.  Application  of  immersion  techniques  at  157  nm  will  have  a  similar 
impact;  however  a  suitable  liquid  has  yet  to  be  identified. 


In  order  to  provide  a  perspective  on  semiconductor  industry  lithographic  requirements,  Table  1  reproduces  material  from 
the  2001  International  Technology  Roadmap  for  Semiconductor  (ITRS).6  A  technology  node  is  defined  by  ITRS  as  “the 


half-pitch  of  the  first-level  interconnect  dense  lines  is  most  representative  of  the  DRAM  technology  level  required  for 
the  smallest  economical  chip  size.  For  logic  such  as  microprocessors  (MPUs),  physical  bottom  gate  length  is  most 
representative  of  the  leading-edge  technology  level  requires  for  maximum  performance”,  “The  smallest  half-pitch  is 
typically  found  in  the  memory  cell  area  of  the  chip.  Each  technology  node  step  represents  the  creation  of  significant 
technology  progress  -  approximately  70%  of  the  preceding  note,  50%  of  two  preceding  nodes.”6 


Year  Of  Production 

2001 

2002 

2003 

2004 

2005 

2006 

2007 

2010 

2013 

2016 

DRAM 

DRAM  V2  Pitch  (nm) 

130 

115 

100 

90 

80 

70 

65 

45 

32 

22 

Contact  in  resist  (nm) 

165 

140 

130 

110 

100 

90 

80 

55 

40 

30 

Contact  after  etch  (nm) 

150 

130 

115 

100 

90 

80 

70 

50 

35 

25 

Overlay 

46 

40 

35 

32 

28 

25 

23 

18 

13 

9 

CD  control  (3  sigma)  (nm) 

15.9 

14.1 

12.2 

11.0 

9.8 

8.6 

8.0 

5.5 

3.9 

2.7 

MPU 

MPU  V2  Pitch  (nm) 

150 

130 

107 

90 

80 

70 

65 

45 

32 

22 

MPU  gate  in  resist  (nm) 

90 

70 

65 

53 

45 

40 

35 

25 

18 

13 

MPU  gate  length  after  etch  (nm) 

65 

53 

45 

37 

32 

28 

25 

18 

13 

9 

Contact  in  resist  (nm) 

165 

140 

122 

100 

90 

80 

75 

50 

37 

27 

Contact  after  etch  (nm) 

150 

130 

107 

90 

80 

70 

65 

45 

32 

22 

Gate  CD  control  (3  sigma)  (nm) 

5.3 

4.3 

3.7 

3.0 

2.6 

2.4 

2.0 

1.5 

1.1 

0.7 

White  -  Manufacturable  Solution  Exist,  And  Are  Being  Optimized 
Light  Gray  -  Manufacturable  Solution  are  Known 
Dark  Gray  -  Manufacturable  Solutions  are  NOT  Known 


Table  1  -  Lithography  technology  requirements.6 


This  paper  presents  initial  exposure  results  for  immersion  lithography  at  a  wavelength  of  213  nm,  close  to  the  current 
193-nm  exposure  wavelength  used  in  industry.  A  maskless  interferometric  lithography  approach  that  gives  essentially 
ideal  contrast  fringes  was  used.7  Exposures  were  carried  out  at  a  number  of  angles  corresponding  to  different  NA’s.  The 
smallest  half-pitch  achieved  was  54  nm,  limited  by  the  image  resolution  of  the  resist  systems  investigated  rather  than  by 
any  optical  effects;  the  optical  half  pitch  limit  {XI An)  in  these  experiments  was  -  34  nm. 

2.  EXPERIMENTAL  SETUP 

The  light  source  was  a  Coherent  Infinity  40-100  Nd:YAG  pulsed  (~  2  ns)  laser  operating  at  the  fifth-harmonic 
wavelength  of  213-nm.  This  source  has  much  higher  coherence  length  (~  0.6  m)  than  a  193  nm  excimer  laser  that 
simplifies  the  experimental  arrangement  for  interferometric  lithography.  193-nm  resists  work  well  at  this  wavelength.  A 
simple  diverging  lens  expands  the  /4-inch  beam  to  about  2. 5 -inches  at  the  immersion  cell  2-inch  entrance  window. 

A  Fresnel  corner  cube  interferometer  immersion  cell  was  designed  and  constructed  for  this  work.  Figure  1  illustrates 
how  the  comer  cube  mirror  folds  half  of  the  incoming  UV  beam  onto  the  wafer  to  form  a  sinusoidal  interference  pattern 

on  the  surface  of  the  wafer.  The  period  of  the  grating  exposed  in  the  resist  is  given  by  A  =  2.n^m(0)  (n  =  ^  f°r  exposure 

in  air).  Therefore  rotating  the  corner  cube  interferometer  with  respect  to  the  incoming  UV  beam  sets  the  period.  With 
this  arrangement  the  two  beams  are  always  symmetrically  incident  on  the  wafer  for  any  pitch.  For  immersion 
lithography  the  entire  Fresnel  corner  cube  mirror  interferometer  needs  to  be  immersed  in  the  liquid.  To  achieve  this  a 
cell  is  made  to  completely  enclose  a  Fresnel  corner  cube.  Advantages  of  this  arrangement  are  rapid  changes  of  pitch 
without  requiring  any  realignment  other  than  rotation  of  the  corner  cube,  and  direct  measurements  of  the  refractive  index 
by  exposing  a  wafer  half  immersed  in  liquid  and  half  in  the  air  space  above  the  liquid.  The  ratio  of  the  periods  for  these 
two  gratings  is  a  good  measure  of  the  liquid  refractive  index.  A  disadvantage  is  the  relatively  long  liquid  path  length  - 
1  cm,  which  precludes  the  use  of  highly  absorbing  liquids.  Fortunately,  as  is  discussed  below,  both  of  the  liquids 
explored  in  this  work  had  sufficiently  low  absorption  at  213  nm  that  this  was  not  an  issue. 


A  window  on  the  cell  allows  for  the  external  exposure  source  to  be  coupled  into  the  cell.  A  rendering  and  an  actual 
image  of  the  immersion  cell  used  in  this  work  are  shown  in  Figure  2  with  a  covering  lid  removed  for  clarity. 


0=45°  '  0=60° 

Figure  1  -  Fresnel  comer  cube  mirror  interferometer  diagram. 


Figure  2  -  Rendering  and  photo  of  immersion  cell. 


The  only  other  optical  components  in  the  exposure  setup  are  a  pair  of  213  nm  45°  high  power  dielectric  mirrors,  used  to 
steer  the  beam  around  the  optical  table.  Figure  3  shows  a  diagram  of  the  exposure  setup. 


21  3nm 
5(0  NdYAG 


Immersion 
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Figure  3  -  Diagram  of  exposure  setup. 


3.  EXPERIMENTAL  RESULTS 


3.1  Immersion  medium  index  and  absorption  measurements 

Measurements  were  made  on  two  immersion  liquids,  deionized  (DI)  water  and  Krytox®  a  perfluoropolyether  (PFPE)  oil, 
to  determine  the  absorption  coefficient  and  index  of  refraction  at  the  213-nm  exposure  wavelength.  Figure  4  shows  the 
absorption  measurements  for  both  DI  water  and  Krytox®.  The  absorption  coefficient  for  DI  water  is  k  =  2.28T0"2  cm"1 
(base  e)  at  213  nm;  the  absorption  coefficient  for  Krytox®  GPL100  is  k  =  5.45T0"2  cm'1  (base  e)  at  213  nm.  For  both 
liquids  the  1/e  lengths  are  much  greater  than  the  immersion  distance  for  the  imaging  experiments. 

In  order  to  measure  the  index  of  refraction,  it  is  necessary  to  measure  the  pitches  of  the  gratings  exposed  by  both  the 
liquid  immersed  part  and  the  air  exposed  part  on  the  same  resist  sample.  The  pitch  was  measured  by  using  the  %  inch 
diameter  213-nm  beam  at  low  power  on  a  developed  resist  sample  mounted  on  a  stepper  rotation  stage.  The  pitch  is 
simply  obtained  by  measuring  the  angle  <9  between  the  0-order  reflected  beam  and  the  lst-order  diffracted  beam.  Because 
the  exposure  angle  determined  by  the  Fresnel  corner  cube  interferometer  was  identical  for  both  gratings,  the  difference 
between  the  two  pitches  is  due  to  the  ratio  of  their  respective  refractive  indices.  This  measurement  was  repeated  at  a 
number  of  exposure  pitches;  the  number  reported  here  is  the  average  of  about  5  measurements  for  each  liquid.  The 
measured  index  of  refraction  for  DI  water  is  n  =  1.42,  for  Krytox  it  is  n  =  1.29. 


Figure  4  -  Measured  absorption  curves  for  DI  H20  &  Krytox® 


3.2  Exposures  in  Air 

This  work  used  two  193-nm  resists,  both  represent  commercially  available  resist  for  the  year  2000.  Each  was  provided 
by  a  major  resist  manufacturer.  The  first  sets  of  resist  samples  were  completely  exposed  in  air.  It  was  necessary  to 
characterize  the  resist  in  air  to  confirm  the  BARC  and  resist  coats  were  good,  the  development  program  was  adequate, 
and  to  validate  that  there  were  no  contamination  issues.  Once  the  193-nm  resist  processing  issues  were  worked,  a 
baseline  was  established. 

The  next  few  sections  will  present  the  SEM  images  of  the  baseline  process,  for  three  sets  of  exposure  for  the  two  193-nm 
resists.  In  all  cases  resist  A  is  100-nm  thick  on  45-nm  thick  DUV32-6  BARC,  and  resist  B  is  150-nm  thick  on  45-nm 
thick  DUV32-6  BARC. 


3.2.1  Exposures  at  NA  =  0.50 

The  first  set  of  exposures  in  air  were  done  at  a  NA  =  0.50,  which  is  equivalent  to  an  interference  angle  of  30°.  Most 
photolithography  steppers  used  today  in  industry  are  capable  of  exposing  images  at  NA  =  0.50.  The  pitch  made  by  the 
corner  cube  interferometer  at  30°  in  air  is  213  nm.  At  this  particular  NA  the  feature  pitch  is  equal  to  the  exposing 
wavelength  (A  =  s 1  =  213  nm). 


At  a  NA  =  0.50  excellent  lines  were  achieved  as  seen  in  Figure  5,  with  a  measured  pitch  of  222  nm.  The  dense  lines  pitch 
of  222  nm  is  slightly  larger  than  the  target  of  213nm  (28.7°  instead  of  30°).  It  is  of  no  surprise  that  excellent  lines  were 
patterned,  as  this  represents  the  technology  node  for  which  these  193-nm  resist  were  designed. 


Figure  5  -  222  nm  pitch  with  -71  nm  lines  in  resist  A  (n  =  1  (air),  NA  =  0.50). 


As  expected  at  a  NA  =  0.50  excellent  lines  were  achieved  in  resist  B  as  seen  in  Figure  6,  with  a  measured  pitch  of  213 
nm.  There  are  some  standing  wave  patterns  on  the  resist  sidewalls  however.  The  thinner  resist  A  has  no  visible  standing 
wave,  most  likely  due  to  the  resist  thickness  being  to  small  for  more  than  one  standing  wave  to  exist 
((/  =  100ww)<  (2  •  -j  =  106. 5wh)).  Resist  B  is  thick  enough  to  have  two  standing  wave  half-periods 

((f  =  150 nm)  <  (3  •  |  =  159.75ww)) . 


Figure  6-213  nm  pitch  with  -102  nm  lines  in  resist  B  (n  =  1  (air),  NA  =  0.50). 


3.2.2  Exposures  at  NA  =  0.71 

The  second  set  of  exposures  in  air  were  done  at  a  NA  =  0.71,  which  is  equivalent  to  an  interference  angle  of  45°.  The 
pitch  made  by  the  corner  cube  interferometer  at  45°  in  air  is  150  nm. 

As  the  pitched  is  reduced  to  151  nm  with  a  NA  =  0.71  it  was  still  possible  to  get  good  lines  in  resist  A.  The  measured 
pitch  in  Figure  7  is  151  nm.  The  sidewall  profiles  of  the  lines  are  more  rounded  than  the  lines  patterned  at  NA  =  0.50. 


Figure  7-151  nm  pitch  with  ~64  nm  lines  in  resist  A  (n  =  1  (air),  NA  =  0.71). 


It  was  possible  to  get  good  lines  as  well  in  resist  B  as  seen  Figure  8,  with  a  measured  pitch  of  151  nm.  The  sidewall 
profiles  of  the  lines  have  standing  waves,  and  are  tapered  towards  the  bottom. 


Figure  8-151  nm  pitch  with  ~85  nm  lines  in  resist  B  (n  =  1  (air),  NA  =  0.71). 


3.2.3  Exposures  at  NA  =  0.98 

The  third  and  last  set  of  exposures  in  air  were  done  at  NA  =  0.98,  which  is  equivalent  to  an  interference  angle  of  80°. 
This  NA  represents  the  practical  limit  of  how  small  a  pitch  can  be  made  from  at  213-nm  light  source  in  air.  The  pitch 
made  by  the  corner  cube  interferometer  at  80°  in  air  is  108  nm,  whereas  the  theoretical  limit  is  106.5  nm  in  air  (A/2). 

As  the  pitch  is  reduced  to  108  nm  at  NA  =  0.98,  near  the  theoretical  limit  of  106.5  nm  at  NA  =  1  in  air,  we  were  still  able 
to  get  lines  in  resist  A.  Figure  9  has  a  measured  pitch  of  108  nm.  Many  of  the  lines  started  to  collapse  due  to  the  rounded 
profiles  of  the  resist. 


Figure  9-108  nm  pitch  with  ~60  nm  lines  in  resist  A  (n  =  1  (air),  NA  =  0.98). 


For  resist  B  as  the  pitched  is  reduced  to  108  nm,  near  the  theoretical  limit  of  106.5  nm  in  air,  all  the  lines  collapsed,  see 
Figure  10.  Resist  B  was  too  thick  to  pattern  lines  this  small,  with  a  measured  pitch  of  108  nm.  Due  to  the  large  aspect 
ratio  (height/width)  the  lines  collapsed  from  capillary  forces  exerted  on  the  lines  during  the  drying  of  the  wafer  after 
develop  and  rinse.  As  the  water  dries  unevenly  between  the  lines,  capillary  forces  acting  on  the  walls  of  the  lines  pull 
them  over. 


Figure  10-108  nm  pitch  with  ~66  nm  lines  in  resist  B  (n  =  1  (air),  NA  =  0.98). 


3.3  Exposure  in  DI  H20 

After  the  resist  samples  were  baselined  in  air,  the  immersion  cell  was  filled  with  deionized  water  and  immersion- 
exposed  resist  samples  were  made.  In  order  to  ensure  that  the  DI  water  was  not  contaminated  the  cell  was  drained, 
rinsed,  and  filled  with  fresh  DI  water  between  runs.  There  were  no  obvious  problems  with  contamination  during  these  DI 
water  immersion  experiments. 

The  next  few  sections  will  present  the  SEM  images  of  the  DI  water  immersion  lithography,  for  three  same  three  sets  of 
exposure  as  done  in  air  for  the  two  193-nm  resists.  Again  in  all  cases,  resist  A  is  100-nm  thick  on  45-nm  thick  DUV32-6 
BARC,  and  resist  B  is  150-nm  thick  on  45-nm  thick  DUV32-6  BARC. 


3.3.1  Exposures  at  NA  =  0.50 

The  first  set  of  immersion  exposures  in  DI  water  were  done  at  NA  =  0.50,  which  is  equivalent  to  an  interference  angle  of 
30°.  The  pitch  made  by  using  the  comer  cube  interferometer  at  30°  in  DI  water  is  151  nm  instead  of  the  213nm  for  air. 

The  difference  is  that  the  pitch  is  smaller  by  a  factor  of  nu  o  .  This  DI  water  immersion  exposure  is  also  equivalent  to 

exposing  at  NA  ~  0.71  in  air. 


Resist  A  performed  very  well  when  immersed  in  DI  water.  At  NA  =  0.50  excellent  lines  were  achieved  as  seen  in  Figure 
1 1 ,  with  a  measured  pitch  of  1 5 1  nm.  These  lines  even  look  better  than  the  151  nm  pitched  lines  exposed  in  air.  This 
may  be  due  to  the  higher  index  of  refraction  of  the  DI  water,  which  provides  a  partial  top  antireflection  coating  and 


Figure  11-151  nm  pitch  with  ~75  nm  lines  in  resist  A  (n=  1.42  (H20),  NA  =  0.50). 


Resist  B  also  performed  very  well  when  immersed  in  DI  water.  At  NA  =  0.50  good  lines  were  achieved  with  a  measured 
pitch  of  148  nm.  Similarly,  these  lines  even  look  better  than  the  151  nm  pitched  lines  exposed  in  air.  There  still  are 
standing  waves  on  Resist  B  lines. 


Figure  12  -  148  nm  pitch  with  -56  nm  lines  in  resist  B  (n=  1.42  (H20),  NA  =  0.50). 


3.3.2  Exposures  at  NA  =  0.71 

The  second  set  of  immersion  exposures  in  DI  water  were  done  at  NA  =  0.71,  or  an  interference  angle  of  45°.  The  pitch 
made  by  the  corner  cube  interferometer  at  45°  in  DI  water  is  107  nm.  This  is  equivalent  to  exposing  at  NA  ~  0.995  in  air. 


As  the  pitched  is  reduced  to  108  nm  in  DI  water  immersion  at  NA  =  0.50  good  resist  lines  were  resolved  in  resist  A.  The 
measured  pitch  in  Figure  13  is  108  nm.  Unlike  the  exposure  of  108-nm  pitch  lines  in  air,  none  of  the  lines  collapsed  in 
DI  water  immersion  lithography,  as  seen  in  Figure  13;  again  possibly  because  of  the  top- ARC  effect  of  the  liquid. 


Figure  14  -  -108  nm  pitch  with  -59  nm  lines  in  resist  B  (n=  1.42  (H20),  NA  =  0.71). 


3.3.3  Exposures  at  NA  =  0.98 

The  last  set  of  immersion  exposures  in  DI  water  were  done  at  NA  =  0.98,  which  is  equivalent  to  an  interference  angle  of 
80°  or  an  air-referenced  NA  of  1.41.  The  pitch  made  by  the  corner  cube  interferometer  at  80°  in  DI  water  is  77  nm.  Only 
through  immersion  lithography  can  you  image  this  small  at  this  wavelength.  Unfortunately,  neither  of  the  193-nm  resists 


was  capable  of  imaging  lines  on  a  77-nm  pitch.  The  surface  of  the  resist  was  roughened,  and  some  resemblance  of  a 
grating  pattern  could  be  discerned.  No  pictures  were  recorded. 

3.4  Exposure  in  Krytox® 

The  last  sets  of  immersion  resist  samples  were  done  in  Krytox®.  The  immersion  cell  was  filled  with  filtered  Krytox® 
GPL  100  PFPE  oil.  There  were  no  observed  problems  with  contamination  when  running  these  Krytox®  immersion  runs. 
The  Krytox®  immersion  samples  were  rinsed  in  Vertrel®  XF  after  exposure  to  remove  any  residual  Krytox®  film  on  the 
surface  of  the  resist. 

The  next  few  sections  will  present  the  SEM  images  of  the  Krytox®  immersion  lithography,  for  the  same  three  sets  of 
exposure  as  done  in  air  for  the  two  193-nm  resists.  In  all  cases  resist  A  is  100-nm  thick  on  45-nm  thick  DUV32-6 
BARC,  and  resist  B  is  150-nm  thick  on  45-nm  thick  DUV32-6  BARC. 

3.4.1  Exposures  at  NA  =  0.50 

This  set  of  immersion  exposures  in  Krytox®  were  done  at  NA  =  0.50,  which  is  equivalent  to  an  interference  angle  of  30°. 
The  pitch  made  by  the  corner  cube  interferometer  at  30°  in  Krytox®  is  165  nm.  This  is  equivalent  to  exposing  at  a  NA  ~ 
0.65  in  air. 


Resist  A  also  performed  very  well  when  immersion  in  Krytox®.  At  a  NA  =  0.50  excellent  lines  were  achieved  as  seen  in 


Figure  11,  with  a  measured  pitch  of  161  nm.  There  still  is  a  bit  of  rounding  of  the  resist  pattern.  The  index  of  reflection 


measurement  for  Krytox  on  this  sample  is_l  .289  ±  0.0018 


Figure  16-160  nm  pitch  with  ~65  nm  lines  in  resist  B  (n  =  1.29  (Krytox®^  NA  =  0.50). 


3.4.2  Exposures  at  NA  =  0.71 

Attempts  were  made  to  get  a  set  of  immersion  exposures  in  Krytox®  with  a  NA  =  0.71,  which  is  equivalent  to  an 
interference  angle  of  45°.  Pitch  made  by  the  comer  cube  interferometer  at  45°  in  Krytox®  is  117  nm.  This  is  the 
equivalence  of  exposing  at  NA  ~  0.91  in  air. 


Resist  A  also  performed  well  when  immersed  in  Krytox®,  at  NA  =  0.71.  Lines  were  achieved  as  seen  in  Figure  17,  with  a 
measured  pitch  of  1 1 6  nm.  There  is  resist  scumming  and  footing  between  the  lines  in  the  resist  pattern.  The  index  of 
measurement  for  Krytox®  on  this  sample  is  1.302  ±  0.0014. 


Figure  17-116  nm  pitch  with  ~48  nm  lines  in  resist  A  (n=  1.29  (Krytox®),  NA  =  0.71). 


No  images  were  recorded  of  resist  B  in  Krytox®  with  a  NA  =  0.71. 

3.4.3  Exposures  at  NA  =  0.98 

Attempts  were  made  to  get  a  set  of  immersion  exposures  in  Krytox®  with  a  NA  =  0.98,  which  is  equivalent  to  an 
interference  angle  of  80°.  Pitch  made  by  the  comer  cube  interferometer  at  80°  in  Krytox®  is  83  nm.  This  is  no 
equivalence  of  exposing  in  air,  only  through  immersion  lithography  can  you  image  this  small.  However  as  no  surprise, 
no  good  samples  were  made  in  either  resist  A  or  resist  B. 

No  images  were  recorded  of  resist  A  in  Krytox®  with  a  NA  =  0.98. 

Figure  18  show  an  attempt  to  make  a  grating  in  Krytox®  at  NA  =  0.98.  As  can  be  seen  the  resist  was  too  thick  and 
imaging  resolution  of  the  resist  is  questionable.  The  measured  pitch  is  85  nm.  The  index  of  measurement  for  Krytox®  on 
this  sample  is  1.272  ±  0.0010. 


Figure  18  -  85  nm  pitch  with  ~37  nm  lines  in  resist  B  (n  =  1.29  (Krytox®),  NA  =  0.98). 


4.  SUMMARY 

4.1  Minimum  Pitch  Comparisons  For  Various  Exposure  Media 

Immersion  lithography  reduces  the  minimum  pitch  that  can  be  written  by  a  factor  of  the  index  of  refraction  n  of  the 
immersion  medium.  The  following  table  gives  the  theoretical  minimum  pitch  for  several  exposure  media  at  different 
wavelengths. 


Exposure  Medium 

Air 

DI  Water 

Krytox® 

Exposure  Wavelengtnfltm)^ 

Pitch  [Half-Pitch]  (nm) 

Pitch  [Half-Pitch]  (nm) 

Pitch  [Half-Pitch]  (nm) 

257 

128  T641 

94  [47] 

Unknown 

213 

107  [54] 

76  [38] 

83  [42] 

193 

97  [491 

66  [331 

73  [371 2 

157 

79  [40] 1 

58  [29] 3 

1 . )  Minimum  pitch  in  Argon  gas,  or  vacuum. 

2. )  Interpolated  index  of  refraction  n  for  Krytox®  at  193  nm. 

3. )  M.  Switkes,  M  Rothschild,  “Immersion  lithography  at  157  nm”  5 


Table  2  -  Minimum  theoretical  exposure  pitch  for  various  exposure  mediums. 


4.2  Resist  Results 

Both  of  the  193-nm  resists  investigated  showed  comparable  imaging  properties  and  were  capable  of  imaging  dense  lines 
with  a  151-nm  pitch  in  air.  If  resist  B  had  been  thinner  it  is  believed  that  both  resist  would  have  been  able  to  image  dense 
lines  down  to  108-nm  pitch,  which  would  represent  the  practical  limit  of  photolithography  at  a  213-nm  exposure 
wavelength.  Both  193-nm  resists  appeared  to  be  compatible  with  the  DI  water  and  Krytox®  immersion  media.  The 
immersion  exposure  gave  better  line  profiles  then  the  equivalent  air-exposure  at  the  same  pitch.  Neither  of  the  193-nm 
resists  was  capable  of  imaging  dense  lines  at  the  highest  resolution  attempted  (77-nm  pitch  in  H20;  85-nm  pitch  in 
Krytox®).  This  is  likely  due  a  combination  of  increasing  aspect-ratio  and  diffusion-limited  resolution  of  the  resist  at 
these  small  scales. 

4.3  Conclusions 

Optical  photolithography  is  constantly  evolving  to  image  ever-decreasing  feature  sizes.  Liquid  Immersion  lithography 
(LIL)  is  a  good  candidate  for  a  new  image  enhancing  technique  to  further  push  the  limits  of  optical  lithography.  With 
immersion  lithography  it  is  possible  to  print  smaller  dense  lines  on  a  wafer  than  currently  possible  in  air.  Liquid 
immersion  lithography  can  work  in  conjunction  with  other  image  enhancing  techniques  such  as  off-axis  illumination, 
optical-proximity  correction,  phase-shift  masks,  and  imaging-interferometric  lithography.  Polarization  issues  will 
become  more  pronounced  at  the  higher  angles  possible  in  the  resist  with  immersion  lithography. 

It  has  been  demonstrated  in  this  work  that  it  possible  to  printing  dense  lines  using  liquid  immersion  lithography  at  an  NA 
of  0.71;  this  is  equivalent  to  printing  dense  lines  in  air  with  an  NA  of  0.98.  Dense  lines  with  a  half-pitch  of  54  nm  in  a 
193-nm  resist  were  demonstrated,  with  exposures  both  in  air  (NA  =  0.98)  and  in  liquid  immersion  media  (NA  ~  0.7). 

Two  immersion  liquids  have  been  identified  that  have  more  than  adequate  transparency  at  the  193-nm  lithography  node: 
deionized  water  and  Krytox®  PFPE  oil.  At  the  157-nm  lithography  node  PFPE  oils  still  have  some  transmission, 
although  path  lengths  need  to  be  extended. 

It  is  foreseeable  that  with  resist  improvements,  liquid  immersion  lithography  technology  can  be  extended  to  the  32-nm 
half-pitch  technology  node.  Based  on  this,  optical  lithography  could  be  extended  until  2016,  according  to  the  initial 
timetable  of  the  2002  International  Technology  Roadmap  for  Semiconductor  (ITRS).  This  would  ease  the  timescales  of 
switching  over  to  a  next  generation  lithography  (NGL)  technology. 

4.4  Future  studies 

For  the  continuation  of  this  work,  a  thinner  version  of  the  193-nm  resist  used  should  by  evaluated,  in  attempt  to  make 
dense  lines  with  an  80-nm  pitch.  Improvements  are  continually  being  made  to  resists;  additional  193-nm  resists  should 
be  evaluated  for  immersion  lithography. 


Two  immersion  fluids  have  been  identified  for  193-nm  lithography,  and  just  one  for  157-nm  lithography.  Other  liquids 
that  are  transparent  and  should  be  evaluated  at  these  DUV  wavelengths  are  cryogenic  liquids,  such  as  liquid  Argon. 


Research  is  needed  to  find  other  immersion  fluids,  for  193  nm,  157  nm,  and  any  future  optical  lithography  wavelengths. 
Another  approach  is  to  modify  PFPE  oils  to  give  better  transmittance  or  higher  index  of  reflection  to  be  used  at  1 93  nm 
and  157  nm. 


The  immersion  cell  designed  for  this  work  was  made  to  be  compatible  with  many  types  of  illumination  sources.  Future 
evaluation  can  be  done  at  157  nm  or  other  emerging  DUV  light  sources. 

Immersion  lithography  will  need  to  be  implemented  and  demonstrated  in  a  lens  system  using  masks.  Wafer  handling 
systems  will  need  to  be  demonstrated  for  putting  wafers  into  and  out  of  immersion  lithography  systems.  Evaluations  of 
process  latitude;  imaging  parameters  -  depth  of  focus  (DOF),  resolution,  polarization  effects,  applicability  of  RETs; 
throughput;  and  cost-of-ownership  (COO)  models  for  LIL  need  to  be  developed. 

Liquid  immersion  lithography  has  great  potential  to  extend  optical  lithography  as  a  new  optical  enhancement  technique. 
The  liquid  immersion  lithography  work  done  in  this  work  has  shown  that  it  can  extend  current  193-nm  resist  technology 
to  its  limits.  Much  more  work  is  needed  to  fully  assess  the  manufacturing  viability  of  this  new  resolution  enhancement 
technique. 
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ABSTRACT 

The  limitations  of  a  conventional  optical  lithography  system  are  consequences  of  the  limited  spatial  frequency 
coverage  (~  NA4)  of  the  optical  system.  To  improve  the  resolution  of  printed  patterns,  Imaging  Interferometric  Lithog¬ 
raphy  (IIL)  exploits  interference  phenomena  to  produce  sub-wavelength  structures  on  the  wafer  and  provides  a  simple 
approach  to  attainment  of  the  ultimate  spatial  frequency  coverage  of  2/A,  independent  of  the  optical  system  NA.  In  the 
first  promising  experiments,  different  divisons  of  frequency  space  between  multiple  exposures  were  investigated  empiri¬ 
cally.  The  use  of  multiple  exposures  includes  a  better  coverage  than  available  from  a  single  exposure,  automated  soft¬ 
ware  routines  or  strategies  need  to  be  developed  to  find  the  optimum  setup  that  makes  the  best  compromise  against  all  of 
the  desirable  lithography  specifications.  From  a  comprehensive  grating  analysis,  we  derive  a  set  of  optimal  parameters 
for  exposure  energy  ratio  among  exposures  and  search  a  proper  set  to  improve  the  aerial  image  quality.  Additionally, 
comparing  with  partial  coherent  imaging  schemes,  we  examine  the  strengths  and  weaknesses  of  IIL  through  different 
analyses  and  these  studies  provide  additional  support  for  IIL  as  a  promising  alternative  RET  for  deep  sub-wavelength 
optical  lithography. 

Keywords:  Imaging  Interferometric  Lithography,  gratings,  Fourier  analysis,  MEEF,  OPE 


1.  INTRODUCTION 

Imaging  interferometric  lithography  (IIL)  exploits  interference  phenomena  to  overcome  the  spatial  frequency 
resolution  limitation  (~  NA IX)  imposed  by  a  conventional  optical  lithography  system,  to  improve  the  resolution  of  aerial 
images  and  produce  sub-wavelength  structures,  and  in  the  process  provides  a  simple  approach  towards  attainment  of  the 
ultimate  optical  spatial  frequency  coverage  of  2/A  1-5 .  Since  IIL  involves  the  use  of  multiple  exposures  and  off-axis  illu¬ 
mination,  automated  software  strategies  for  IIL  simulations  should  be  developed  to  find  the  optimum  solution  for  meet¬ 
ing  all  of  the  specifications  needed  for  the  lithography  process  and  to  produce  a  robust,  manufacturable  process.  For  the 
simple  example  in  Ref.  5,  where  alternative  three-exposure  IIL  strategies  containing  the  same  spatial  frequency  coverage 
were  empirically  investigated,  the  final  image  results  were  significantly  different  due  to  different  frequency  parsings 
between  the  on-axis  and  off-axis  exposures,  and  one  was  clearly  more  robust  to  alignment  and  intensity  variations.  As  a 
first  step  towards  avoiding  such  undesirable  solutions,  we  examine  pertinent  equations  in  IIL  theory  to  understand  the 
factors  that  play  major  roles  in  the  aerial  image  enhancement  process.  A  detailed  Fourier  series  analysis  of  the  IIL  on- 
axis  and  off-axis  exposures  in  one-dimensional  gratings  is  presented  in  Section  3  to  fully  understand  the  essentials  of 
constructing  a  fine-resolution  aerial  intensity.  Common  performance  measures  are  evaluated  via  simulations  in  MAT- 
LAB®.  These  measures  will  then  be  used  to  study  the  relative  strengths  and  weaknesses  of  IIL,  in  comparison  with  more 
traditional  partial-coherent  illumination  imaging.  In  Section  4  with  block  simulations,  we  extend  the  grating  analysis  into 
two  dimensions  and  an  optimization  procedure  for  illumination  energy  in  each  exposure  is  described.  Finally,  we  ana¬ 
lyze  the  results  and  conclude  with  future  study  in  Section  5. 
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2.  FORMULATE  IIL  SCHEME  WITH  ENERGY  RATIO  FACTORS 


For  Manhattan  type  of  mask  where  patterns  are  assembled  by  dif¬ 
ferent  sizes  of  rectangles,  IIL  needs  three  different  types  of  exposures:  on- 
axis  (lon ),  off-axis  along  x  (/^  x)  and  off-axis  along  y  (/^  ).  And  the 

intensity  of  the  aerial  image  from  an  IIL  scheme  can  be  expressed  as: 

im.(x>y)  =  a  •  4 Xx>y) + Io//_x(x’y) + 1off_y^y)  <4 

The  first  parameter,  a,  represents  the  energy/intensity  ratio  between  IIL  on- 
axis  and  off- axes  exposures.  Although  the  square-law  intensity  from  an 
asymmetric,  single  off-axis  exposure  covers  both  positive  and  negative  fre¬ 
quencies,  we  require  two  exposures  for  each  type  of  off-axis  exposure  along 
x  andy  to  ensure  image  symmetry  with  respect  to  focus  (e.  g.  to  restore 
telecentricity.).  With  Figure  1  showing  the  Fourier  spatial  frequency  cover¬ 
age  of  each  exposure,  a  parsing  scheme  between  lon  and  /  can  be  ex¬ 
pressed  with  three  parameters:  NAon  ■>  NAoff  are  the  numerical  aperture 
sizes  of  the  optical  system  for  the  IIL  on-axis  and  off-axis  exposures  sepa-  Figure  1 :  One-dimensional  NA  frequency 

rately  and  NA„dc  indicates  the  tilt  angle  in  the  off-axis  illumination.  In  parsings  for  IIL  scheme 

order  to  generalize  the  problem  for  any  illumination  light  source,  we  use  X  as  the  light  wavelength  to  normalize  the 
spatial  frequency  domain  shown  in  Figure  1. 

Formulating  the  IIL  scheme,  the  intensity  from  one  IIL  off-axis  exposure  along  x  is  obtained  by  using  tilted  co¬ 
herent  illumination  with  a  plane  wave  as  e~  j2nfoDcX  where  angle  9 = sin1  (NAdc) = sin1  (foDC  ■  2) .  This  off-axis  illumination 
causes  the  spatial  frequency  components  of  the  electrical  field  after  the  mask  E(f)  to  be  downshifted  to  E(f  +  foDC)  • 

These  components,  after  passing  through  the  lens  pupil  with  size  of  NA0ff  _x  ,  which  is  denoted  as  f  (f+f  )  are  in- 
terfered  with  an  extra  reference  beam,  a  plane  wave  y  .  e-J27rfoDCx  which  is  the  zero-order  component  of  the  mask  trans¬ 
mission.  The  reference  plane  wave  is  also  tilted  by  the  same  angle  6  but  with  an  intensity  ratio  of  y1  with  respect  to  the 
off-axis  illumination  intensity.  For  an  intensity  ratio  y  <  1 ,  this  can  be  produced  by  using  an  intensity  attenuator  in  the 

system  setting  for  IIL  in  Figure  2.  In  summary,  the  image  reconstruction  for  this  off-axis  exposure  can  be  treated  as  be¬ 
ing  similar  to  the  recording  process  of  holography6  and  expressed  as: 

I'offJx)  =  |f  ■' {Kff  Jfx  +  fonc)}  +  r •  e^f 

=  /+(F  ~l{Eoff_x{fx+foDc)} Mr  -l{Zoff_*(f*+fonc)}j+  (2) 

^•(F  ~'{EoffJfx+fonc)}-^X)  +  r{(F  ~'{EoffMx+foJi)-e~i2XfODCX) 

We  denote  the  information  in  the  Fourier  domain  as  /’  ^  x(fx)  =  F  j/'o//.  ^(x)}  and  F  _1  {•}  is  the  inverse  Fourier  trans¬ 
form.  Also  Eoff(fx )  is  defined  as  the  high  spatial  frequency  portion  of  the  mask’s  transmittance  function  T(fx)  that  passes 
through  the  pupil  filter  in  the  optical  system  as  Hoff{fx )  with  size  of  NAoff  •  It  can  be  expressed  as 

Eoff(fx)  =  T(fx)  •  Hoff(fx  -  foDC)  (3) 

If  T(fx)  is  calculated  from  a  binary  image  mask  and  Hoff{fx )  is  a  simple  low-pass  cutoff  pupil-place  filter,  we  perform 

off-axis  exposures  twice  at  6  and  —6  along  the  x-axis  for  telecentricity  reasons  and  then  transfer  the  result  to  the  Fou¬ 
rier  domain.  This  symmetry  becomes  critical  as  the  image  through  focus  is  investigated.  Exactly  at  focus  there  is  no  dif¬ 
ference  between  twice  the  intensity  at  one  offset  angle  or  using  both  plus  and  minus  offset  angles.  The  final  equation  for 
one  (along  x)  off-axis  exposure  is  : 

L.ffjfx)  =  F  with  °}  +  F  {l"0ff_x(x)  with  -0} 

=  2-{r2-S(fj}  +  2-[EoffJfx+foDC)®E\fff-fx+foDC)}  +  2-r-lEoffJf)  +  E\ff_x(-fx)} 


(4) 


In  our  previous  paper5,  the  second  term  in  Eq.  (4)  was  referred  to  as  the  quadratic  term  and  the  third  term  was  called  the 
linear  term  since  it  is  proportional  to  the  transmittance  function  of  the  mask. 

Now  the  ILL  scheme  in  Eq.  (1)  with  all  three  types  of  exposures  (on-axis,  off-axis  along  x  and  off-axis  along  y) 
can  be  formulated  as 

F  {inL&y)}  = 

(4  (++)}  (5) 

+2  +2 -{Eojr_x(fx +foDC,fy)®eoffJ-fx  +foDC,fy)}+2-rx  ■{Eqff_x{fx,fy)+eig_x(-fx,fy)} 

+2-{r/  +2  \Eoffy{fx,fy  +foDC)®roff_y{fx,-fy  +/4  +2  ■ry-{Eoff_yifx,fy)+roffJ-fx,fy)} 

where  three  energy  ratio  parameters  need  to  be  optimized:  a,yx  and  yy . 


2.  ONE  DIMENSIONAL  GRATING  ANALYSIS  OF  IIL 


Considering  the  transmittance  function  from  an  idealized 
grating  mask  along  x-axis  with  period  p  as: 


i 

1/2 

0 


0  <|  x  |<  s/2 
|  x  |=  s/2 
s/2  <|  x  |<  p/2 


(6) 


where  grating  space  is  centered  at  the  origin  and  space  width  is  de¬ 
noted  as  S  An  this  ID  case  IIL  only  needs  two  different  types  of 
exposure:  on-axis  (Ion )  and  off-axis  along  x  (/  t  ),  since  there  is  no 

image  information  in  the  off-axis  exposure  along  y  (  /  )  which 


only  adds  an  intensity  baseline  or  a  DC-term  to  the  aerial  image.  Figure  2:  One-dimensional  IIL  setup 

To  generalize  the  Fourier  series  in  Eq.  (6),  we  set  the  fun¬ 
damental  frequency  f  =\/p  =  \  which  is  equivalent  to  normalizing  the  mask  period  equal  to  1 ,  we  have  the  real-value 


coefficients  Ck  in 

+oo  +oo 

T(x)  =  £  ck  ■  eilnkx  =  £  (s  •  sinc(ks))  ■  ej2,lkx  (7) 

k  =  -oo  k  =  -oo 

Then  Eoff  x(x)  from  Eq.  (3)  that  through  a  simple  cutoff  pupil  filter  can  be  expressed  as 

V.W  =  F  1  [EoffMd\  =  tcr  ei2Kkx  =  Re{^//_,  W}  +  J  (*)} 

k\ 

where  kx  and  k2  are  specified  by  the  limit  of  the  pupil  H  A  f ) .  By  transforming  Eq.  (5)  back  to  the  spatial  domain,  the 
IIL  intensity  result  for  ID  grating  is  shown  as 

InL^annA')  =  « •  4„  W  +  2  •  /  +  2  •  (Ref^^x)}2  +  Im{£//r(x)|2}  +  4-y  Re{4#;c(x)}  (9) 

Since  the  advantage  of  IIL  is  in  its  capability  to  print  much  smaller  and  dense  patterns  than  the  conventional  lithography 
by  retaining  more  of  the  high  frequency  image  components  from  the  mask  in  Eo ^  x  (x)  that  are  missed  by  Ion  (x)  ,  the 

limiting-case  scenario  in  IIL  is  when  only  the  first  order  frequency  component  is  captured,  k,  =  k2  =1  in  Eq.  (8),  result¬ 
ing  in 

^  x  sinOr,?)  /  x 

Eoff  X(x)  = - (cos(2^x)  +  7sm(2^x)j 

n 

Then  in  this  case,  IIL  ID  grating  intensity  with  a  coherent  exposure  on  I (m  (x)  is 


(10) 


IIL  _  grating 


(x)  —  Id’S  +  2 •  y  +  2  • 


sin2(;rs) 


n 


+  i4 


sin(^)| 


n 


7 


•  cos(2;rx) 


[while  kj  =k2  =1] 


(11) 


Clearly,  the  first  term  is  the  DC  term  and  we  can  obtain  the  equa¬ 
tion  for  contrast/visibility  as 


V  (a,/)  = 


4  ■  y 


sin(;rs) 


a  ■  s2  +  2  ■  /2  +  2  ^  sinv;’r's) 


(12) 


[while  k[  =k2  =1] 


By  setting  5  =  0.5  for  dense  grating  as  duty  cycle  =  50%,  we  are 
able  to  generate  a  visibility  contour  plot  shown  in  Figure  3.  The 
plot  indicates  an  important  conclusion:  for  lithography  purpose,  we 
would  like  to  adjust  the  energy  ratio  parameters  (a9y)  to  gain  the 

best  image  contrast  as  possible.  Furthermore,  while  searching  the 
optimum  solutions  of  energy  ratio  for  a  2D  mask,  a  proper  probing 
range  for  a  is  (0  ...  4)  and  y  is  (0  ...  1.5). 


Visibility  contour  for  fine  gratings  with  duty  cycle  =  50% 


0.4  0.6  0.8  1  1.2  1.4  1.6 

y  parameter  for  the  off-axis  exposure 


Figure  3:  Visibility  contour  plot  for  ID  grating 


Comparing  the  aerial  image  analysis  of  one-dimensional 
gratings  from  both  IIL  and  partial-coherent  illumination  schemes 
can  help  us  to  understand  the  essential  differences  between  these 
two  imaging  approaches.  For  simulation  convenience  and  as  a  set¬ 
ting  for  the  rest  of  this  paper,  we  set  NAon  =  0.8  and  NA()jr  =  0.8  for  IIL  optical  system  and  set  the  illumination  off-axis 


angle  NAqdc  =  0.84  in  order  to  have  higher  frequency  coverage  of  the  mask.  The  illumination  light  source  has  wave¬ 


length  X  in  general.  As  to  the  partial  coherent  illumination  scheme,  NA  =  0.8  and  the  partial  coherence  factor  G  is  set 
to  0.6.  [is  this  a  scalar  calculation  or  do  you  treat  polarization?]  Furthermore,  to  quickly  review  printed  pattern  re¬ 
sults  directly  from  aerial  images,  a  constant  threshold  model8  (CTM)  for  photoresist  with  a  threshold  value  Tho  was 

applied  for  all  simulations.  The  Tho  values  for  critical  dimension  (CD)  measurements  in  Figure  4  are  different  for  the 


partial  coherent  (  Tho  =  0.3)  and  IIL  (  Tho  =  0-6  or  0.7),  since  IIL  has  additional  intensity  baseline  2 y2  in  Eq.  (9)  that 
requires  a  high  contrast  resist.  For  analysis  on  frequency  parsing  strategies,  we  do  IIL  simulations  with  two  different 
coverage  configurations:  (i)  allowing  overlapped  between  on- axis  and  off-axis  coverage  without  any  pupil  filtering,  (ii) 
no  overlapping  coverage  in  between  while  applying  moon-shaped  pupil  filters  (circular  filters  with  a  second  overlapped 
circle  missing,  see  Fig.  4)  during  the  off-axis  exposures. 

The  first  study  of  interest  is  the  difference  between  printed  patterns  and  the  grating  masks  as  optical  proximity 
error  (OPE)  measurement  at  different  duty  cycles.  The  OPE  plots  in  Figure  4  show  that  IIL  in  both  configurations  obvi¬ 
ously  can  print  more  smaller  sub-wavelength  features  than  the  conventional  partial  coherent  scheme.  To  study  the  sensi¬ 
tivity  of  the  printed  critical  CD  on  the  wafer  as  we  gradually  change  the  reticle  CD  in  gratings,  we  can  use  the  MEEF9 
concept  and  measure  the  results  when  the  gratings’  dark  lines  approach  the  sub-wavelength  limit.  Shown  in  Figure  4,  the 
MEEF  values  from  the  partial  coherent  illumination  are  gradually  diverging  from  ideal  value  one;  but  IIL  with  help  from 
the  off-axis  exposures  can  have  the  deviation  happen  later  at  a  smaller  sub-wavelength  feature,  which  means  more  fine 
features  can  be  printed  with  IIL  technique. 

In  Figure  4(c),  IIL  with  moon-shaped  pupil  filters  has  the  cutoff  frequency  between  on-axis  and  off-axis  expo¬ 
sures  at  0.8  /  A  .  This  abrupt  frequency  transition  cause  MEEF  lines  to  go  up  and  down  irregularly  around  one,  not  as 
smooth  as  MEEF  lines  for  the  partial  coherent  scheme  in  Figure  4(a).  To  reduce  the  abrupt  transition  between  on-axis 
and  off-axis  exposure  coverage,  a  certain  degree  of  overlapping  between  on-axis  and  off-axis  coverages  should  be  con¬ 
sidered  as  the  case  in  Figure  4(b)  without  pupil  filters,  which  MEEF  lines  shows  less  roughness  than  in  Figure  4(c).  Fur¬ 
thermore  by  using  apodized  pupil  filter  designs  such  as  Gaussian  pupil  filters,  a  smoother  transition  can  be  expected  in 
the  MEEF  and  OPE  plots. 

As  to  the  depth  of  focus  (DOF)  analysis,  since  the  reference  beam  in  IIL  coming  from  a  large  incident  angle 
upon  the  wafer,  IIL  theoretically  have  less  DOF  than  the  partial  coherent  illumination  for  larger  patterns,  [not  correct,  if 
only  the  off  axis  beam  comes  in  at  high  angle  and  the  information  beams  are  at  a  lower  angle,  the  depth-of-field 
corresponds  to  the  information  beams,  the  best  situation  is  symmetrized  with  the  reference  beam  from  one  side 


and  the  information  beams  centered  at  the  negative  angle,  you  haven’t  shown,  or  probably  done,  any  calc,  on 
DOF!]  But  overall  on  sub-wavelength  gratings,  IIL  has  better  exposure  latitude  performance  than  the  partial  coherent 
illumination  method  due  to  aerial  image  enhancement  (higher  visibility)  in  IIL.  However  in  optical  lithography,  the  defi¬ 
ciency  of  DOF  is  becoming  less  critical,  since  it  can  easily  overcome  by  reducing  the  thickness  of  the  photoresist  layer 
and  also  can  be  overcome  by  advance  pupil  filter  designs10.  Therefore,  this  IIL  drawback  is  more  than  compensated  by 
the  benefits  offered  by  IIL  for  sub-wavelength  masks. 
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(a)  Partial  coherent  illumination  scheme 
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(b)  IIL  scheme  without  pupil  filters 
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Figure  4:  OPE  and  MEEF  measurements  with  different  illumination  schemes 

3.  SEARCHING  OPTIMUM  ENERGY  RATIOS  BY  2D  GRATING  ANALYSIS 


To  find  suitable  energy  ratios  y  and  Y  in  Eq.  (5)  for  individual  /  and  /  ,  ID  grating  analysis  is  not  suf- 

ficient  for  processing  2D  masks,  because  yx  and  y  are  correlated  to  each  other  and  not  always  the  same  due  to  the  ori¬ 
entation  of  patterns.  Furthermore,  running  the  whole  mask  simulations  according  to  different  ^  ^  ?  energy  ratios 

would  be  very  time-consuming  and  would  rapidly  become  impractical  as  the  mask  becomes  more  complicated.  There¬ 
fore,  we  need  a  routine  to  cut  down  the  searching  time  for  optimum  energy  ratios  first  by  doing  simpler  simulations  on 
simple  2D  structures  that  are  related  to  the  mask. 

Giving  a  Manhattan  type  of  2D 
mask  in  Figure  5(a),  the  mask  is  assembled 
by  different  size  of  rectangles.  For  this  rea¬ 
son,  doing  analysis  on  each  different  2D 
block  (rectangle)  extracting  from  the  mask, 
as  shown  in  Figure  5(b),  can  help  us  to  first 
find  some  suitable  sets  of  [a,y  ,y  )  from 

numerous  possible  solutions  and  later  indi¬ 
vidually  apply  and  analyze  their  perform¬ 
ances  on  the  actual  2D  mask.  In  order  to 
consider  the  difference  between  dense  and 
isolated  features,  we  can  set  different  Period  Figure  5:  2D  grating  analysis  from  a  2D  Mask 

X  and  Y  in  Figure  5(b)  for  different  duty 
cycles  along  X  and  Y  axes.  The  performance 

measure  on  a  printed  block  pattern  in  Figure  5(c)  is  done  by  an  error  measure  function  of  yx,  yy )  •  It  includes  three 

common  analyses  used  in  lithography:  (i)  overlay  area  between  the  printed  and  desired  pattern,  (ii)  line-end  shortening 
(LES)  effect,  and  (iii)  comer  rounding  (CR)  effect.  Since  a  resist  CTM  with  a  parameter  Th0  is  used,  by  using  different 
Th0  value  for  CTM  can  be  related  to  different  exposure  energy  we  actually  apply.  Hence,  the  optimization  procedure 
should  output  four  numerical  numbers  (a,y  ,y  ,77z0)  for  actual  exposure  energy.  After  obtaining  block  simulation  results, 

we  can  quickly  evaluate  the  performance  based  on  one  overlay  area  measurement,  two  LES  along  x  and  y,  and  one  CR 
measurement.  Then  depending  on  results  from  the  error  measure  function,  we  can  mle  out  bad  energy  settings  that  cause 
fatal  errors  and  sort  the  suitable  results  by  performance.  At  the  final  stage  we  pick  several  best-performing  settings  from 
block  simulations  and  mn  them  again  with  the  whole  2D  mask.  By  evaluating  the  error  measure  based  on  the  de- 


sired/target  2D  patterns,  we  can  pick  the  optimum  energy  ratio  set  for  IIL  and  the  Th0  value  for  exposure  energy.  [I  like 
this  idea,  you  will  need  to  flesh  it  out  more  in  another  place.] 

A  simulation  example  is  given  by  setting  CDmin  =  0.4A,  with  the  mask  in  Figure  5(a),  where  we  have  set  IIL  op¬ 
tical  system  with  NAon=0.8,  NAoff_x=NAoff_y=0.8  and  NAoDC  x=  NAoDC_y=  0.84  for  simulations.  After  the  optimization 
procedure,  we  obtained  optimum  values  for  energy  ratios  as:  (a  =  1.0,y  =  0.8,  y  =  0.8, Th0  =  0.7)  without  pupil  filters  and 

(a  =  1.0,  y  =  0.5,  y  =  0.5,  Th0  =  0.7)  with  moon-shaped  filters  and  the  results  of  comparing  IIL  with  partial  coherent  illumi¬ 
nation  are  shown  in  Figure  6.  Clearly  the  partial  coherent  illumination  scheme  is  not  able  to  resolve  a  sub-wavelength 
pattern  but  both  IIL  schemes  can.  Comparing  two  IIL  schemes,  even  though  Figure  6(c)  IIL  result  with  the  hard  cutoff 
pupil  filters  is  better  than  Figure  6(b),  the  aerial  intensity  profile  on  both  X  and  Y  slices  contain  more  ripples  that  could 
result  in  small  holes  in  a  large  printed  pattern  due  to  the  energy  variations.  To  increase  the  energy  latitude  or  contrast  in 
IIL,  depending  on  the  actual  mask  in  use,  careful  frequency  parsing  strategy  should  be  made,  [again  need  more  details 
on  model,  vector?,  switch  polarization  x  and  y?....] 


4.  CONCLUSIONS  AND  FUTURE  STUDY 

The  ID  and  2D  grating  analysis  based  on  the  IIL  model  with  energy  ratio  parameters  have  been  very  useful  to 
understand  the  behavior  of  IIL  on-axis  and  off-axis  exposures  and  shows  the  potential  advantage  that  IIL  can  offer  for 
nanotechnology.  Besides  illustrating  that  IIL  can  be  treated  as  a  useful  resolution  enhancement  technique  (RET)  for  sub¬ 
wavelength  patterns,  at  this  point  we  have  learned  how  to  keep  a  safe  threshold  margin  with  new  parameter  restrictions 
and  observed  the  abrupt  changes  when  one  spatial  frequency  component  moves  from  one  exposure  coverage  to  the  other. 
To  smooth  those  jumps  in  MEEF  and  OPE  plots,  we  need  to  take  further  steps  in  the  frequency  parsing  technique  by 
finding  the  optimum  pupil  filters  for  IIL  on-axis  and  off-axis  exposures. 

Beyond  ID  CD  measure  analysis,  the  optimization  strategy  for  IIL  parameters  in  2D  mask  case  has  included 
quantification  of  image  quality  based  on  overlay,  LES  and  CR.  By  doing  2D  block  simulations  first  from  the  mask,  the 
strategy  showed  reliable  results  with  a  given  example.  Currently  we  are  focusing  on  more  accurate  IIL  rigorous  two- 
dimensional  aerial  image  modeling  and  possible  hybrid  schemes  such  as  advance  pupil  filter  designs  and  the  IIL-OPC 
technique. 
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(a)  Partial  coherent  illumination  scheme 
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(b)  IIL  scheme  without  pupil  filters 


(c)  IIL  scheme  with  moon-shape  pupil  filters 


Figure  6:  Simulation  results  under  different  illumination  schemes 
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1  Introduction 

The  relentless  demands  of  Moore’s  law  on  lithography  are 
well  known.  The  current  industry  roadmap1  calls  for  initial 
volume  manufacturing  of  the  45-nm  half-pitch  node  in  the 
2005  to  2007  time  frame,  which  demands  a  decision  on  the 
manufacturing  technology  in  the  near  future.  It  now  ap¬ 
pears  that  next-generation  lithography  techniques  will  be 
hard  pressed  to  meet  this  time  line,  and  attention  has  turned 
to  extensions  of  optical  lithography.  Immersion 
lithography,2-10  particularly  using  water  as  an  immersion 
fluid  at  193  nm,  has  attracted  quite  a  bit  of  recent  attention 
as  a  promising  optical  lithography  extension. 

The  minimum  half-pitch  available  to  a  lithography  tool 
is  given  by  the  well-known  equation, 


A half- min  4„  sin  <9  ’  W 

where  X  is  the  vacuum  exposure  wavelength,  and  n  is  the 
refractive  index  of  the  immersion  medium.  The  two  beams 
propagate  in  the  immersion  medium  at  angles  of  ±0  with 
respect  to  the  wafer  surface  normal  and  interfere  to  create  a 
sinusoidal  intensity  pattern.  For  193-nm  exposures  with  an 
immersion  medium  of  water  (?z  =1.44)  and  a  maximum 
sin  6  of  0.9,  this  minimum  half-pitch  is  37  nm,  suggesting 
that  a  system  with  these  capabilities  should  be  capable  of 
printing  45-nm  half-pitch  patterns. 

Of  course,  imaging  is  more  complex  than  simply  evalu¬ 
ating  the  minimum  feature  size.  Much  effort  has  gone  into 
various  resolution  enhancement  techniques  (for  a  recent  re¬ 


view  of  resolution  enhancement  techniques  see  Ref.  11) 
such  as  optical  proximity  correction  (OPC),  off-axis  illumi¬ 
nation  (OAI),  and  phase-shift  masks  (PSMs).  A  purpose  of 
this  paper  is  to  point  out  that  at  the  very  large  propagation 
angles  in  the  photoresist  necessary  to  image  patterns  at  the 
45-nm  node  using  a  193-nm  source,  the  optical  response 
becomes  very  polarization  sensitive.  This  will  require  that 
the  high  spatial  frequencies  in  the  x  and  y  directions  be 
printed  with  a  different  polarization  to  retain  the  necessary 
contrast.  This,  in  turn,  will  require  some  form  of  multiple 
exposure  lithography  to  enable  the  necessary  polarization 
control.  Two  imaging  schemes  that  lend  themselves  to  this 
constraint  are  imaging  interferometric  lithography12,13 
(IIL),  which  uses  OAI  at  the  limits  of  the  pupil  along  with 
pupil-plane  filters  to  control  the  image  fidelity,  and  dipole 
illumination,  where  different  masks  are  used  for  the  high- 
spatial-frequency  content  in  the  x  and  y  directions  of  a 
Manhattan  geometry  pattern.  Both  of  these  techniques  offer 
the  required  independent  control  of  the  polarization  for  the 
v  and  y  direction  exposures. 

Section  2  discusses  the  polarization  effects  in  immersion 
lithography.  Section  3  presents  modeling  results  for  partial 
coherent  illumination,  IIL,  and  dipole  illumination  for  a 
simple  nested  L-bar  test  structure.  Finally,  conclusions  and 
future  research  directions  are  outlined  in  Sec.  4. 

2  Polarization  Effects  in  Immersion  Lithography 

Just  as  evaluating  the  highest  spatial  frequency  available 
with  an  exposure  tool  is  most  straightforward  in  the  case  of 
a  periodic  grating  pattern,  polarization  effects  are  most 
readily  understood  in  terms  of  a  simple  interferometric  li- 
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thography  (IL)  exposure  with  two  beams  symmetrically  in¬ 
cident  on  the  wafer.14  It  is  straightforward  to  calculate  the 
interference  intensity  pattern  for  two-beam  IL  with  equal 
intensity  beams  with  for  TE  and  TM  polarizations.  The 
results  are 


7teM  —  2/0 

7tmM  =  2/0 


1  +  cos 


4  TTTIVq sjst  sin  ^resist 


l+cos(2  0resist)cos 


4  sin  $resist 


(2) 


Here,  /0  is  the  intensity  of  each  beam  and  the  subscripts 
emphasize  that  the  refractive  index  and  internal  angles 
within  the  resist  are  used  in  evaluating  these  expressions. 
The  TE  result  retains  full  contrast  over  the  entire  accessible 
half-pitch  range  independent  of  0resist  •  The  extra  factor  of 
cos(20resist)  that  appears  in  the  TM  expression  has  dramatic 
consequences  for  lithography.  For  0resist=45  deg  [numeri¬ 
cal  aperture  (NA)  =  1.21  at  a  photoresist  refractive  index  of 
1.71],  the  contrast  of  the  interference  pattern  goes  to  zero, 
and  at  higher  angles  there  is  a  180-deg  phase  shift  so  that 
lines  and  spaces  are  interchanged.  This  reduction  in  con¬ 
trast  and  phase  reversal  arises  because  both  the  v  directed 
and  the  z  directed  fields  interfere  independently  for  the  TM 
polarization.  There  is  a  90-deg  phase  shift  between  the  two 
contributions,  with  the  v  directed  fields  dominating  at  small 
angles  ( 0resist<  45  deg)  and  the  z  directed  fields  at  high 
angles  ( /9resist>  45  deg) . 

Even  at  the  high  NAs  of  current  nonimmersion  lithogra¬ 
phy  tools,  polarization  effects  have  yet  to  be  of  great  sig¬ 
nificance  for  imaging.  Unpolarized  or  circularly  polarized 
illumination,  which  averages  the  response  of  the  two  pos¬ 
sible  polarizations,  has  been  adequate  for  most  imaging. 
This  is  because  the  immersion  fluid  today  is  air  (n=  1), 
while  the  index  of  a  typical  resist  is  considerably  higher 
(^resist~  1-71).  As  a  consequence  of  Snell’s  law,  the  propa¬ 
gation  angles  in  the  resist  are  significantly  smaller  than  in 
the  air.  For  a  period  corresponding  to  the  90-nm  half-pitch 
node,  sin  0—0.54  the  propagation  angle  in  air  is  32  deg, 
while  Assist  is  only  18  deg  and  the  corresponding  contrast 
for  TM  polarization  is  84%.  However,  for  the  45-nm  half¬ 
pitch  node,  the  propagation  angle  of  48  deg  in  the  immer¬ 
sion  fluid  corresponds  to  0resist~  59  deg,  where  the  contrast 
for  TM  polarization  is  reduced  to  21%. 

This  is  illustrated  in  Fig.  1(a),  which  shows  the  intensity 
distribution  versus  position  in  the  resist  for  periods  corre¬ 
sponding  to  the  90- ,  65-  and  45-nm  half-pitch  nodes.  The 
top  trace  is  for  TE  polarization,  which  retains  its  contrast 
independent  of  the  pitch.  The  TM  curves  for  the  90- ,  65-, 
and  45-nm  half-pitch  nodes  at  193  nm  are  also  shown.  Fig¬ 
ure  1(b)  shows  the  pitch  dependence  of  the  TM  visibility, 


fringe  visibility  (7max  7min)/(/max+/min).  (3) 


The  dashed  vertical  lines  correspond  to  the  center  spatial 
frequencies  for  the  three  upcoming  half-pitch  nodes.  The 
shaded  region  (half-pitch  <54  nm)  is  accessible  only  with 
immersion.  Plotted  against  half-pitch,  the  curve  is  univer¬ 
sal,  dependent  only  on  the  optical  wavelength  and  indepen¬ 
dent  of  the  immersion  fluid  optical  properties.  The  cusp  at  a 
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Fig.  1  (a)  Intensity  profiles  in  resist  for  65-nm  half-pitch  TE  polar¬ 
ization  (top)  with  full  contrast,  and  for  90-,  65-,  and  45-nm  half-pitch 
patterns  with  TM  polarization.  The  decrease  of  contrast  is  severe, 
(b)  Visibility  of  the  TM  polarization  pattern  in  the  resist  as  a  function 
of  half-pitch.  The  shaded  region  is  accessible  only  with  immersion. 


half-pitch  of  40  nm  corresponds  to  0resist=45  deg  and  to  the 
phase  reversal  region  for  smaller  half-pitches.  For  the  cen¬ 
ter  frequency  corresponding  to  the  90-nm  node,  the  TM 
visibility  is  84%,  dropping  to  62%  for  the  65-nm  node  (NA 
=0.74)  and  21%  for  the  45-nm  node  (NA=1.07).  The  low 
TM  contrast  at  the  45-nm  node  and  the  proximity  to  the 
phase  reversal  region  implies  that  any  lithography  tech¬ 
nique  will  be  required  to  use  TE  polarization  exclusively 
for  the  highest  spatial  frequencies  in  the  image.  The  devel¬ 
oped  photoresist  line-edge  roughness  is  strongly  dependent 
on  the  visibility15  so  that  this  loss  of  contrast  impacts  the 
quality  of  the  image  in  terms  of  both  resolution  and  line- 
edge  quality. 

The  immersion  fluid  also  impacts  the  reflection  at  the 
immersion  medium-photoresist  interface.  Figure  2  shows 
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Fig.  2  Reflectivity  at  the  fluid:photoresist  interface  for  both  TE  and 
TM  polarizations  as  a  function  of  half-pitch.  The  dashed  horizontal 
line  corresponds  to  a  10%  reflectivity.  Water  immersion  both  en¬ 
ables  a  smaller  half-pitch  and  reduces  the  reflectivity  to  negligible 
levels  over  the  entire  range  of  spatial  frequencies  required  for  the 
45-nm  half-pitch  node. 


the  calculated  reflection  for  both  TE  and  TM  polarizations 
for  the  two  cases  of  air  (n  =  1 )  and  water  (n=  1.44)  as  the 
immersion  fluid,  with  a  photoresist  refractive  index  of  1 .7 1 . 
Water  immersion  both  enables  a  smaller  half-pitch  and  re¬ 
duces  the  reflectivity  to  negligible  levels  for  the  entire 
range  of  spatial  frequencies  required  for  the  45-nm  half¬ 
pitch  node.  As  expected,  the  reflectivity  for  both  polariza¬ 
tions  approaches  unity  at  grazing  incidence.  The  unity 
transmission  (0%  reflection)  into  the  resist  at  Brewster’s 
angle  for  TM  polarization  is  evident  in  both  TM  curves. 

3  Immersion  Lithography  Simulations  for 
Arbitrary  Patterns 

All  of  the  simulations  presented  in  the  following  are  for  the 
pattern  shown  in  Fig.  3  consisting  of  a  series  of  five  nested 
L-bars  at  a  45-nm  half-pitch  CD  and  a  large  box  at  10XCD. 
For  the  nested  features,  the  middle  feature  extends  beyond 
the  others  (a  different  amount  in  the  x  and  y  directions  to 
further  break  symmetry)  to  give  some  indication  of  dense 
isolated  biases.  For  partial  coherent  illumination  and  for 
imaging  interferometric  lithography,  this  mask  is  used  with¬ 
out  any  OPC  or  other  changes.  For  dipole  illumination,  the 
features  are  split  into  two  separate  masks,  as  described 
next.  Extension  of  this  modeling  to  a  variety  of  additional 
structures  of  relevance  to  semiconductor  manufacturing 
will  be  the  subject  of  future  work. 

Simulations  were  carried  out  with  Prolith™  Version 
7.2.2  (KLA-Tencor)  in  full  vector  electromagnetic  mode 
and  with  the  full  physical  model  for  the  photoresist  (PEB 
diffusion,  3-D  etching).  In  all  cases,  the  simulations  used 
the  same  film  stack.  Substrates  were  bare  silicon  with  a 
30-nm-thick  bottom  antireflection  coating  (BARC)  ( n 
=  1.73  +  f 0.395)  and  a  50-nm-thick,  positive  photoresist 
(n  =  1 .7 135  +  f  0.0169;  dill  parameters  A  =  0,  B 
=  1.1  /zm~ 1 ,  C  =  0.0455  cm-2/mJ) .  Standard  postexposure 
bake  (125°C  for  90  s)  and  develop  conditions  (60  s)  were 
used  for  all  simulations. 


10  CD 


Fig.  3  Test  pattern  used  for  the  simulations. 


Two  techniques  were  used  to  force  the  calculations  to 
accommodate  the  important  immersion  physics.  First,  an 
adaptation  to  immersion  was  employed — the  wavelength 
was  reduced  by  the  immersion  fluid  index  (193/1.44=134 
nm),  the  NA  was  replaced  by  NA/^immersion,  and  all  of  the 
indices  of  the  materials  in  the  film  stack  (including  the  Si) 
were  replaced  by  nmm/nimmeTsion.  As  the  previous  section 
indicated,  separating  the  exposures  of  the  high  spatial  fre¬ 
quencies  in  the  x  and  y  directions  and  using  orthogonal 
polarizations  for  the  two  cases,  so  that  all  exposures  are 
dominantly  TE  polarization,  will  be  key  to  imaging  at  the 
limits  of  immersion  lithography.  The  simulations  presented 
next  bear  this  out.  The  present  version  of  Prolith  does  not 
allow  this  flexibility.  The  approach  used  was  to  carry  out 
the  exposures  individually,  with  the  correct  polarizations, 
up  to  the  aerial  image,  extract  and  combine  the  aerial  im¬ 
ages  external  to  Prolith,  and  then  reinsert  the  composite 
aerial  image  and  allow  Prolith  to  carry  out  the  exposure  and 
resist  develop  simulations. 

In  the  following  sections,  simulation  results  for  three 
approaches  to  imaging  the  45-nm  half-pitch  node  at  193  nm 
using  water  immersion  are  presented.  First  is  traditional 
partial-coherent  illumination.  Second  is  IIL,  essentially  off- 
axis  illumination  at  the  edge  of  the  pupil  with  a  small  par¬ 
tial  coherence,  along  with  imaging  pupil-plane  filters  to  en¬ 
sure  uniform  weighting  of  all  spatial  frequencies.  Finally, 
imaging  with  dipole  illumination  (DI) — again  at  the  edges 
of  the  pupil  with  a  small  partial  coherence — with  two 
masks  that  separate  out  the  high  x  and  high  y  spatial  fre¬ 
quency  parts  of  the  image  is  evaluated.  In  both  the  IIL  and 
DI  cases,  the  polarization  was  rotated  between  exposures  to 
ensure  predominately  TE  polarization  exposures  for  the 
high  spatial  frequencies.  OPC  was  not  explored  even 
though  it  can  be  used  to  improve  the  results  as  it  is  impor¬ 
tant  to  understand  the  baseline,  non-OPC,  case  before  add¬ 
ing  the  complications  and  multiple  variations  possible  to 
improve  the  images.  To  bound  the  content  of  this  paper, 
PSMs  were  not  explored  since  both  IIL  and  DI  capture  the 
highest  spatial  frequencies  possible  with  a  given  optical 
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(a) 


Fig.  4  Partial  coherent  imaging  result  for  o-=0.7:  (a)  the  top  view  of 
the  real-space  Prolith  simulation  and  (b)  the  corresponding  fre¬ 
quency  space  picture.  The  outer  circle  is  the  limiting  spatial  fre¬ 
quency  with  a  radius  of  2 n/\.  The  inner  dashed  circle  is  the  maxi¬ 
mum  spatial  frequency  coverage  of  the  tool  with  a  radius  of 
(2 n  sin  e/X).  The  shading  inside  this  circle  is  an  approximate  repre¬ 
sentation  of  the  modulation  transfer  function,  which  decreases  from 
the  center  of  the  circle  to  the  edge.  The  contour  plot  is  the  intensity 
Fourier  transform  of  the  mask.  The  pattern  is  not  resolved  because 
of  the  loss  of  contrast  at  the  high  spatial  frequencies  characteristic 
of  partial  coherent  imaging  and  the  poor  TM  polarization  response  at 
high  angles. 


system  and  hence  are  sufficient  to  show  that  the  45-nm 
node  is  accessible  for  immersion  lithography  at  193  nm. 
PSM  also  enables  printing  of  these  high  spatial  frequencies 
and  it  will  be  important  to  investigate  its  applicability  in 
future  work. 

3.1  Partial  Coherent  Illumination 

Figure  4(a)  shows  the  top-down  view  of  the  developed  im¬ 
age  for  a  partial  coherence  (ratio  of  the  illumination  pupil 
NA  to  the  imaging  pupil  NA)  of  <r=  0.7,  circularly  polar¬ 
ized  illumination  using  a  bright  field  mask  and  positive 
resist.  Clearly  the  dense  line/space  structures  of  the  nested 
L-bars  are  not  captured  in  this  image,  which  rather  shows 
only  the  general  outlines  of  the  entire  pattern.  This  is  true 
across  the  entire  available  range  of  possible  partial  coher¬ 
ence  (0<o-<°°).  The  reason  for  this  is  clear  with  reference 
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to  Fig.  4(b),  showing  the  frequency  space  situation.  The 
outer  circle  in  Fig.  4(b)  represents  the  frequency  space  limit 
of  the  immersion  medium  with  a  radius  of  2 nIX.  This  is  the 
maximum  spatial  frequency  that  can  be  imaged  through  the 
immersion  medium,  corresponding  to  counterpropagating 
plane  waves  at  grazing  incidence  to  the  wafer.  The  inner 
dotted  circle  represents  the  maximum  frequency  allowed  by 
the  optical  system  with  sin  0=0.9  and  with  radius 
2  n  sin  6/X.  The  shaded  region  in  the  center  of  these  circles 
is  a  rough  representation  of  the  frequency  space  coverage 
associated  with  partial  coherent  imaging.  Darker  regions 
correspond  to  a  better  transfer  of  the  frequency  components 
from  the  mask  to  the  image  (effectively  a  higher  modula¬ 
tion  transfer  function).  With  cr=  0.7  and  NA=  1.3,  the  maxi¬ 
mum  frequency  response  extends  to  (l  +  cr)NA/\ 
=  1.53^/X  compared  to  the  half-pitch  center  frequency  of 
1/2  CD—  \A9nl\.  Finally,  the  contour  plot  in  the  figure  is 
the  magnitude  of  the  Fourier  transform  of  the  mask,  scaled 
to  the  imaging  conditions.  Corresponding  to  the  Manhattan 
(v,y)  geometry  of  the  pattern,  the  spectral  content  is  domi¬ 
nantly  along  the/x  and  fy  directions.  There  are  five  regions 
of  significant  frequency  content.  One  is  around  the  origin, 
corresponding  to  the  low-frequency  content  of  the  large 
feature  and  the  overall  shape  of  the  pattern.  There  are  in¬ 
tense  frequency  components  centered  about  the  (±)  fre¬ 
quencies  corresponding  to  the  pitch  of  the  dense  lines  and 
spaces  in  both  the  fx  and  fy  directions.  Partial  coherent 
illumination  has  an  effective  modulation  transfer  function 
that  is  highest  at  low  frequencies  and  approaches  zero  at  the 
highest  frequencies.  The  maximum  frequency  response  for 
cr=0.7,  with  a  low  effective  transfer  function,  is  roughly  at 
the  center  frequency  corresponding  to  the  lines  and  spaces 
of  the  45-nm  half-pitch  node.  This  results  in  an  under¬ 
weighting  of  the  high-frequency  image  components  and  the 
lack  of  resolution  for  the  individual  lines  and  spaces.  The 
polarization  effects  noted  in  the  previous  section  addition¬ 
ally  degrade  the  partial  coherence  image  since  half  of  the 
exposure  intensity  for  the  high  frequencies  is  in  the  TM  or 
“wrong”  polarization,  which  has  very  poor  contrast  and 
further  degrades  the  transfer  function. 

3.2  IIL 

IIL  was  introduced12,13  as  a  technique  to  cover  the  extremes 
of  frequency  space  by  tilting  the  illumination  as  far  as  pos¬ 
sible  off-axis.  With  the  modest  NAs  of  —0.5  to  0.65  that 
were  available  at  the  time,  this  required  tilts  that  were  be¬ 
yond  the  limits  of  the  imaging  pupil  and  consequently  re- 
introduction  of  a  zero-order  beam  on  the  image  side  of  the 
lens.  With  today’s  very  high  NA  lenses  of  sin  0—0.9,  the 
gain  for  the  significant  added  optical  complexity  of  build¬ 
ing  interferometers  around  the  lens  is  small  and  it  is  suffi¬ 
cient  to  set  the  offset  to  the  largest  value  allowed  by  the 
lens  NA.  Thus,  IIL  is  similar  to  quadrupole  off-axis  illumi¬ 
nation  with  a  small  partial  coherence  for  each  off-axis  illu¬ 
mination  beam,  and  with  the  quadrupole  oriented  along  the 
principal  (x,y)  axes  of  a  Manhattan  geometry  pattern  to 
maximize  the  spatial  frequency  capture.  The  polarization  of 
each  of  the  off-axis  illumination  beams  can  be  indepen¬ 
dently  controlled  to  optimize  the  imaging.  Quadrupole  off- 
axis  illumination  inherently  emphasizes  low  frequencies  at 
the  expense  of  high  frequencies13,16  because  each  of  the 
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(b) 


Fig.  5  ML  modeling:  (a)  the  final  real-space  image,  where  the  hori¬ 
zontal  line  denotes  the  position  at  which  the  intensity  profile  shown 
later  in  Fig.  12  is  taken;  (b)  the  pupil  filters  used  in  the  off-axis 
exposures;  and  (c)  the  resulting  frequency-space  coverage.  The 
gray  shadings  simply  denote  the  frequency  coverage,  the  effective 
modulation  transfer  function  (MTF)  for  the  linear  imaging  terms  is 
uniform  across  the  image.  The  polarization  of  the  individual  expo¬ 
sures  is  indicated. 


exposures  covers  the  same  low-frequency  information, 
while  the  individual  high  frequencies  are  covered  only  in 
selected  exposures.  Pupil  plane  filters  can  be  used  to  elimi¬ 
nate  these  multiple  coverages  and  thereby  provide  an  im¬ 
proved  image,  close  to  the  ultimate  frequency- space  limits. 
Thus,  IIL  for  NA— 1.3  becomes  quadrupole  OAI  with  low 
a  and  tilts  to  the  edges  of  the  imaging  pupil  along  with 
pupil-plane  filters  to  eliminate  the  overrepresentation  of 
low  spatial  frequencies. 

Figure  5  shows  the  result  of  this  procedure  for  the 
45-nm  half-pitch  pattern.  The  pattern  [Fig.  5(a)]  is  well 
resolved.  The  impacts  of  the  limited  resolution  include 
rounding  of  the  line  edges  and  some  line  edge  ripple  that 
results  from  the  coherent  exposures.  Both  of  these  issues 
could  be  addressed  with  the  use  of  modest  OPC  on  the 
mask.  The  horizontal  line  denotes  the  position  at  which  the 
intensity  profiles  discussed  in  Sec.  3.3  (Fig.  12)  are  taken. 


(b) 


Fig.  6  Exposures  identical  to  Fig.  5  with  the  exception  of  the  polar¬ 
ization.  The  impact  of  using  the  “wrong”  polarization  for  the  high 
spatial  frequencies  is  clear. 

The  pupil  plane  filters  used  in  this  simulation  are  shown  in 
Fig.  5(b).  The  low-frequency  (center  filter)  exposure  is  car¬ 
ried  out  at  a  lower  NA  of  1.15.  The  offset  exposures  (left 
and  right  filters)  are  at  the  full  NA  of  1.3  with  blocked 
regions  to  avoid  overlaps  in  frequency  coverage  between 
the  various  exposures.  Coherent  illumination  is  assumed 
(o-~0.02)  for  all  of  the  exposures.  Two  offset  exposures 
from  opposite  directions  are  included  in  the  simulation  for 
both  the  v  and  y  offset  directions  to  restore  a  telecentric 
response.  This  has  no  impact  at  focus,  but  is  important  for 
the  depth-of-field  evaluation  discussed  later.  The  polariza¬ 
tion  is  controlled  in  each  of  the  exposures,  as  indicated  in 
Fig.  5(c)  to  ensure  a  dominant  TE  polarization,  especially 
for  the  highest  spatial  frequencies.  Figure  5(c)  shows  the 
resulting  spatial  frequency  coverage  with  the  intensity  Fou¬ 
rier  transform  of  the  image  superimposed.  The  different 
gray  levels  in  this  figure  simply  distinguish  individual  ex¬ 
posure  coverages.  The  effective  MTF  for  the  linear  imaging 
terms  is  uniform  across  the  entire  accessible  frequency 
space.13 

The  impact  of  polarization  control  is  shown  in  Fig.  6. 
This  result  is  for  the  same  pupil  filters  as  for  the  result  of 
Fig.  5,  the  only  difference  is  that  a  fixed  polarization  was 
used  for  all  exposures.  The  impact  of  having  the  “wrong” 
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80  nm  out  of  focus 


(a) 


100  nm  out  of  focus 


Fig.  8  (a)  Result  of  ML  exposures  without  pupil  plane  filters,  where 
the  dense  features  are  not  resolved,  and  (b)  the  corresponding  fre¬ 
quency  space  coverage.  The  multiple  coverage  of  the  low-frequency 
regions  leads  to  the  lack  of  resolution. 


130  nm  out  of  focus 

Fig.  7  Evolution  of  the  ML  image  of  Fig.  5  though  focus.  The  depth 
of  field  is  of  order  200  nm. 


(e.g.,  TM)  polarization  for  the  high  spatial  frequencies  in 
the  fx  direction  is  clear.  Symmetric  results  were  obtained  if 
y  directed  polarization  was  used  for  all  exposures,  the  op¬ 
posite  parts  of  the  L-bar  shapes  did  not  print.  This  result 
confirms  the  simple  IL  visibility  results  already  given — it 
will  be  critical  to  independently  optimize  the  polarization 
for  the  high  spatial  frequencies  in  orthogonal  spatial  direc¬ 


tions  involved  in  printing  45-nm  node  patterns  by  immer¬ 
sion  with  a  193-nm  source. 

The  impact  of  defocus  on  the  IIL  pattern  is  shown  in 
Fig.  7.  At  an  80-nm  defocus,  the  pattern  is  maintained.  At  a 
100-nm  defocus,  the  dense  lines  and  spaces  are  largely  in¬ 
tact  but  dense/isolated  bias  is  evident  in  the  line-end  short¬ 
ening  of  the  isolated  features.  Finally,  at  a  130-nm  defocus, 
the  lines  begin  to  merge  together  and  a  hole  has  developed 
in  the  large  box.  The  addition  of  OPC  should  extend  the 
depth  of  field  (DOF),  which  is  —200  nm  from  these  simu¬ 
lations.  It  is  important  to  recognize  that  the  large  offsets  of 
IIL  (or  DL,  later)  maximize  the  DOF  compared  with  con¬ 
ventional  on-axis  illumination  for  these  high- spatial- 
frequency  structures.  In  the  limit  of  a  periodic  structure 
where  the  off-axis  illumination  is  chosen  to  symmetrize  the 
zero-  and  first-order  diffracted  beams  on  the  image  pupil, 
i.e.,  interferometric  lithography,  the  DOF  approaches  infin¬ 
ity  since  both  beams  are  inclined  at  the  same  azimuthal 
angle  ( 0 ).  The  frequently  cited  Rayleigh  result  for  incoher¬ 
ent  illumination,  DOFoc  k2(X/NA2),  does  not  hold  for  this 
situation. 
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Fig.  9  ML  with  a  larger  o-=0.04.  There  is  an  increased  dense- 
isolated  line  bias  and  poorer  definition  of  the  L-bar  corners  at  this 
higher  cr. 


The  importance  of  including  the  pupil-plane  filters  is 
shown  by  the  simulation  result  in  Fig.  8,  which  includes 
only  the  offset  exposures  but  without  incorporating  any  pu¬ 
pil  plane  filters — e.g.,  quadrupole  illumination  with  the 
quadrupole  oriented  along  the  principal  pattern  axes  and 
with  optimized  polarizations.  The  dense  features  are  incom¬ 
pletely  resolved  and  there  is  a  significant  dense-isolated 
bias.  The  frequency  space  coverage  in  Fig.  8(b)  shows  that 
the  low-frequency  components  of  the  pattern  are  counted 
multiple  times,  while  the  high-frequency  components  are 
counted  only  once.  The  resulting  overemphasis  on  the  low 
frequencies  is  the  reason  for  the  inability  to  resolve  the 
dense  features.  This  exposure  strategy  is  equivalent  to  DL, 
but  with  a  single  mask  with  all  of  the  features  included.  The 
next  section  shows  that  somewhat  better  results  are  avail¬ 
able  by  parsing  the  features  between  two  masks  and  em¬ 
ploying  a  double-exposure  strategy. 

Finally,  the  impact  of  a  slightly  larger  cr  of  0.04  is  shown 
in  Fig.  9.  This  corresponds  to  the  incoherent  addition  of 
coherent  exposures  at  different  offsets.  The  limits  were  set 
by  the  maximum  offset  at  the  limit  of  the  pupil  (0.9n/X)  at 
one  side,  and  by  the  smallest  offset  that  can  capture  the 
highest  spatial  frequencies  in  the  image,  which  from  the 
Fourier  plots  in  Fig.  8  extend  out  to  almost  l.8n/\.  Em¬ 
pirically,  this  restricted  the  cr  to  —0.04  with  a  center  offset 
of  0.88n/\.  The  pupil  plane  filters,  with  the  small  hole  for 
the  zero-frequency  diffraction  enlarged  for  the  larger  cr,  and 
polarization  were  the  same  as  used  in  the  exposure  of  Fig. 
5.  The  image  is  significantly  degraded  with  a  much  more 
significant  dense-isolated  line  bias,  a  narrowing  of  the  out¬ 
ermost  L-bar  and  more  exaggerated  sidewall  ripples,  par¬ 
ticularly  in  the  comers.  All  of  these  imaging  artifacts  could 
be  addressed  with  OPC.  Extension  to  higher  cr’s  results  in 
poorer  images  as  the  lower  offsets  do  not  enable  capture  of 
the  necessary  high  spatial  frequencies  and  thereby  overem¬ 
phasize  the  larger  features. 

3.3  DL 

DL,  where  the  pattern  features  are  split  into  two  masks 
each  with  the  high  spatial  frequencies  in  only  one  direction, 
is  an  alternative  exposure  technique  that  offers  the  neces¬ 
sary  polarization  flexibility.  For  these  simulations,  no  pupil 
plane  filters  were  used  and  all  exposures  were  at  the  full 


(c) 


Fig.  10  DL  with  two  masks  (a)  and  (b)  separating  the  high  small 
features  in  the  x  and  y  directions.  The  large  box  was  eliminated, 
since  it  was  not  possible  to  print  the  pattern  when  it  was  present. 
The  illumination  offsets  were  at  the  edge  of  the  imaging  pupil  and 
the  frequency  space  coverage  was  identical  to  that  shown  in  Fig.  8. 
Because  the  large  box  was  absent,  the  frequency  space  overlaps 
were  tolerable. 


system  NA.  The  wide  range  of  spatial  frequencies  in  the 
present  test  pattern  poses  some  difficulties  for  this  ap¬ 
proach.  Figure  10  shows  an  example  of  the  result  of  coher¬ 
ent  (or— 0.02)  DL  at  an  offset  of  0.89  just  as  was  used  for 
the  IIL  example  of  Fig.  8.  Each  mask  is  printed  with  the 
appropriate  dipole  illumination  and  polarization.  The  large 
box  has  been  removed  from  the  pattern  and  the  remaining 
dense  features  split  into  separate  masks  for  the  x  and  y 
directions,  as  shown.  The  polarization  is  optimized  sepa¬ 
rately  for  exposure  of  each  mask.  The  dense-isolated  bias  is 
more  pronounced  in  this  configuration,  and  the  corners  of 
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Fig.  11  Results  of  three  different  dipole  mask  strategies.  A  single 
bright-field  mask  printing  only  the  lines  parallel  to  the  y  axis  and  the 
large  box.  It  was  not  possible  to  print  both  the  lines  and  the  box  in  a 
single  exposure.  Also  shown  are  two  different  attempts  at  dark-field 
(inverted)  images.  The  second  mask  for  the  first  case  (full  large  box) 
included  only  the  lines  parallel  to  the  xaxis  with  no  box.  The  second 
mask  for  the  second  case  (triangular  box  segment)  was  symmetric 
to  the  mask  shown. 


the  L-bars  are  less  robustly  defined.  Alignment  of  the  two 
masks  between  exposures  is  an  added  requirement  for  this 
dipole  technique. 

Figure  11  shows  the  result  of  a  single-mask  exposure 
including  the  large  box  (left  column)  with  the  appropriate 
DL  and  polarization.  The  individual  lines  close  to  the  box 
are  not  resolved.  The  two  right  columns  show  two  different 
dark-field  (inverted  image)  mask  strategies.  In  the  center 
column,  the  mask  includes  the  entire  box,  while  the  second 
mask  (not  shown)  has  only  the  dense  features  parallel  to  the 
v  axis.  Again  in  this  case,  there  is  difficulty  in  separating 
the  dense  lines  from  the  box,  and  there  is  significant  distor¬ 
tion  of  the  pattern.  The  right-hand  column  shows  a  more 
symmetric  division  of  the  mask.  This  produced  the  best- 
printed  pattern,  but  still  shows  significant  distortion  and 
linewidth  variation.  More  detailed  investigations  are  neces¬ 
sary  to  optimize  these  DL  strategies.  In  particular,  the  use 
of  OPC  with  or  without  pupil-plane  filters  will  certainly 
provide  better  imaging  results. 

Figure  12  shows  intensity  cuts  across  the  aerial  image 
for  IIL  and  for  single  dipole  exposures  for  both  the  bright- 
field  and  dark-field  cases  for  the  full  pattern  including  the 
large  box.  Clearly  IIL  provides  the  best  aerial  image  and 
hence  the  best  developed  image.  In  both  dipole  examples, 
because  of  the  large  low-frequency  content  of  the  aerial 
images,  it  is  impossible  to  choose  an  exposure  threshold 
level  for  either  of  the  dipole  exposures  that  captures  all  of 
the  pattern  features. 

4  Conclusions 

A  simulation  of  arbitrary  pattern  lithography  for  the  45-nm 
half-pitch  node  using  a  193-nm  source  with  an  NA=1.3 
water-immersion  lithography  tool  was  presented.  Of  par¬ 
ticular  importance  is  the  requirement  of  optimizing  the  po¬ 
larization  individually  for  different  parts  of  the  image.  This 
will  require  a  major  change  to  illumination  optics  and  will 
impact  exposure  strategies  and  tool  cost-of-ownership 
evaluations. 
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Fig.  12  Intensity  of  the  aerial  image  across  a  cut  of  the  pattern  for 
IIL,  a  single  dipole  exposure  with  a  positive  tone  mask  (Fig.  11,  left 
column)  and  a  single  dipole  exposure  with  a  negative  tone  mask 
(Fig.  11 ,  middle  column). 


To  access  the  highest  spatial  frequencies — necessary  for 
printing  45-nm  half-pitch  structures — off-axis  illumination 
near  the  edges  of  the  imaging  pupil  will  be  required.  Two 
strategies  for  separating  the  exposures  to  allow  for  the  nec¬ 
essary  polarization  control  were  presented.  IIL,  related  to 
off-axis  illumination  with  the  addition  of  pupil  plane  filters 
to  adjust  the  details  of  the  image,  provides  relatively  robust 
results.  Different  pupil  plane  filter  strategies  were  presented 
along  with  details  of  variation  of  the  image  through  focus. 
DL  with  the  division  of  the  pattern  into  two  masks,  each 
with  patterns  primarily  oriented  in  orthogonal  directions, 
was  also  investigated.  Initial  results  are  not  as  promising  as 
the  IIL  strategy,  but  more  investigation  is  required. 

These  simulations  show  that  45-nm  half-pitch  lithogra¬ 
phy  is  indeed  possible  by  extension  of  193-nm  tools  to 
water  immersion.  Much  more  investigation  is  required  to 
optimize  the  imaging  over  the  range  of  patterns  required  for 
actual  circuits,  and  to  add  OPC  to  provide  more  robust 
results.  This  evaluation  did  not  consider  PSMs,  which  are 
an  alternative  approach  to  reaching  the  frequency  space 
limits  of  optical  systems.  Experimental  confirmation  of  the 
simulations  using  microstepper  platforms  is  also  an  urgent 
need.  Interferometric  lithography  (grating)  investigations 
are  underway  and  are  reported  elsewhere10  and  in  this  vol¬ 
ume.  These  reports  confirm  the  frequency  capabilities  and 
the  IL  polarization  analysis  and  demonstrate  that  work  is 
necessary  on  resists  to  extend  their  resolution  to  the  45-nm 
half-pitch  node.  Ultimately,  decisions  as  to  manufacturing 
direction  are  as  much  economic  as  technical.  Significant 
further  investigation  is  necessary  to  evaluate  all  of  the  pos¬ 
sible  directions;  this  work  is  but  an  initial  contribution. 
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ABSTRACT 

Imaging  Interferometric  Lithography  (IIL)  offers  several  optimization  parameters  such  as  pupil  filtering,  parsing  of 
frequency  coverage,  polarization  control,  and  multiple  exposure  dosage  ratios.  We  discuss  the  optimal  frequency 
coverages  for  IIL  at  NA  =  .9  and  the  effects  of  the  dark  field  (quadratic  image  autocorrelation  terms)  on  the  aerial  image 
under  pupil  filtering.  Next,  comparisons  are  made  of  exposure  latitudes  for  various  dosage  ratios  and  exposures  for 
several  weighted  errors  under  different  conditions.  Lastly,  apodization  of  the  pupil  filters  is  considered  and  shown  to 
alleviate  artifacts  associated  with  Gibbs  phenomena  at  hard  frequency  stops  and  improve  overall  image  fidelity. 
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1.  INTRODUCTION 

As  Ki  decreases  in  optical  lithography,  a  detailed  understanding  of  nonlinear  imaging  effects  becomes  necessary  to 
optimize  image  formation  at  minimum  dimensions.  For  any  lithographic  technique,  high  contrast  and  exposure  latitude 
are  necessary  to  achieve  successful  printing  of  high  quality  patterns,  especially  in  a  high  volume  manufacturing 
environment.  IIL  at  high  NA  [1]  offers  several  resolution  enhancement  techniques  (RET)  such  as  off-axis  illumination 
(OAI)  at  the  largest  oblique  illumination  angle  to  down-shift  the  highest  frequency  components  into  the  pass-band  of  the 
optical  system,  variable  pupil  filtering  strategies  to  improve  image  robustness,  and  polarization  control  to  maximize 
interference  and  print  high  spatial  frequencies  at  the  optical  system  minimum  half-pitch.  This  paper  explores  some  of 
these  strategies  and  demonstrates  their  impacts  on  the  final  images.  With  the  IIL  scalar  model  [2,3],  the  up-shifted  high 
frequency  components  of  a  mask  pattern  retain  the  high  amplitudes  necessary  to  create  high  exposure  latitude  for  high 
resolution  patterns  at  low  Ki  imaging.  At  high  NA,  four  exposure  IIL  is  similar  to  double  dipole  off-axis  illumination 
RET,  however  the  primary  differences  are  the  use  of  coherent  light,  on-axis  exposure,  and  pupil  filters;  all  in 
combination  to  create  high  resolution  images  with  little  image  degradation. 

Since,  numerical  apertures  {NA)  will  approach  >  0.9  in  air  and  ~  1.3  or  beyond  in  immersion  lithography,  IIL  uses 
off-axis  and  on-axis  illumination,  polarization  control  for  proper  interference  at  high  NA  angles  for  each  exposure,  pupil 
filtering  to  the  control  iso/dense  bias,  and  multiple  exposure  ratio  dosage  parameters  to  control  contrast/exposure 
latitude.  These  variables  all  allow  IIL  to  fully  exploit  all  parameters  of  multiple  exposure  optical  lithography.  IIL 
promises  full  spatial  optical  resolution  without  costly  phase-shift  masks  and  with  minimal  optical  proximity  correction 
for  small  critical  dimensions. 

In  this  paper,  we  compare  the  aerial  images  at  high  NA  of  four  and  five  exposure  IIL  under  scalar  and  vector  models. 
We  explore  variations  of  dosage  ratios  between  the  reference  beam  and  the  first  diffraction  order  using  various 
simulation  approaches.  Optimized  ratio  dosages  for  the  multiple  exposures  have  a  strong  effect  on  printability,  exposure 
latitude,  and  defects.  In  addition,  use  of  pupil  filters  and  apodization  can  improve  resolution  and  combat  artifacts 
yielding  a  maximally  robust  aerial  image. 

2.  OPTIMIZING  FREQUENCY  COVERAGE  AND  PARSING 

IIL  provides  the  ability  to  use  the  parsing  of  different  frequency  space  image  components  into  separate  exposures  as  a 
parameter  for  image  quality  and  resolution.  The  tiling  or  parsing  is  not  unique  due  to  the  ability  to  vary  the  NA, 
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polarization,  and  offset  illumination  angles  in  separate  exposures.  To  better  understand  the  impacts  of  these  variables  on 
the  frequency  coverages  and  parsings,  numerical  algorithms  were  created  to  optimize  the  illumination  scheme  for  a 
given  mask  [4].  Afterwards,  an  understanding  is  gained  which  coverages  are  most  likely  to  work  well  for  a  given  NA 
and  mask  pattern.  At  high  NA’s,  the  vector  model  is  necessary  due  to  the  large  propagation  angles.  However,  with  TE 
polarization  control  for  each  exposure,  the  aerial  images  will  have  similar  aerial  image  profiles  when  compared  to  scalar 
aerial  images  in  simulation  [5].  In  general,  the  trends  of  the  aerial  image  should  still  hold  due  to  the  Manhattan  pattern 
assumption,  use  of  pupil  filters,  and  use  of  polarization  control.  The  scalar  model  is  used  in  this  paper  with  care  for 
simplicity  and  to  understand  the  general  trends  of  varying  parameters  of  IIL.  Where  necessary  a  vector  calculation  is 
used  to  validate  the  results.  The  IIL  intensity  is  described  by  the  sum  of  multiple  incoherent  intensities  [2] 
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where  a  is  the  ratio  of  the  dosages  between  the  Ion  (on-axis  intensity)  and  Ioff  (off-axis  intensity)  for  the  horizontal  and 

vertical  frequency  components.  When  the  off-axis  illumination  point  source  is  at  edge  of  the  illumination  pupil  for  the 
horizontal  frequencies  the  off-axis  intensity  is  described  by 
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and  similarly  for  Ioff_y  .  The  quadratic  term  is  summed  over  all  frequencies  except  the  DC  plane  wave.  0(fx,fy )  is  the 
object/mask  spectrum,  P(  fx ,  fy )  is  the  pupil  function  of  a  coherent  optical  imaging  system,  i.e.  a  uniform,  radial  low- 
pass  filter  to  cutoff  frequency  NA  /  A  and  foff  is  the  frequency  offset  of  the  off-axis  illumination  point  source, 
foff-x,y  =  NA  /  X .  The  first  term  is  the  bias,  the  linear  term  is  the  desired  quantity,  and  the  quadratic  term  is  the  nonlinear 
image  that  causes  image  degradation.  In  addition,  the  pupil  filter  (function),  P(fx,fy),  in  each  exposure  can  be  reshaped 
by  apodization  (pupil  filtering). 


2.1.  Algorithms 

A  detail  description  of  the  algorithms  used  to  place  the  frequency  coverage  and  parsing  are  described  in  [5].  The  basic 
principle  is  based  on  transform  coding  commonly  used  in  the  signal  processing  community.  Essentially,  in  coding  the 
most  significant  coefficients  in  a  transformer  coder  are  kept  and  the  rest  are  set  to  zero.  A  transform  coder  is  analogous 
to  the  diffraction  limited  optical  system  where  a  lens  is  viewed  as  a  finite  Fourier  transform  filter.  It  is  optimal  to  keep 
the  most  significant  coefficients  such  that  the  most  significant  planes  waves  are  passed  through  the  pupil  optimally. 
Algorithms  are  developed  to  search  the  frequency  space  to  maximize  the  sum  of  squares  of  the  coefficients  in  each 
exposure.  From  a  lithography  perspective,  the  source  illumination  is  designed  to  place  the  diffraction  orders  in  the 
optimal  places  in  the  pupil  for  the  least  error.  Several  computation  algorithms  with  different  complexity  were  explored. 
Greedy  and  dynamic  programming  types  are  most  useful  for  speed  and  optimality  respectively.  Each  exposure  selects  a 
frequency  coverage  area  and  hence  that  determine  which  Fourier  coefficients  pass  through  the  optical  system.  With  high 
NA’s,  the  algorithms  will  converge  to  the  typical  solution  shown  in  Figure  1  (a),  however  at  smaller  NA’s,  variations  in 
frequency  coverage  are  possible  since  smaller  NA’s  cover  less  frequency  space  area  and  the  promise  of  reintroducing  the 
reference  beam  at  the  image  plane  was  possible  with  low  NA  optical  systems.  In  addition,  alternative  frequency 
coverages  are  discussed  for  experimental  use  and  the  algorithms  can  be  used  to  search  redundant  expansions  instead  of 
typical  Fourier  expansions  to  possibly  find  better  approximations  using  frequency  coverages  with  correct  overlap.  It  is 
known  that  non- orthogonal  expansions  allow  better  approximations.  For  example,  overlapping  frequency  coverages  can 
amplify  certain  frequency  components  to  increase  exposure  latitude. 

2.2.  Filters  and  Coverage  Figures 

In  this  section  the  dynamic  programming  algorithm  result  at  high  NA  is  shown  in  Figure  1.  The  typical  coverage  for  the 
off-axis  illumination  is  at  the  expected  locations  on  vertical  and  horizontal  axes  since  a  majority  of  spatial  frequency 


space  is  covered  with  a  few  high  NA  exposures.  The  four  and  five  exposure  pupil  filters  are  shown  on  the  right  side  of 
the  figure.  The  four  exposure  method  uses  the  left  two  pupil  filters  and  the  third  includes  the  on-axis  pupil  filter  for  the 
five  exposure  system  setup.  An  addition  on- axis  exposure  is  added  to  reduce  the  iso/dense  bias  from  the  off-axis 
exposures.  The  reference  beam  location  in  the  pupil  plane  is  at  the  apex  of  the  triangle  in  the  pupil  filters.  Comparisons 
of  aerial  images  resulting  from  these  configurations  are  discussed  in  the  next  section.  Some  addition  frequency 
coverages  with  different  geometries  at  low  NA  and  with  our  experimental  high  NA  lens  with  an  obscuration  in  the  center 
of  the  pupil  are  discussed  in  [5].  Special  case  NA’s  results  in  typical  coverage  geometries  with  tilted  coverage  to  recover 
corner  frequencies  and  to  move  the  obscuration  slightly  away  from  the  vertical  and  horizontal  axes  for  the  four  and  five 
exposure  systems. 


(a) 
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Figure  1.  (a)  Frequency  space  picture  of  five  exposure  IIL  coverage  where  optical  limit  frequency  is  2/  A .  Concentric  circles  are 
various  on-axis  pupils  of  NA  =  .2,  .4,  .6  and  .8.  (b)  Left  figure  is  vertical  frequency  pupil  filter.  Middle  figure  is  horizontal  frequency 
pupil  filter.  Right  is  on-axis  pupil  filter  for  various  on-axis  NA.  White  represents  transparent  area,  gray  is  opaque  area,  and  black  is 
the  area  outside  the  Fourier  pupil  for  off-axis  and  on-axis  exposures.  Partial,  concentric  circles  are  on-axis  pupil  sizes. 


Next,  typical  quadratic  or  dark  field  images  results  from  three  different  pupil  filtering  configurations  are  shown  in  Figure  2  for  the 
mask  pattern  in  Figure  3  (a).  Figure  2  (a)  shows  the  dark  field  image  from  one  off-axis  exposure  without  any  pupil  filter  and  hence  the 
interference  is  strong  and  causes  a  bias  in  the  square  box  region  leading  to  ringing  defects  as  has  been  seen  previously  [1,2].  Figure  2 
(b)  illustrates  the  dark  field  image  with  the  four  exposure  pupil  filters  in  Figure  1  (b).  The  dark  field  has  a  less  impact  than  previously 
with  binary  filters.  Figure  2  (c)  shows  the  dark  field  image  of  the  off-axis  image  when  an  on-axis  exposure  is  included.  The  bias  in 
the  box  region  is  removed  and  only  a  small  effect  in  the  dense  region  of  the  pattern  is  apparent. 
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Figure  2.  (a)  Typical  quadratic  term  without  pupil  filter.  Strong  DC  bias  in  box  region  is  evident,  (b)  Typical  quadratic  term  (dark 
field)  of  four  exposure  IIL  system  where  pupil  filter  is  triangle  shape,  (c)  Typical  quadratic  term  from  five  exposure  IIL  system  where 
pupil  filter  is  moon  shape.  Peak  value  in  dark  field  image  is  smaller  and  has  a  small  DC  bias  in  dense  line  area. 


3.  COMPARISONS  OF  AERIAL  IMAGES 

Simulations  of  IIL  aerial  images  in  vector  and  scalar  models  are  discussed.  Although  there  are  differences  in  the  precise 
aerial  image  calculation,  the  scalar  model  still  illustrates  the  general  trend  in  the  aerial  image  at  high  NA’s  since 
polarization  control  is  used  in  each  separate  exposure  for  the  rectilinear  patterns  of  interest.  Figure  3  illustrates  the 
various  error  weights  used  to  emphasize  error  in  dense  and  isolate  parts  of  the  pattern  for  aerial  image  comparisons  in 
the  figures  to  follow. 


(a)  (b)  (c)  (d) 

Figure  3.  These  images  represent  the  weights  on  the  pattern  to  emphasize  error  in  important  areas.  White  corresponds  to  weighted 
region.  Black  are  the  typical  lines  of  the  pattern  and  the  gray  background  is  the  space  area  of  the  mask,  (a)  Optimal  mask,  identity 
(unweighted)  weight  (b)  Dense  weight  mask  emphasizing  dense  area  with  weight  =  100.  (c)  Box  weight  emphasizing  large  line  area 
with  weight  =  100.  (d)  Isolated  weight  mask  emphasizing  isolated  lines  with  weight  =  600. 

A  weight,  w(x,  y) ,  is  calculated  as  on  the  error  between  the  binary  mask,  M(x,y) ,  and  the  threshold  aerial  image, 
T (x,  y)  over  the  spatial  coordinates  x,  y  . 
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3.1.1  Scalar  Model  Comparisons 

The  IIL  intensity  scalar  model  typically  shows  a  stronger  biased  aerial  image  that  requires  higher  threshold  values  than 
single  exposure  aerial  images  because  of  the  increase  number  of  exposures.  Later  in  the  vector  model  section,  it  is 
illustrated  that  the  bias  is  lessen  by  optimizing  the  exposure  ratio  dosage  parameters.  Scalar  aerial  images  for  various 
on-axis  pupils  tend  to  be  the  same  for  all  error  measures  except  the  box  weight  error  curve  in  Figure  4  (b).  It  is  seen  in 
section  4  that  the  addition  of  apodization  can  alleviate  the  sensitivity  of  the  box  error  weight  with  respect  to  the  on-axis 
pupil  size.  The  inclusion  of  an  on-axis  exposure  with  pupil  filtering  make  the  error  curves  behaves  closely  to  a  linear 


system.  The  similarity  is  illustrated  in  the  linear  term  nearness  to  the  various  five  exposure  error  curves  to  that  of  the 
four  exposure  error  curves  in  Figure  4. 
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Figure  4.  Error  curves  of  threshold  aerial  images  (a)  Identity  weighted  error  curves,  (b)  Box  weighted  error  curve  (c)  Dense  weighted 
error  curve,  (d)  Isolate  line  error  curve 

There  exists  more  exposure  latitude  in  the  four  exposure  system  since  the  dense  weight  error  curve  in  Figure  4(c) 
appears  wider,  but  with  a  greater  minimum  error.  The  isolated  line  error  curve  in  Figure  4  (d)  shows  that  large  on- axis 
pupil  filter  tends  to  print  the  isolated  lines  better.  This  is  expected  since  the  on-axis  image  defines  the  low  frequency 
portions  of  the  mask.  The  RC-S  error  curve  is  discussed  Section  4. 

The  iso/dense  bias  affects  the  dense  lines  unevenly  in  the  vector  model  and  this  creates  difficultly  to  print  dense 
features  correctly  than  in  the  scalar  model  shown  in  the  threshold  aerial  images  in  Figure  5.  Figure  5  (a)  and  (b)  display 
the  resulting  threshold  image  from  the  scalar  and  vector  models.  The  comer  region  shows  a  strong  iso/dense  bias 
compared  to  the  isolated  lines.  This  is  somewhat  expect  since  the  high  NA  accurately  accounts  for  the  high  spatial 
frequencies  better  at  the  large  propagation  angles.  Notice  the  top  of  the  large  box  has  the  angular  falloff  that  is  still  seen 
in  the  scalar  model.  The  five  exposure  images  in  Figure  5  (c)  and  (d)  from  the  vector  and  scalar  model  are  similar, 
primarily  because  the  five  exposure  system  behaves  more  like  a  linear  imaging  system  than  the  four  exposure  method. 


(a)  (b)  (c)  (d) 

Figure  5.  Threshold  aerial  images  (a)  Scalar  model  4  exposures,  (b)  Vector  model,  4  exposures,  (c)  Vector  model,  5  exposures  (d) 
Scalar  model,  5  exposures 

3.1.2  Vector  Model  Comparisons 

Optimal  values  of  exposure  dosages  typically  depend  on  the  pattern  of  interest  and  the  simulation  tool.  Here  we 
illustrate  the  effect  of  changing  the  a ,  y  parameters.  Many  aerial  images  are  generated  from  PROLITH  vector  model 

with  various  combinations  a,y  and  here  we  show  aerial  image  profiles  and  compute  the  same  error  curves  in  Figure  3  to 
understand  quickly  the  image  quality  to  print  the  best  pattern  possible  for  various  combinations  of  a,y  . 
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Figure  6.  PROLITH  Vector  aerial  image  profiles  for  various  a  and  y  values,  (a)  a  =  .5  (b)  a  =  .8  (c)  a  =  1.1  (d)  a  =  2. 

Figure  6  shows  aerial  image  profiles  for  four  a  ’s  with  varying  y  ’s  values.  It  is  observed  that  the  varying  the  y 
values  increases  the  bias  and  increases  the  exposure  latitude  of  the  dense  lines.  This  behavior  matches  well  with  the 
scalar  aerial  image  intensity  in  eqn.  1.1.  The  square  of  y  is  proportional  to  the  bias  and  y  multiplies  the  linear  term;  this 

effect  is  observed  in  the  vector  model  in  Figure  6  and  consequently  in  the  error  curves  of  Figure  8.  As  y  is  increased  the 
bias  shifts  the  error  curve  right  and  tends  to  widen  the  error  curve.  From  observations  in  Figure  6,  7  and  8,  the  y  control 
over  the  DC  background  bias  is  important  since  small  changes  in  y  affect  the  aerial  image  bias  significantly.  Typical 
dosages  should  be  between  5 E0  and  2 E0  (.2  and  .5  threshold  values).  Because  the  simple  resist  model  is  based  on  a 
threshold  function,  it  is  important  to  control  the  bias  of  the  multiple  exposure  system  through  y  since  biasing  also  limits 
resolution.  The  gain  by  using  a  high  y  values  is  offset  by  a  large  DC  background  without  a  large  gain  in  the  exposure 
latitude  in  the  linear  term  amplification.  Primarily,  y  is  used  to  put  the  bias  in  the  resist  threshold  range  so  that  the 
dense  structures  are  at  the  correct  threshold  values.  From  observations  in  PROLITH,  optimal  value  of  y,a  are  .8,  .5 

respectively.  This  gives  a  threshold  value  of  approximately  .37.  Comparisons  in  Figure  6,  show  that  the  on-axis  aerial 
image  should  have  similar  contrast  to  the  off-axis  aerial  images  to  create  a  robust  photoresist  image  with  similar  contrast 
for  on-axis  and  off-axis  intensities.  For  a  fixed  y,  a  large  a  value  amplifies  the  isolate  structures  and  decreases  the 
exposure  latitude  of  the  dense  lines.  Using  a  small  a  decreases  the  exposure  latitude  of  the  on-axis  image  (isolated 
structures)  and  hence  reduces  the  image  quality  of  the  photoresist  image.  In  Figure  7,  the  middle  group  of  curves  with 
y=. 8  and  with  a  equal  to  .5,  the  error  curve  is  widest  in  comparison  with  the  four  other  curves  with  other  a  values. 
For  example,  the  error  curve  becomes  larger  with  decreasing  a ,  but  after  a  =.5  the  error  curve  becomes  narrow  again. 
A  simple  method  to  determine  an  optimal  a  during  simulation  is  to  measure  the  maximum  intensity  of  the  off-axis  and 
on-axis  aerial  images.  Then  a  can  be  determined  to  equalize  the  intensities  of  the  off-axis  and  on-axis  aerial  images. 
Then  a  is  the  ratio  between  Imax_off  / 1 max-on  •  This  will  adjust  the  intensities  between  the  off-axis  and  on-axis  images  such 
that  the  overall  aerial  image  contrast  matches  well  and  a  wide  threshold  region  is  found  for  photoresist  development. 

Figure  7  shows  the  effect  of  varying  or  for  fixed  y  values  using  the  general  and  dense  error  measures.  The  y 
determines  the  region  for  aerial  image  thresholds  and  the  a  helps  to  vary  the  exposure  latitude  locally  in  the  threshold 
region.  For  large  and  small  a  values,  the  error  curve  is  becomes  narrower  as  compared  to  the  optimal  values  of  .5  that  is 
observed  to  be  optimal  for  the  photoresist  image  developed  from  the  aerial  image  [1].  A  similar  trend  is  seen  in  the 
dense  error  measure  in  Figure  7  (b). 


(a) 

Figure  7.  Error  curves  for  five  exp.,  y  =  6,  .8,  1.0.  Each  group  of  y  consists  a 
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2,  1.1,  .5,  .3.  (a)  Unweighted  error  (b)  Dense  error. 
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Figure  8.  Error  curves  for  4  exp.  (solid  line)  and  5  exp.  (dash  line)  aerial  images  for  fix  a  =.5  and  various  y  .  (a)  Unweighted  error 
curve,  (b)  Dense  error  metric. 

Figure  8  (b)  further  illustrates  that  dense  lines  print  better  in  five  exposures  instead  of  four  exposures  by  a  factor  of 
approximately  two  in  the  minimum  error.  Although,  there  is  more  contrast  due  to  the  quadratic  term  to  widen  the  error 
curve,  it  degrades  and  increases  the  relative  minimum  error. 

4.  APODIZATION 

High  NA’s  allow  a  significant  amount  of  frequency  overlap  between  on-axis  and  off-axis  exposures  as  illustrated  in 
Figure  1.  Designing  filters  for  overlapping  frequency  spectrum  between  various  frequency  bands  of  signals  is  large  facet 
of  filterbank  theory  and  signal  processing.  The  use  of  subapertures  in  the  different  IIL  exposures  allows  the  use  of 
smooth  pupil  functions  such  that  the  transitions  in  the  frequency  spectrum  are  smooth  in  transition  instead  of  the 
brickwall  shape  of  the  MTF  in  coherent  illumination.  Of  course,  at  the  highest  spatial  frequencies  of  the  system,  the 
imaging  physics  still  dictates  hard  cutoffs  with  coherent  illumination.  Smooth  transitions  in  the  overlap  regions  allow 
higher  resolution  with  lowered  image  degradation.  Radially,  nonsymmetrical  filters  are  considered  in  [6,7]  to  improve 
aerial  image  robustness  and  DOF,  however  here  the  filters  are  designed  with  a  two  channel  filterbank  idea  in  mind  to 
improve  image  fidelity. 


Figure  9  displays  two  on-axis  pupil  filters  with  the  sharp  raise  cosine  and  raise  cosine  filter  with  square  symmetry. 
The  sharp  raised  cosine  has  a  fast  varying,  narrow  transition  band  and  a  flatter  central  peak.  The  raised  cosine  is 
smoother  and  contains  a  wider  transition  region. 


(a)  (b) 

Figure  9.  On-axis  pupil  filters  with  square  symmetry,  (a)  Sharp  raised  cosine  (b)  Raised  cosine 

The  same  functions  for  the  transition  areas  of  the  filters  are  used,  however  the  off-axis  filters  fan  out  in  diagonal  or 
straight  directions.  Figure  10  consists  of  nonsymmetrical  off-axis  pupil  filters.  The  horizontal  frequency  filters  are 
shown  because  the  vertical  frequency  filters  are  90  degree  rotations  of  the  horizontal  pupil  filters.  Figure  10  (a)  and  (b) 
have  a  triangle  shape  and  even  out  the  overlap  in  frequency  coverage  between  the  horizontal  and  vertical  off-axis 
exposures.  The  filters  in  Figure  10  (c)  and  (d)  overlap  and  amplify  the  corner  frequencies  and  may  provide  higher 
contrast  for  corner  frequencies. 


(a)  (b)  (c)  (d) 

Figure  10.  Off-axis  pupil  filters,  (a)  Triangle  shape  -  raise  cosine  (b)  Triangle  shape  -  sharp  raise  cosine  (c)  straight  shape  -  raise 
cosine  (d)  straight  shape  -  sharp  raise  cosine 

Further  use  of  other  filters  such  as  the  Bartlett,  Gaussian  windows,  and  other  filters  are  possible.  The  transition 
regions  may  not  mirrors  of  each  other  between  the  on-axis  and  off-axis  filters  and  optimizing  the  transition  region  is 
possible  to  improve  image  quality. 

Threshold  images  of  the  aerial  image  at  the  same  threshold  level  are  shown  in  Figure  11.  Figure  11  (a)  displays  the 
defect  caused  by  the  ringing  that  is  well-known  property  of  imaging  in  coherent  light.  The  threshold  image  resulting 
from  apodization  with  the  raise  cosine  filter  is  shown  in  Figure  11  (b).  The  threshold  images  are  similar  for  all  the 
apodization  filters  discussed,  hence  only  one  image  is  shown.  In  all  cases,  the  defect  has  been  mitigated  by  the  addition 
of  apodization.  Aerial  image  profiles  are  show  in  Figure  12,  taken  vertically  through  the  defect  region  of  the  image  in 
Figure  11.  The  raise  cosine  filter  reduces  the  defect  slightly  better  than  the  sharp  raise  cosine  pupil  filter  due  to  the 
smoother  response.  In  addition,  the  apodization  is  carried  out  with  the  mask  pattern  in  Figure  3  (a)  and  the  error  curves 
for  the  raise  cosine  with  square  symmetry  are  include  in  Figure  4  for  comparison.  It  is  observed  that  the  isolated,  dense 
and  unweighted  error  curves  are  all  similar  to  the  error  curves  with  binary  filtering;  improvement  in  the  box  error  curve 
is  attained  by  reduced  biasing  in  the  large  box  region  that  is  worse  in  the  case  of  binary  pupil  filtering. 


(a) 


(b) 


Figure  11.  Threshold  scalar  aerial  images,  (a)  Without  apodization  using  binary  filters  of  Figure  1  (b)  With  raise  cosine  apodization. 


Position  xIO’6  Position 

(a)  (b) 


Figure  12.  Scalar  aerial  image  slices  with  and  without  apodization  (a)  Sharp  raise  cosine  with  square  symmetry  (SRC-S)  (b)  Raise 
cosine  with  square  symmetry  (RC-S) 

Figure  13  shows  photoresist  images  using  PROLITH  vector  imaging  model.  The  hole  in  the  box  region  reproduces 
well  with  the  scalar  model  threshold  image  in  Figure  11  (a).  The  defect  is  removed  by  the  apodization  filters  to  reduce 
the  ringing  and  the  resulting  photoresist  image  is  shown  in  Figure  13  (b)  Note,  the  linewidths  are  straighter  and  some 
iso/dense  bias  is  apparent. 


Figure  13  Photoresist  images  with  and  without  apodization  using  vector  imaging  model  at  high  NA  (a)  Photoresist  image  with  binary 
filters  of  Figure  1.  (b)  Photoresist  images  with  raised  cosine  filters. 


5  CONCLUSION  &  FUTURE  WORK 

Algorithms  have  been  developed  to  optimize  the  frequency  coverage  and  image  quality  in  IIL.  They  optimize  the 
coverage  such  that  the  most  significant  plane  wave  making  up  the  pattern  are  passed  through  the  system.  At  high  NA, 
the  solution  converges  to  the  typical  solution,  however  they  provide  insights  at  low  NA’s  and  provide  alternatives  and 
options  to  overlap  coverages  with  higher  number  of  exposure  for  more  accurate  aerial  images. 

Comparisons  between  scalar  and  vector  model  simulations  are  made.  The  behavior  of  the  aerial  image  in  scalar  and 
vector  tend  to  hold  with  the  Manhattan  pattern  and  polarization  control.  The  quadratic  term  helps  to  increase  contrast  in 
some  cases;  however  it  often  degrades  the  image  quality  beyond  acceptability.  Controlling  the  dosage  ratios  between  the 
various  exposures  helps  to  increase  exposure  latitude  and  contrast.  These  parameters  of  IIL  provide  aerial  image 
robustness  necessary  for  second  generation  RET. 

Lastly,  while  a  binary  pupil  filter  approach  has  been  shown  to  control  the  quadratic  term  in  the  four  exposure  to  an 
extent,  additional  on-axis  (low  frequency)  coverage  with  a  binary  pupil  filter  reduces  the  iso-dense  bias  and  impacts  the 
pattern  printability  significantly.  At  high  NA,  the  five  exposure  IIL  allows  apodization  to  mitigate  defect  printing 
between  exposures.  Apodization  can  help  reduce  the  ringing  without  reducing  the  low  Ki  imaging  offered  by  IIL. 
Vector  simulations  at  high  NA  and  full  photoresist  modeling  with  apodization  are  shown  to  behave  similarly  to  the 
scalar  and  threshold  resist  models.  Implementation  issues  need  to  be  investigated  for  apodization  and  further 
experimental  work  is  necessary  to  fully  understand  the  impact  of  apodization  at  high  NA  for  improved  image  fidelity, 
leading  to  high  resolution  imaging  without  the  need  for  phase-shift  masks  and  extensive  OPC. 
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ABSTRACT 

Liquid  immersion  lithography  (LIL)  can  extend  the  resolution  of  optical  lithography  well  beyond  today’s  capabilities. 
The  half-pitch  limit  is  given  by  the  well-known  formula  A  =  A/(ANA),  where  A  is  the  optical  wavelength  and 

NA  -  n  sin(6>)  is  the  numerical  aperture  of  the  exposure  device  with  n  the  refractive  index  of  the  exposure  medium. 
Through  the  use  of  exposure  media  such  as  purified  water  (refractive  index  of  1.44  at  193  nm),  it  is  possible  to  reduce 
minimum  pitches  by  a  factor  of  as  much  as  44%  -  a  full  technology  node.  Beyond  this  simple  observation,  there  is  a 
good  deal  of  work  necessary  to  fully  understand  the  impact  of  LIL  on  a  lithography  processes.  This  paper  will  address 
issues  concerning  resist  chemistry  and  the  impact  of  water  immersion  on  the  imaging  capabilities  of  different  resist  for¬ 
mulations.  All  resists  were  evaluated  by  imaging  dense  line-space  structures  at  a  65 -nm  half-pitch  both  in  air  and  with 
water  immersion.  Studies  of  the  dense  65 -nm  lines  made  by  immersion  imaging  in  HPLC  grade  water  with  controlled 
variations  in  resist  components  were  performed.  Significant  differences  were  observed  and  will  be  discussed. 

Keywords:  193-nm  resist,  liquid  immersion,  interferometric  lithography,  65 -nm  half-pitch. 


1.  INTRODUCTION 

As  the  microelectronic  industry  pushes  towards  smaller  feature  sizes,  greater  demands  are  placed  on  lithography.  Mod¬ 
ern  lithography  now  commonly  produces  features  smaller  than  the  optical  wavelength.  Through  the  use  of  optical  en¬ 
hancement  techniques  such  as  off-axis  illumination  (OAI),  optical-proximity  correction  (OPC),  phase-shift  masks 
(PSM),  and  imaging  interferometric  lithography  (IIL),  industry  can  extend  the  current  limits  of  state-of-the-art  lithogra¬ 
phy  tools.  These  techniques  are  quickly  being  pushed  to  practical  limits  while  next  generation  lithography  technologies 
are  still  under  development.  Industry  needs  to  find  new  ways  to  extend  their  lithography  capabilities  to  enable  continued 
scaling  of  ICs.  The  ultimate  minimum  feature  optical  lithography  can  produce  is  a  line  and  space  whose  pitch  is  deter¬ 
mined  by  the  well-known  formula,  A  =  A/2NA ,  where  A  is  the  wavelength  of  the  light  and  NA  =  nsin(0)  is  the  product 
of  the  immersion  medium  refractive  index  and  the  sine  of  the  half  angle  of  the  light  at  the  exit  of  the  lens. 

Historically,  the  exposure  wavelength  A  has  been  steadily  decreased  and  the  numerical  aperture  NA  has  been  steadily 
increased  to  achieve  higher  resolution.  Current  leading  edge  exposure  tools  operate  at  a  wavelength  of  193  nm  with  NA 
~  0.7  with  projections  to  NA  ~  0.85  to  0.9  in  the  near  future.  Currently  there  is  no  projected  optical  node  beyond  157 
nm  where  significant  material  issues  remain  outstanding.  Recently,  interest  has  grown  dramatically  in  exploring  the  use 
of  an  immersion  medium  between  the  exit  face  of  the  lens  and  the  photoresist  surface  to  extend  optical  lithography.1 
Currently  air  is  the  only  exposure  medium  being  used  for  commercial  exposure  tools  with  n  =  1 .  The  use  of  immersion 
media  to  extend  optical  resolution  is  well  known  from  oil- immersion  optical  microscopy.  Through  the  use  of  immersion 
media,  such  as  de-ionized  water  with  n  ~  1.44  at  193  nm6,  much  smaller  minimum  feature  sizes  can  be  achieved  than 
are  possible  in  air.  This  can  be  thought  of  either  as  an  effective  wavelength  Aeff  -  A^/n  ~ 134  nm  or  (not  and)  alterna¬ 
tively  as  an  increase  to  an  effective  NAejj-  =  nsinO  <  1.44  .  Thus,  the  use  of  water  immersion  lithography  could  poten- 


Proc.  SPIE  Vol.  5377,  p.  306-318,  Optical  Microlithography  XVII;  Bruce  W.  Smith;  Ed,  May,  2004 


tially  extend  193-nm  wavelength  optical  lithography  by  44%,  twice  the  resolution  enhancement  offered  by  the  problem¬ 
atic  transition  to  157  nm.  Liquid  immersion  lithography  also  provides  an  improvement  in  depth  of  field.  At  a  fixed 

resolution,  DOF  =  k2A0/n  sin2  (#)  .7  Application  of  immersion  techniques  at  157  nm  will  have  a  similar  impact;  how¬ 
ever  a  suitable  liquid  has  yet  to  be  determined. 

There  are  some  issues  that  need  to  be  understood  before  immersion  lithography  can  be  used  in  manufacturing.  One 
issue  is  the  interaction  between  the  immersion  fluid  and  the  last  lens  element.  For  instance  de-ionized  water  (DI  H20) 
will  etch  Calcium- fluoride  (CaF)  lens  elements,  so  a  protective  coating  will  need  to  be  put  on  the  CaF  final  lens  element 
that  will  not  be  influenced  be  the  presence  of  the  DI  water.  Another  issue,  which  is  the  focus  of  this  paper,  is  the  inter¬ 
action  between  the  immersion  fluid  and  the  resist-coated  wafer.  With  liquid  immersion  lithography  there  are  two  new 
interactions  of  concern,  first  any  uptake  of  the  immersion  liquid  into  the  resist  and  the  impact  on  resolution,  and  second 
the  adsorption  of  resist  components  into  the  immersion  liquid  and  contamination  of  the  lens  surface.  Both  of  these 
processes  can  change  the  photoresist  chemistry  and  the  optical  properties  of  the  immersion  liquid. 

This  paper  presents  exposure  results  for  immersion  lithography  at  a  wavelength  of  213  nm,  close  to  the  current  193-nm 
exposure  wavelength.  A  maskless,  interferometric  lithography  approach  that  gives  essentially  ideal  contrast  fringes  was 

8  4-  t- 

used.  Exposures  were  carried  out  on  a  number  of  resists  with  controlled  variations  in  resist  components.  The  images 
of  dense  65-nm  lines  obtained  from  the  resist  exposed  in  air  and  in  HPLC  grade  water  were  studied.  Significant  differ¬ 
ences  were  observed  and  will  be  discussed.  The  use  of  a  protective  coating  for  the  resist  was  also  studied  and  will  be 
discussed. 


2.  EXPERIMENTAL  SETUP 


The  light  source  was  a  Coherent  Infinity  40-100  Nd:YAG  pulsed  (~  3  ns)  laser  followed  by  nonlinear  crystals  to  gener¬ 
ate  a  fifth-harmonic  beam  at  a  wavelength  of  213  nm.  This  source  has  a  much  higher  coherence  length  (~  0.6  m)  than 
that  of  a  typical  193-nm  excimer  laser,  which  greatly  simplifies  the  experimental  arrangement  for  interferometric  lithog¬ 
raphy.  193-nm  resists  generally  have  comparable  absoption  at  193-  and  213-nm  and  work  well  at  this  wavelength.  A 
simple  telescope  with  an  aperture  expands  the  /4-inch  beam  to  about  1.5-inches  at  the  entrance  of  the  immersion  inter¬ 
ferometer  setup. 
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Figure  1  -  Michelson  interferometer  immersion  exposure  setup. 


A  Michelson  interferometer  with  a  coupling  prism  was  designed  and  constructed  for  performing  the  immersion  expo¬ 
sures.  Figure  1  shows  a  diagram  of  the  exposure  setup.  The  simplest  way  to  couple  the  light  into  an  immersion  liquid 
was  to  use  an  isosceles  prism.  The  top  faces  of  the  prism  are  antireflection  coated.  A  drop  of  water  is  used  between  the 
bottom  of  the  prism  and  the  resist  coated  wafer.  Both  laboratory  deionized  water  and  HPLC  water  were  used  with  com¬ 
parable  results.  Figure  2  illustrates  the  two  beam  interference  at  the  liquid-resist  interface.  Here,  ri\  and  n2  denote  the 
indices  of  the  prism  and  immersion  liquid,  and  0i  and  02  are  the  corresponding  propagation  angles  with  respect  to  the 
wafer  normal,  as  drawn  nx  is  larger  than  n2.  The  liquid  layer  is  about  1  mm  thick.  The  period  of  the  grating  exposed  in 

the  resist  is  given  by  A  =  2  —  2-nrtin(02)  '  configurati°n>  the  grating  period  is  independent  of  the  refrac- 


tive  index  of  the  plane-parallel  liquid  layer:  the  change  in  wavelength  according  to  is  exactly  canceled  by  the 

change  in  0  due  to  refraction  at  the  glass-liquid  interface.  In  practice  it  is  desirable  for  the  glass  and  liquid  indices  to  be 
approximately  equal,  in  order  to  minimize  reflections  at  the  interface. 


Figure  2  -  Prism  interface  with  immersion  liquid. 


An  advantage  of  this  arrangement  is  that  the  beam  is  folded  directly  onto  itself,  allowing  for  interference  with  beams 
with  only  small  spatial  and  temporal  coherence  lengths.  Disadvantages  are  the  cumbersome  alignment  required  to 
change  the  pitch,  and  the  lack  of  a  direct  measurement  of  the  liquid  refractive  index.  Fig.  3  shows  a  picture  of  the 
Michelson  interferometer  immersion  exposure  setup  used  for  the  experiments. 


Figure  3  -  Michelson  interferometer  immersion  exposure  setup. 


Half-wave  plates  are  mounted  above  the  prism  and  used  to  rotate  the  polarization  of  the  incident  light.  This  allows  for 
full  control  of  the  polarization  state  of  each  interfering  beam.  TE  polarized  light  always  gives  the  best  contrast  in  the 
interference  pattern  and  was  used  for  all  of  the  results  reported  here.  By  incorporating  the  half-wave  plates  experiments 
can  be  executed  that  mimic  the  imaging  performance  of  a  actual  lithography  system,  in  terms  of  illumination  polariza¬ 
tion  state. 


3.  EXPERIMENTAL  RESULTS 


Two  resist  manufactures  provided  resists  in  which  the  material  was  fully  disclosed  as  to  the  chemical  components  and 
makeup  of  the  resist.  The  first  company  was  TOK  that  provided  six  samples  of  resist.  These  resist  were  formulated 
using  two  different  polymers  and  three  different  bases.  The  second  company  was  AZ  Clariant  that  provided  eight  sam¬ 
ples  of  resist.  Their  resist  were  formulated  using  two  different  polymers,  two  different  PAGs,  and  two  different  bases 

3.1.  TOK  resist  liquid  immersion  imaging  results 

The  TOK  resist  used  Acrylic  polymer  backbones.  Three  of  the  resists  ILP01,  ILP02,  and  ILP03  were  based  on  an 
acrylic  ter-polymer,  and  the  other  three,  ILP04,  ILP05,  and  ILP06,  on  an  acrylic  tetra-polymer.  All  six  resists  used  the 
same  photoacid  generator  (PAG),  Tri-Phenylsufonium  nanofluorobutanesulfonate  (TPSNf).  Three  different  base 
quenchers  were  used.  Resists  ILP01  and  ILP04  used  a  small  molecular  sized  base  triethanolamine.  Resists  ILP02  and 
ILP05  used  a  medium  sized  base  tripentylamine.  Resists  ILP03  and  ILP06  used  a  large  sized  base  tri-n-dodecylamine. 
Table  1  summarizes  the  matrix  of  TOK-ILP  resist  formulations.  All  resist  were  spun  at  a  thickness  of  100  nm  unless 
noted  otherwise,  atop  a  32-nm  thick  Brewer  Science  ARC28-4  anti  reflective  coating  that  was  optimized  to  give  a  mini¬ 
mum  substrate  reflection  for  65 -nm  half-pitch  dense  patterns. 


Base  Quencher 

Triethanolamine  (Bs) 

Tripentylamine  (Bm) 

Tri-n-dodecylamine  (BL) 

Polymer 

Backbone 

Acrylic  ter-polymer 

TOK-ILPOl 

TOK-ILP02 

TOK-ILP03 

Acrylic  tetra-polymer 

TOK-ILP04 

TOK-ILP05 

TOK-ILP06 

Table  1  -  TOK  resist  formulation  matrix 


3.1.1.  Image  comparison  of  resist  formulations 

Figure  4  shows  the  three  Acrylic  ter-polymer  resists  (ILP01,  ILP02,  and  ILP03)  exposed  in  air  and  in  HPLC  H20.  All 
three  samples  gave  reasonable  exposures  in  both  air  and  water.  The  samples  exposed  in  air  had  noticible  top  rounding. 
The  samples  exposed  in  HPLC  H20  a  smaller  degree  top  rounding.  All  the  samples  exposed  in  HPLC  H20  had  better 
line  profiles  than  the  corresponding  air-exposed  samples,  with  the  sample  containing  the  tri-n-dodecylamine  base 
(ILP03)  giving  the  best  results. 
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Figure  4  -  131-nm  pitch  TOK  acrylic  ter-polymer  resist  LIL  results. 

Figure  5  shows  the  three  Acrylic  tetra-polymer  resists  (ILP04,  ILP05,  and  ILP06)  exposed  in  air  and  in  HPLC  H20.  All 
three  samples  gave  reasonable  results  when  exposed  in  air.  Two  of  the  samples  gave  reasonable  exposure  in  the  HPLC 
water.  The  sample  with  the  small  Triethanolamine  base  (ILP04)  had  micro  bridging  and  pattern  collapse  in  the  HPLC 
H20  exposure.  The  samples  exposed  in  air  had  top  rounding  similar  to  the  acrylic  ter-polymer  resists.  The  samples  ex¬ 
posed  in  HPLC  H20  showed  a  reduced  top  rounding,  and  the  smaller  the  base  the  less  rounded  the  line  profile.  Two  of 
the  samples  exposed  in  HPLC  H20  had  better  line  profiles  than  the  corresponding  air-exposed  samples,  with  the  sample 
containing  the  tri-n-dodecylamine  base  (ILP03)  giving  the  best  results. 
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Figure  5  -  131-nm  pitch  TOK  acrylic  tetra-polymer  resist  LIL  results. 


The  samples  made  from  the  acrylic  ter-polymer  gave  slightly  better  results  than  the  samples  made  form  the  acrylic  tetra- 
polymer  when  exposed  in  HPLC  H20.  The  triethonalamine  base  did  not  perform  well  as  the  other  bases  in  the  HPLC 
H20  exposure,  especially  with  the  acrylic  tetra-polymer  backbone.  The  best  TOK  resist  for  immersion  lithography  was 
TOK-ILP03  closely  followed  by  TOK-ILP06  both  containing  the  tri-n-dodecylmine  base. 

3.1.2.  Impact  of  resist  soak  times 

Using  one  of  the  more  successful  TOK  resists  (TOK  ILP06),  the  impact  on  the  time  the  sample  soaked  in  HPLC  water 
was  investigated.  First  a  sample  was  exposed  as  soon  as  it  was  immersed  in  the  water  and  dried  off  right  after  the  expo¬ 
sure  followed  by  a  post-exposure  bake,  for  a  maximum  water  contact  time  of  less  than  30  seconds.  A  second  sample 
was  soaked  in  the  water  for  10  minutes  before  it  was  exposed  and  immediately  dried  off  and  PEBed.  A  third  sample 
was  exposed  as  soon  as  it  was  immersed  in  the  water  and  then  soaked  in  the  water  for  10  minutes  at  before  it  was  dried 
off  and  PEBed.  Figure  6  shows  the  exposure  results  for  the  three  samples. 


Exposed  in  HPLC  H20 
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Figure  6  -  Impact  of  soak  times  on  131-nm  pitch  TOK-ILP06  resist. 

The  10-minute  pre-soak  sample  had  slightly  better  exposure  latitude  and  uniformity,  which  cannot  be  seen  from  Fig.  It 
is  clearly  seen  that  regardless  of  soak  time  in  the  water  all  samples  gave  similar  and  good  line  profiles.  It  is  believed 
that  from  this  and  other  experiments  that  whatever  interaction  happens  between  the  resist  and  water  take  place  in  a  very 
short  time  (less  than  1  second) 

3.1.3.  Impact  of  top  surface  protection 

One  proposed  technology  to  isolate  the  resist  from  the  immersion  liquid  is  the  use  of  a  top  surface  protection  barrier 
layer.  The  barrier  layer  needs  to  be  transparent  to  the  exposure  light  and  to  be  inert  to  both  the  immersion  liquid  and  the 
resist.  TOK  has  formulated  a  floro-carbon  spin-on  top  protective  coating  TSP-03A,  which  meets  these  requirements  for 
a  barrier  layer.  Two  additional  steps  are  added  to  the  processing.  First  after  coating  and  post-application  baking  the 
wafer  with  resist,  an  additional  spin-on  and  bake  sequence  is  used  to  coat  the  TSP-03A  material  atop  the  resist.  Second 
after  the  post  exposure  bake,  the  TSP-03A  coating  is  removed  using  a  special  solvent  TSP-3  remover  before  the  resist 
development. 

Figure  7  shows  TOK-ILP01,  TOK-ILP03,  and  TOK-ILP06  resist  exposed  in  air  with  and  without  the  TSP-03A  coating. 
As  expected,  the  samples  with  the  protective  coating  look  practically  the  same  as  the  samples  without  the  coating  when 
exposed  in  air. 
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Figure  8  -  13 1-nm  pitch  TOK  resist  exposed  in  HPLC  H20  with  and  without  a  top  surface  protection  coating. 

Other  experiments  suggest  that  the  photo-acid  generated  upon  exposure  is  leached  out  of  the  top  few  monolayers  of  the 
resist  into  the  water.  This  would  also  support  the  squaring  of  the  line  profiles  for  the  immersion  exposure,  as  the  top  of 
the  resist  becomes  less  sensitive  as  compared  to  the  bulk  of  the  resist. 


Although  a  top  surface  protection  layer  does  prevent  resist  water  interaction,  there  are  resists  that  perform  well  when 
exposed  in  water.  From  an  imaging  point  of  view  there  is  no  real  need  for  the  added  cost  and  complexity  of  using  a  top 
surface  protection  layer.  Even  at  this  early  stage  of  development  it  looks  possible  to  engineer  the  resist  with  compo¬ 
nents  that  work  well  in  a  liquid  immersion  lithography  system. 


3.1.4.  Evaluation  through  half-pitch 

Using  one  of  the  better  TOK  resists  (TOK-ILP03),  the  resist  profiles  were  evaluated  as  the  half-pitch  was  decreased 
until  the  resist  was  not  resolving  the  dense  line  and  space  structure  or  having  poor  imaging  capabilities.  Figure  9  shows 
the  resist  as  the  half-pitch  decreases  from  87  nm  in  air  to  45  nm  in  HPLC  H20.  The  dense  line-space  profiles  of  the 
images  are  square  and  well  defined  down  to  a  50  nm  half-pitch  in  HPLC  H20.  As  the  half-pitch  is  reduced  to  45  nm  the 
dense  lines-spaces  exhibit  severe  micro  bridging  and  pattern  collapse.  50  nm  lines  and  spaces  is  about  the  usable  reso¬ 
lution  limit  of  the  TOK-ILP03  resist  under  the  present  exposure  and  process  conditions. 


64  nm  (H20)  50  nm  (H20)  45  nm  (H20) 


Figure  9  -  TOK-ILP03  resist  exposures  through  half-pitch. 


3.1.5.  Impact  of  a  rinse  surfactant 

One  of  the  problems  with  imaging  65 -nm  half-pitch  lines  is  that  at  aspect  ratio  larger  than  2.5  there  is  frequently  com¬ 
plete  pattern  collapse.  This  is  due  to  surface  tension  of  water  drying  unevenly  between  the  lines  after  the  development 
rinse.  The  simplest  way  to  reduce  this  pattern  collapse  is  by  reducing  the  surface  tension  of  water  via  adding  a  surfac¬ 
tant  to  the  water.  At  a  65 -nm  half-pitch  pattern  collapse  due  to  adhesion  tension  will  not  occur  if  the  resist  thickness  is 
at  or  below  100  nm,  however  at  150  nm  and  larger  there  is  often  pattern  collapse  due  to  adhesion  tension. 

Figure  10  shows  images  of  65 -nm  half-pitch  dense  lines  exposed  in  air  and  DI  H20  with  and  without  a  rinse  surfactant 
for  both  100-nm  and  150-nm  thick  lines  in  TOK-ILP03  resist.  There  was  no  pattern  collapse  in  any  of  the  images. 
Based  on  other  experience  at  these  with  these  resists,  collapse  was  expected  that  the  150-nm  thick  lines  exposed  in  air 
and  DI  H20  without  the  rinse  surfactant  would  show  pattern  collapse  due  to  water  adhesion  tension  during  wafer  drying. 
A  much  more  controlled  investigation  of  pattern  collapse  limits  at  these  scales  is  needed  to  gain  a  fuller  understand  of 
the  limitations  and  control  strategies. 
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Figure  10  -  129-nm  pitch  TOK-ILP03  resist  exposed  with  and  without  a  rinse  surfactant. 


3.2.  AZ  Clariant  resist  liquid  immersion  imaging  results 

The  AZ  Clariant  provided  eight  resist  formulations.  Four  of  the  resists  used  a  COMA  hybrid  polymer  T2030  backbone 
and  the  remaining  four  resists  used  an  acrylic  polymer  T518  backbone.  Figure  11  shows  the  chemical  composition  of 
the  two  polymers  used  in  the  AZ  Clariant  resists.  Half  of  the  resists  used  a  large  molecule  TPSNF  PAG  similar  to  the 
TOK  resist,  and  the  other  half  used  a  small  TPSTf  PAG.  AZ  Clariant  also  used  two  base  quenchers,  a  large  molecule 
trioctylamine  base  and  a  small  diethanolamine  base.  Table  2  summarizes  the  matrix  of  AZ  Clariant  resist  formulations. 
All  resists  were  spun  on  a  32-nm  thick  Brewer  Science  ARC28-4  anti  reflective  coating  that  was  optimized  to  give  a 
minimum  substrate  reflection  for  65-nm  half-pitch  dense  patterns. 
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Figure  11  -  AZ  Claraint  polymer  backbone  chemical  composition. 
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Table  2  -  AZ  Clariant  resist  formulation  matrix 


3.2.1.  Image  comparison  of  resist  formulations 

Figure  12  shows  65-nm  half-pitch  dense  lines  in  130-nm  thick  hybrid  polymer  T2030  resist.  There  were  issues  of  pat¬ 
tern  collapse  at  this  resist  thickness,  and  no  rinse  surfactant  was  used.  In  air  there  was  noticeable  T-toping  of  the  line 


profiles,  with  the  large-PAG,  small-base  resist  formulation  having  imaging  issues.  If  the  pattern  collapse  is  neglected 
the  smaller  PAG  gave  better  profiles  for  the  air  exposures.  In  the  HPLC  H20  exposure,  the  small-PAG  resist  formed  a 
skin  and  once  the  developer  penetrated  this  surface  layer,  the  remaining  resist  lines  were  completely  developed  away. 
This  suggests  that  the  small  PAG  is  completely  leaching  away  from  the  surface  of  the  resist  into  the  water.  Again  ig¬ 
noring  the  pattern  collapse  issues,  the  resist  with  the  large  PAG  and  large  base  gave  the  best  results  in  HPLC  H20  for 
the  hybrid  T2030  polymer  resist. 
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Figure  12  -  131-nm  pitch  AZ  Clariant  hybrid  T2030  polymer  resist  LIL  results. 

Figure  13  shows  65 -nm  half-pitch  dense  lines  in  120-nm  thick  Acrylic  polymer  T518  resist.  Like  the  T2030  polymer 
resist  there  were  issues  of  pattern  collapse  as  the  resist  thickness  was  thicker  than  the  desired  100  nm.  The  AZ  Clariant 
resists  all  imaged  well  in  air  similar  to  the  TOK  Acrylic  resist,  ignoring  the  pattern  collapse.  The  resist  with  the  smaller 
PAG  had  reduced  T-toping  and  gave  better  results  in  air.  In  the  HPLC  H20  exposures,  ignoring  pattern  collapse,  the 
large  PAG  had  slightly  more  T-toping  then  the  small  PAG,  and  no  real  noticeable  difference  between  the  small  and 
large  base  resist  images. 
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Figure  13  -  131-nm  pitch  AZ  Clariant  Acrylic  T518  polymer  resist  LIL  results. 


From  the  AZ  Clariant  resist  results  it  is  clear  that  the  acrylic  resists  work  better  for  liquid  immersion  lithography.  The 
size  of  the  PAG  made  noticeable  differences  for  both  the  hybrid  and  acrylic  polymer  resist.  The  best  AZ  Clariant  resist 


was  the  Acrylic  polymer  T5 18  with  the  small  PAG  and  large  Base.  The  Acrylic  resist  should  be  reevaluated  at  a  thinner 
formulation  that  allows  for  100  nm  resist  film  thickness. 

3.2.2.  Evaluation  through  half-pitch 

Using  an  improved  Acrylic  AZ  Clariant  resist  (EXP  M3 000)  spun  at  100-nm  thick,  the  resist  profiles  were  evaluated  as 
the  half-pitch  was  decreased  until  the  resist  was  not  resolving  the  dense  line  and  space  structure  or  having  poor  imaging 
capabilities.  Figure  14  shows  the  resist  as  the  half-pitch  decreases  from  87  nm  in  air  to  45  nm  in  HPLC  H20. 

87  nm  (Air)  7 1  nm  (H20)  64  nm  (H20) 
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Figure  14  -  AZ  Clariant  EXP  M3 000  resist  exposures  through  half-pitch. 


3.2.3.  Evaluation  of  contact  holes 

Using  the  same  Acrylic  AZ  Clariant  resist  (EXP  M3000)  spun  at  100-nm  thick,  dense  contact  holes  were  exposed  in  the 
resist  and  the  profiles  were  evaluated.  Contact  holes  are  achieved  by  performing  two  back-to-back  exposures  where  the 
sample  was  rotated  by  90  degrees  between  the  two  exposures.  Figure  15  shows  the  65  nm  dense  contact  holes  array  in 
the  AZ  Clariant  resist. 

Exposed  in  Air 
Exposed  in  DI  H20 


Figure  15  -  129-nm  pitch  contact  holes  in  AZ  Clariant  EXP  M3 000  resist. 


4.  SUMMARY 


4.1.  Conclusions 

Immersion  lithography  reduces  the  minimum  accessible  pitch  by  a  factor  of  the  index  of  refraction  n  of  the  immersion 
medium.  The  work  in  this  paper  has  demonstrated  that  45 -nm  half-pitch  lines  can  be  patterned  into  resist  using  liquid 
immersion  lithography  (LIL)  with  a  NA=1.18.  This  would  allow  for  193-nm  lithography  to  be  extended  out  at  least  2 
technology  nodes. 

Some  but  not  all  existing  193-nm  resists  work  well  in  LIL.  Based  on  these  preliminary  results,  it  seems  likely  that  193- 
nm  LIL  resists  can  be  formulated  with  fully  comparable  performance  to  today’s  193-nm  air-exposed  resists.  The  acrylic 
polymer  resist  platform  performs  well  in  LIL.  It  is  also  clear  that  PAGs  and  bases  play  an  important  role,  although 
more  studies  are  needed  to  find  an  optimal  set  of  PAGs  and  bases  for  193-nm  LIL  resist. 

Resist  top  surface  protection  coatings  exist  that  work  well  with  193-nm  resist  and  water  immersion  liquid.  From  a  im¬ 
aging  stand  point  a  top  surface  protection  coating  is  most  likely  not  necessary,  and  just  adds  cost  and  complexity  to  the 
processing. 

As  LIL  enables  the  printing  of  ever-smaller  lines  and  spaces,  the  issue  of  pattern  collapse  due  to  adhesion  tension  during 
post  develop  wafer  drying  will  become  increasingly  important.  Rinse  surfactants  should  reduced  pattern  collapse  is¬ 
sues,  however  resist  thicknesses  likely  will  continue  to  decrease,  posing  issues  for  subsequent  pattern  transfer  steps. 
The  problem  of  pattern  collapse  needs  further  study;  a  new  resist  with  larger  Young’s  modulus  will  help  combat  these 
issues. 

Based  on  all  the  work  presented  here  there  are  no  showstoppers  to  liquid  immersion  lithography.  There  are  known  solu¬ 
tion  to  deal  with  the  interactions  between  resist  and  the  water  immersion  liquid.  The  resist  can  be  engineered  and  de¬ 
veloped  for  45-nm  half-pitch  dense  lines  in  LIL  and  beyond. 


4.2.  Future  studies 


Work  will  need  to  be  continued  to  evaluate  new  193-nm  resists  formulations  for  liquid  immersion  lithography.  These 
resist  will  also  need  to  be  evaluated  at  and  below  the  45 -nm  half-pitch  node  to  understand  the  true  limits  to  the  resist. 
The  resist  will  also  need  to  be  evaluated  for  contact  holes,  as  resist  performance  tends  to  differ  between  contact  holes, 
dense  lines  and  isolated  lines. 

Detailed  studies  will  need  to  be  performed  on  pattern  collapse.  The  limit  of  resist  aspect  ratios  for  a  given  resist  system 
needs  to  be  studied  at  the  65-nm  and  45 -nm  nodes.  Evaluation  of  rinse  surfactants  and  possible  developer  surfactant 
needs  to  be  performed  at  the  LIL  technology  nodes. 

Polarization  effects  are  going  to  become  very  important  on  hyper  NA  systems.  These  effects  need  to  be  quantified  on 
an  LIL  system  and  their  corresponding  impacts  on  resist  imaging  evaluated. 

Currently  purified  water  with  a  refractive  index  of  1 .44  has  been  identified  as  the  preferred  liquid  immersion  medium. 
The  resolution  limits  of  193-nm  lithography  can  be  further  improved  if  a  viable  new  liquid  is  found  with  an  even  higher 
refractive  index.  Investigation  for  new  liquid  immersion  media  needs  to  be  continued. 

A  hyper  NA  imaging  system  needs  to  be  developed  and  demonstrated  in  LIL.  Evaluation  of  the  improved  depth  of  fo¬ 
cus  (DOF)  and  process  latitude  needs  to  be  performed.  New  exposure  strategies  will  be  needed  for  high  NA  imaging. 
Polarization  sensitivity  will  impact  existing  reticle  enhancement  techniques,  and  may  require  new  strategies  that  involve 
separation  high  spatial  frequencies  in  the  orthogonal  x-  and y-  directions.  Arbitrary  65-nm  and  45 -nm  patterns  must  be 
evaluated  in  the  new  193-nm  immersion  resist. 
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ABSTRACT 

The  minimum  half  pitch  available  to  a  lithography  tool  is  given  by  the  well-known  equation  Amin_haif.  =  X/Ansm 6, 
where  A  is  the  exposure  wavelength,  n  is  the  refractive  index  of  the  immersion  medium  and  26  is  the  angle  between 
the  propagation  directions  of  the  two  beams.  Using  water  ( n  =  1.44)  as  an  immersion  medium,  the  highest  spatial 
frequency  available  with  ArF-based  (193  nm)  lithography  tools  with  an  NA  of  1.3  (1.44x0.9)  corresponds  to  a  half¬ 
pitch  of  37  nm  suggesting  that  the  45-nm  half-pitch  node  should  be  accessible.  A  detailed  simulation  study  (PRO- 
LITH™  8)  is  reported  for  two  different  approaches  to  printing  for  this  node.  Dipole  illumination  (with  two  masks) 
as  well  as  imaging  interferometric  lithography  (with  a  single  mask  and  multiple  exposures  incorporating  pupil  plane 
filters)  is  shown  to  be  capable  of  printing  arbitrary  structures  under  these  conditions.  Because  of  the  loss  of  contrast 
for  TM-polarization  at  the  high  spatial  frequencies  at  this  node  the  high  spatial  frequencies  in  the  x-  and  y-directions 
need  to  be  printed  with  different  polarizations  in  order  to  retain  the  necessary  contrast.  This,  in  turn,  will  require 
modification  of  the  illumination  system  or  a  multiple  exposure  approach  to  allow  the  necessary  polarization  control. 

1.  INTRODUCTION 


The  current  industry  roadmap  calls  for  initial  volume  manufacturing  of  the  45-nm  half-pitch  node  in  the  2005-2007 
time  frame,  which  in  turn  demands  an  immediate  decision  on  the  manufacturing  technology.  It  now  appears  that 
next  generation  lithography  techniques  will  be  hard  pressed  to  meet  this  time-line  and  attention  has  turned  to  exten¬ 
sions  of  optical  lithography.  Recently,  water  immersion  lithography1-6  at  193  nm  has  emerged  as  the  industry  con¬ 
sensus  for  the  next  major  optical  lithography  extension.  The  minimum  half  pitch  available  to  a  lithography  tool  is 
given  by  the  well-known  equation, 

Amin  half.  2/  (1) 

where  X  is  the  exposure  wavelength,  n  is  the  refractive  index  of  the  immersion  medium  and  the  two  beams  interfer¬ 
ing  to  create  a  sinusoidal  pattern  at  the  resist  are  propagating  in  the  water  at  angles  of  ±0  with  respect  to  the  wafer 
surface  normal.  For  193-nm  exposures  with  an  immersion  medium  of  water  ( n  =  1.44)7  and  a  maximum  sin#  of  0.9, 
this  minimum  half-pitch  is  37  nm,  suggesting  that  a  system  with  these  capabilities  should  be  capable  of  printing  ar¬ 
bitrary  45-nm  half-pitch  patterns. 

At  the  high  angles  in  the  resist  that  these  dimensions  and  wavelength  imply,  there  are  significant  differences  in  the 
resist  response  to  TE  and  TM  polarizations;  in  particular  the  TM  polarization  aerial  image  loses  contrast  and  even 
inverts  polarity  (exchanging  lines  and  spaces).  To  circumvent  these  adverse  effects,  the  high  spatial  frequencies  in 
each  direction  need  to  be  printed  with  optimized  (orthogonal)  polarizations.  This,  in  turn,  will  require  modification 
of  the  illumination  system  or  some  form  of  multiple  exposure  lithography  to  allow  the  necessary  polarization  con¬ 
trol.  Two  imaging  schemes8  that  lend  themselves  to  this  constraint  are  imaging  interferometric  lithography9,10  (IIL), 
which  uses  off-axis  illumination  at  the  limits  of  the  pupil  along  with  pupil-plane  filters  to  control  the  image  fidelity, 
and  dipole  illumination,  where  different  masks  and  off-axis  illumination  directions  are  used  for  the  high  spatial  fre¬ 
quency  content  in  the  x-  and  y-directions  of  a  Manhattan  geometry  pattern.  Both  of  these  techniques  offer  independ¬ 
ent  control  of  the  polarization  for  the  x-  and  y-direction  exposures.  In  section  2,  the  polarization  effects  in  immer¬ 
sion  lithography  are  discussed.  Section  3  presents  modeling  results  for  partial  coherent  illumination,  IIL,  and  dipole 
illumination  for  a  simple  nested-ell  test  structure.  Finally  conclusions  and  future  research  directions  are  outlined  in 
Section  4. 
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2.  POLARIZATION  EFFECTS  IN  IMMERSION  LITHOGRAPHY 


The  polarization  effects  can  readily  be  understood  in  terms  of  a  simple  interferometric  lithography  (IL)  exposure 
with  beams  incident  on  the  wafer  from  the  extremes  of  the  lens  pupil.  It  is  straightforward  to  calculate  the  interfer¬ 
ence  pattern  for  two-beam  IL  with  equal  intensity  beams  with  TE-  and  TM-polarizations. 


ITe(x)~  21 o 
1tm(x)=  21 o 
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where  70  is  the  intensity  of  each  beam  and  the  subscripts  emphasize  that  the  refractive  index  and  internal  angles 
within  the  resist  are  used  in  evaluating  these  expression.  The  TE  result  retains  full  contrast  over  the  entire  accessible 
half-pitch  range  independent  of  6resist.  The  additional  factor  of  cos (20resist)  that  appears  in  the  TM  expression  has  a 
dramatic  impact  on  the  aerial  image.  For  6resist  =  45°,  the  contrast  of  the  interference  pattern  goes  to  zero;  at  higher 
angles  (smaller  half-pitches)  there  is  a  phase  shift  so  that  lines  and  spaces  are  interchanged.  These  polarization  ef¬ 
fects  have  not  yet  become  a  significant  issue  with  current  lithography  tools.  Unpolarized  illumination,  which  aver¬ 
ages  the  effects  of  the  two  possible  polarizations,  is  adequate  for  most  imaging.  This  is  because  the  immersion  fluid 
today  is  air  (n  =  1)  while  the  index  of  a  typical  resist  is  considerably  higher  (nresist  ~  1.7).  As  a  consequence  of 
Snell’s  law,  the  propagation  angles  in  the  resist  are  considerably  smaller  than  in  the  air.  For  a  period  corresponding 
to  the  90-nm  half-pitch  node,  sin^>  ~  0.54,  the  propagation  angle  in  air  is  32°  so  Q resist  is  only  18°  and  the  corre¬ 
sponding  contrast  for  TM  polarization  is  84%.  However,  for  the  45-nm  half-pitch  node,  the  propagation  angle  of  48° 
in  the  immersion  fluid  corresponds  to  Qresist  -39°  where  the  contrast  for  TM  polarization  is  only  21%! 

This  is  illustrated  in  Fig.  1(a),  which  shows  the  intensity  distribution  versus  position  in  the  resist  for  periods 
corresponding  to  the  90-,  65-  and  45-nm  half-pitch  nodes.  The  top  trace  is  for  TE  polarization  that  retains  ideal  con¬ 
trast  independent  of  the  pitch.  The  TM  curves  for  the  90-,  65-  and  45-nm  half-pitch  nodes  are  also  shown.  Figure 
1(b)  shows  the  TM  visibility,  (7max  —  /min)/ (/max  +  /min),  as  a  function  of  the  half  pitch.  For  the  90-nm  node,  the  TM 
visibility  is  84%,  dropping  to  62%  for  the  65-nm  node  and  21%  for  the  45-nm  node.  The  line-edge  roughness  is 
strongly  dependent  on  the  visibility11  so  that  this  loss  of  contrast  impacts  the  quality  of  the  image  in  several  ways. 
The  immersion  fluid  also  impacts  the  reflection  at  the  liquid-photoresist  interface,  generally  offering  partial  anti- 
reflection  coating  and  improving  the  energy  transfer  across  the  interface. 
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Figure  1:  (a)  Intensity  profiles  in  resist  for  65-nm  half  pitch  TE  polarization  (top)  with  full  contrast,  and  for  90-,  65- 
and  45-nm  half-pitch  patterns  with  TM  polarization.  The  decrease  of  contrast  is  severe,  (b)  Visibility  of  the  TM  po¬ 
larization  pattern  in  the  resist  as  a  function  of  half-pitch.  The  shaded  region  is  only  accessible  with  immersion. 


3.  IMMERSION  LITHOGRAPHY  SIMULATIONS  FOR  ARBITRARY  PATTERN 


Figure  2  shows  the  mask  pattern  that  has  been  used  in  all  of  the  simulations  reported  here.  It  consists  of  a  series  of 
five  nested  “ells”  at  a  45-nm  half-pitch  CD  and  a  large  box  at  10><CD.  For  the  nested  features,  the  middle  feature 
extends  beyond  the  others  (a  different  amount  in  the  x-  and  y-directions)  to  give  some  indication  of  dense-isolated 
biases.  For  imaging  interferometric  lithography,  this  mask  is  used  without  any  OPC  or  other  RETs.  For  dipole  illu¬ 
mination,  the  features  are  split  into  two  separate  masks  as  described  below.  Simulations  were  carried  out  with  PRO- 
LITH™  Version  8  (KLA-Tencor)  in  full  vector  electro-magnetic  mode  and  with  the  full  physical  model  for  the 
photoresist  (PEB  diffusion,  3D  etching).  In  all  cases,  the  simulations  used  the  same  film  stack.  Substrates  were  bare 
silicon  with  a  48-nm  thick  bottom  anti-reflection  coating  (BARC)  ( n  =  1.7059  +  z'0.335)  and  a  50-nm  thick,  positive 
photoresist  (n  =  1.703  +  /0.0172;  Dill  parameters  A  =  0,  B  =  1.122  jam"1,  C  =  0.068  cm'2  /mJ).  Standard  post¬ 
exposure  bake  (1 10°C  for  60  s)  and  develop  conditions  (60  s)  were  used  for  all  simulations.  As  the  previous  section 
indicated,  separating  the  exposures  of  the  high  spatial  frequencies  in  the  x-  and  y-directions  and  using  orthogonal 
polarizations  for  the  two  cases,  so  that  all  exposures  are  dominantly  TE  polarization,  will  be  key  to  imaging  at  the 
limits  of  immersion  lithography.  The  simulations  presented  below  bear  this  out.  In  the  following  sections,  simulation 
results  for  two  approaches  to  imaging  the  45-nm  half-pitch  node 
at  1 93  nm  using  water  immersion  are  presented. 

The  highest  spatial  frequencies  possible  in  an  optical  system 
correspond  to  two  beams  at  opposite  ends  of  a  pupil  diameter, 
mimicking  2-beam  interferometric  lithography,  e.g.  to  coherent 
illumination  (cr=  0)  with  the  incident  beam  tilted  to  one  edge  of 
the  pupil  and  the  high  frequency  diffracted  beam  from  the  mask 
captured  at  the  opposite  pupil  edge.  Partial  coherence  necessarily 
degrades  this  resolution  as  it  involves  beams  with  lower  tilt  that 
do  not  capture  the  highest  frequencies  and  lower  the  overall  im¬ 
age  transfer  function.  Imaging  interferometric  lithography  (IIL), 
essentially  off-axis  illumination  at  the  edge  of  the  pupil  along 
with  imaging  pupil-plane  filters  to  ensure  uniform  weighting  of 
all  spatial  frequencies  is  discussed  first.  Second  is  dipole  illumi¬ 
nation  at  the  edges  of  the  pupil,  with  two  masks  that  separate  out 
the  high  x-  and  high  y-spatial  frequency  parts  of  the  image.  In 
both  cases  the  polarization  is  rotated  between  exposures  to  en¬ 
sure  predominately  TE  polarization  exposures. 

3.1  Imaging  interferometric  lithography 

Imaging  interferometric  lithography  (IIL)  was  introduced  as  a  technique  to  cover  the  extremes  of  frequency  space  by 
tilting  the  illumination  as  far  as  possible  off-axis.  With  the  modest  NA’s  of  ~  0.5  to  0.65  that  were  available  at  the 
time,  this  required  tilts  that  were  beyond  the  limits  of  the  imaging  pupil  and  consequently  reintroduction  of  a  zero- 
order  beam  on  the  image  side  of  the  lens.  With  today’s  very  high  NA  lenses  of  sin#  ~  0.9,  the  gain  for  the  significant 
added  optical  complexity  of  building  interferometers  around  the  lens  is  small.  With  transmission  of  the  zero-order 
beam  through  the  imaging  optics,  IIL  is  similar  to  quadrupole  off-axis  illumination  with  a  very  small  partial  coher¬ 
ence  for  each  off-axis  illumination  beam,  and  with  the  quadrupole  oriented  along  the  principal  (x,  y)  axes  of  a  Man¬ 
hattan  geometry  pattern  to  maximize  the  spatial  frequency  capture.  The  polarization  of  each  of  the  off-axis  illumina¬ 
tion  beams  can  be  independently  controlled  to  optimize  the  imaging.  Quadrupole  off-axis  illumination  inherently 
emphasizes  low  frequencies  at  the  expense  of  high  frequencies10,12  because  each  of  the  exposures  includes  the  same 
low-frequency  information  while  the  individual  high  frequencies  are  covered  only  in  selected  exposures.  Pupil  plane 
filters  eliminate  these  multiple  coverages  and  provide  an  improved  image,  close  to  the  theoretical  frequency  space 
limits.  Thus,  IIL  for  sin#  ~  0.9  lenses  becomes  quadrupole  OAI  with  tilts  to  the  edges  of  the  imaging  pupil  along 
with  pupil-plane  filters.  Figure  3  shows  the  result  of  this  procedure  for  the  45-nm  half-pitch  pattern  without  using 
Phase  Shift  Mask  (PSM)  or  any  OPC.  The  pattern,  Fig.  3(a),  is  well  resolved.  The  impacts  of  the  limited  resolution 
include  rounding  of  the  line  edges  and  some  line  edge-ripple.  The  pupil  plane  filters  used  in  this  simulation  are 
shown  in  Fig.  3(b). 
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Figure  2:  Test  pattern  used  for  the  simulations 


Figure  3:  Modeling  of  IIL.  (a)  gives  the  final  real-space  image,  (b)  shows  the  pupil  filters  used  in  the 
off-axis  exposures,  (c)  shows  the  resulting  frequency- space  coverage  along  with  the  intensity  Fourier 
transform  of  the  desired  mask  pattern.  The  gray  shadings  are  simply  to  denote  the  frequency  cover¬ 
age,  the  effective  MTF  for  the  linear  imaging  terms  is  uniform  across  the  image.  The  polarization  of 
the  individual  exposures  is  indicated. 


In  the  simulation,  two  offset  exposures  from  opposite  directions  are  included  to  restore  a  telecentric  response.  The 
polarization  is  controlled  in  each  of  the  exposures  as  indicated  in  Fig.  3(c)  to  ensure  a  dominant  TE  polarization, 
especially  for  the  highest  spatial  frequencies.  Figure  3(c)  shows  the  resulting  spatial  frequency  coverage  with  the 
intensity  Fourier  transform  of  the  image  superimposed.  The  different  gray  levels  in  this  figure  are  simply  to  distin¬ 
guish  individual  exposure  coverages.  The  effective  modulation  transfer  function  for  the  linear  imaging  terms  is  uni¬ 
form  across  the  entire  accessible  frequency  space.  The  impact  of  polarization  control  is  shown  in  Figure  4.  This  re¬ 
sult  is  for  the  same  pupil  filters  as  the  result  of  Fig.  3,  the  only  difference  is  that  a  fixed  polarization  has  been  used 
for  all  exposures.  The  impact  of  having  the  “wrong”  polarization  for  the  x-directed,  high-spatial  frequencies  is  clear. 


Figure  4:  Exposures  identical  to  Fig.  3  with  except  for  the  polarization. 

The  impact  of  using  the  “wrong”  polarization  for  the  x-directed  high- 
spatial  frequencies  is  clear. 

Symmetric  results  were  obtained  if  y-directed  polarization  is  used  for  all  exposures,  the  opposite  parts  of  the  ell  did 
not  print.  This  result  confirms  the  simple  grating  analysis  given  above  -  it  will  be  critical  to  independently  optimize 
the  polarization  for  the  high  spatial  frequencies  involved  in  printing  45 -nm  node  patterns  by  immersion  with  a  193- 
nm  source.  The  impact  of  defocus  on  the  IIL  pattern  is  shown  in  Figure  5.  At  a  80-nm  defocus,  the  pattern  is  largely 
maintained.  At  a  100-nm  defocus,  dense/isolated  bias  is  evident  in  the  line-end  shortening  and  thinning  of  isolated 
features.  Finally,  at  a  130-nm  defocus  the  lines  begin  to  merge  together  and  a  hole  has  developed  in  the  large  box. 
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Figure  5:  Evolution  of  the  IIL  image  of  Fig.  5  though  focus  showing  (a)  80  nm  out  of  focus  (b)  100  nm  out  of 
focus  and,  (c)  130  nm  out  of  focus.  The  depth-of- field  is  of  order  160  nm. 


The  addition  of  OPC  should  extend  the  depth-of-field  (DOF)  which  is  ~  160  nm  from  these  simulations.  It  is  impor¬ 
tant  to  recognize  that  the  large  off-axis  angles  of  IIL  (or  dipole  illumination,  below)  maximize  the  DOF  compared 
with  conventional  on-axis  illumination  for  these  high  spatial  frequency  structures.  In  the  limit  of  a  periodic  structure 
where  the  off-axis  illumination  is  chosen  to  symmetrize  the  zero-  and  first-order  diffracted  beams  on  the  image  pu¬ 
pil,  i.  e.  interferometric  lithography,  the  DOF  approaches  infinity  since  both  beams  are  inclined  at  the  same  azi¬ 
muthal  angle  (0).  The  frequently  cited  Rayleigh  result  ( DOF  ocK2/'k  NA2)  does  not  hold  for  this  situation. 

The  importance  of  incorporating  the  pupil-plane  filters  is  shown  by  the  simulation  result  in  Fig.  6  for  the  same  pat¬ 
tern  using  only  the  offset  exposures  but  without  using  any  pupil  plane  filters.  The  dense  features  are  not  completely 
resolved  and  there  is  a  significant  dense-isolated  bias.  The  frequency  space  coverage  shown  in  Fig.  6(b)  shows  that 
the  low  frequency  regions  of  the  pattern  are  included  in  multiple  exposures  while  the  high  frequency  components  are 
only  counted  once.  The  resulting  overemphasis  on  the  low  frequencies  is  the  reason  for  the  inability  to  resolve  the 
dense  features.  This  exposure  strategy  is  equivalent  to  dipole  illumination,  but  with  a  single  mask  with  all  of  the 
features  included.  The  next  section  will  show  that  somewhat  better  results  are  available  by  parsing  the  features  be¬ 
tween  two  masks 


Figure  6:  (a)  Result  of  IIL  exposures  without  pupil  plane  filters.  The  dense  features 
are  not  resolved,  (b)  Corresponding  frequency  space  coverage.  The  multiple  cover¬ 
age  of  the  low  frequency  regions  leads  to  the  lack  of  resolution. 


3.2  Dipole  illumination 


Dipole  illumination,  where  the  mask  is  split  into  two  sub-masks  each  with  the  high  spatial  frequencies  in  one  direc¬ 
tion  is  an  alternative  exposure  technique  that  offers  the  necessary  polarization  flexibility.  The  wide  range  of  spatial 
frequencies  in  the  present  test  pattern  poses  some  difficulties  for  this  approach.  Figure  7  shows  an  example  of  coher¬ 
ent  (a  =  0)  dipole  illumination  at  an  offset  of  0.9  just  as  was  used  for  the  IIL  example  above. 


Figure  7:  Dipole  illumination  with  two  masks  (a)  and  (b)  separating  the  high  small  features  in  the 
x-  andy-directions.  The  large  box  was  eliminated,  since  it  was  not  possible  to  print  the  pattern 
when  it  was  present.  The  illumination  offsets  were  at  the  edge  of  the  imaging  pupil  and  the  fre¬ 
quency  space  coverage  was  identical  to  that  shown  in  Fig.  6.  Because  the  large  box  was  absent,  the 
frequency  space  overlaps  were  tolerable. 


The  large  box  has  been  removed  from  the  pattern  and  the  remaining  features  split  between  separate  masks  for  the  x- 
and  y-directions  as  shown.  The  polarization  is  optimized  separately  for  exposure  of  each  mask.  The  dense/isolated 
bias  is  more  pronounced  in  this  configuration  and  the  comers  of  the  ells  are  less  robustly  defined,  even  though  these 
regions  were  left  unexposed  in  both  masks.  Alignment  of  the  two  masks  between  exposures,  and  precise  control  of 
line-edge  placement  are  added  requirements  for  this  technique.  Figure  8  shows  the  result  of  a  single  mask  exposure 
including  the  large  box  (left  column).  The  individual  lines  close  to  the  box  are  not  resolved.  The  two  right  columns 
show  two  different  dark  field  (inverted  image)  mask  strategies. 


Figure  8:  Results  of  three  different  dipole  mask  strategies.  A  single,  bright-field  mask 
printing  only  the  lines  parallel  to  the  y-axis  and  the  large  box.  It  was  not  possible  to  print 
both  the  lines  and  the  box  in  a  single  exposure.  Also  shown  are  two  different  attempts  at 
dark  field  (inverted)  images.  The  second  mask  for  the  first  case  (full  large  box)  included 
only  the  lines  parallel  to  the  x-axis  with  no  box.  The  second  mask  for  the  second  case  (tri¬ 
angular  box  segment)  was  symmetric  to  the  mask  shown. 


In  the  center  column,  the  mask  includes  the  entire  box  while  the  second  mask  (not  shown)  has  only  the  dense  fea¬ 
tures  parallel  to  the  x-axis.  Again  in  this  case  there  is  difficulty  in  separating  the  dense  lines  from  the  box,  and  there 
is  significant  distortion  of  the  pattern.  The  right-hand  column  shows  a  more  symmetric  division  of  the  mask.  This 
produced  the  best-printed  pattern,  but  still  shows  significant  distortion  and  line-width  variation.  More  detailed  inves¬ 
tigations  are  necessary  to  optimize  these  dipole  illumination  strategies.  In  particular,  there  is  overcounting  of  the  low 
frequencies  in  this  case  as  well  and  pupil-plane  filters  may  help  the  imaging. 


4.  CONCLUSIONS 

A  detailed  simulation  of  arbitrary  pattern  lithography  for  the  45 -nm  half-pitch  node  using  a  193-nm  source  with  a 
NA  =  1.3  water- immersion  lithography  tool  has  been  presented.  A  requirement  of  this  the  physics  of  interference  at 
high  angles  is  to  optimize  the  polarization  individually  for  different  parts  of  the  image,  which  in  turn  requires  a  ma¬ 
jor  change  to  illumination  optics  and  will  impact  tool  strategies  and  cost  of  ownership  evaluations.  In  order  to  access 
the  highest  available  spatial  frequencies  -  necessary  for  printing  45-nm  half-pitch  structures,  off-axis  illumination 
near  the  edges  of  the  imaging  pupil  will  be  required.  Two  strategies  for  separating  the  exposures  to  allow  for  the 
necessary  polarization  control  have  been  presented.  Imaging  interferometric  lithography,  related  to  off-axis  illumi¬ 
nation  with  the  addition  of  pupil  plane  filters  to  adjust  the  details  of  the  image,  provides  relatively  robust  results. 
Different  pupil  plane  filter  strategies  have  been  presented  along  with  details  of  variation  of  the  image  through  focus. 
Dipole  illumination  with  the  division  of  the  pattern  into  two  masks,  each  with  patterns  primarily  oriented  in  or¬ 
thogonal  directions,  was  also  investigated.  Initial  results  are  not  as  promising  as  the  IIL  strategy,  but  more  investiga¬ 
tion  is  required.  These  simulations  show  that  45-nm  half-pitch  lithography  is  indeed  possible  by  extension  of  193- 
nm  tools  to  water  immersion.  Much  more  investigation  is  required  to  optimize  the  imaging  over  the  range  of  patterns 
required  for  actual  circuits,  and  to  add  OPC  to  provide  more  robust  results.  Experimental  confirmation  of  the  simula¬ 
tions  using  micro-stepper  platforms  is  also  an  urgent  need. 
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ABSTRACT 

Initial  volume  manufacturing  of  the  16-  to  22-nm  half-pitch  integrated  circuit  node  is  targeted  for  the  year  2018. 
Lithography  is  under  tremendous  pressure  to  extend  its  capabilities  to  meet  this  deadline.  Recently,  immersion  li¬ 
thography,  particularly  using  water  as  an  immersion  fluid  at  193  nm,  has  attracted  much  attention  as  a  promising 
optical  lithography  extension.  However  based  on  simple  optical  bandwidth  considerations,  193-nm-based  optical 
lithography  alone  will  not  have  the  bandwidth  necessary  for  printing  the  22-nm  half-pitch  mode  with  any  foresee¬ 
able  combination  of  immersion  liquids  and  conventional  resolution  enhancement  techniques  (RETs).  The  approach 
to  reaching  this  node  presented  here  is  to  combine  all  available  RETs  with  processing  nonlinearities  and  spatial- 
frequency  doubling  using  two  photoresist  layers  and  an  image  storage  layer.  Appropriate  combinations  of  multiple 
exposures/processes  can  access  the  22-nm  node;  thus  reaching  current  end-of-roadmap  values  for  half-pitch  while 
retaining  the  current  193-nm  lithography  infrastructure.  A  detailed  simulation  (PROLITH™  8)  study  of  this  ap¬ 
proach  is  reported. 


1.  INTRODUCTION 


The  relentless  demands  of  Moore’s  law  on  lithography  are  well  known.  The  semiconductor  industry  has  been  ag¬ 
gressively  moving  to  print  ever-decreasing  feature  size,  imposing  tremendous  pressure  on  lithography  to  extend  its 
present  capability.  By  2018,  at  the  culmination  of  current  industry  roadmap,  volume  manufacturing  of  the  16-  to  22- 
nm  half-pitch  node  will  be  needed.1  Immersion  lithography,  particularly  using  water  as  an  immersion  fluid  at  193 
nm,  has  recently  emerged  as  the  industry  consensus  for  the  next  major  optical  lithography  extension.2"5  Taking  ad¬ 
vantage  of  a  water  ( n  =  1.44  at  193  nm)6  immersion  lens  with  NA=  nsinO  of  up  to  1.3  (1.44x0.9),  the  45-nm  half¬ 
pitch  node  will  be  directly  accessible.7  The  resolution  limits  of  optical  lithography  are  often  discussed  in  the  context 
of  the  famous  Raleigh  criterion: 


CD  =  k.— 
1  NA 


(1) 


where  CD  is  the  critical  dimension  or  smallest  printed  feature  (line  or  space).  The  parameter  Ki  depends  on  process 
specifications,  but  there  is  a  hard  limit  at  a  Ki  of  0.25  (assuming  equal  lines/spaces)  for  a  single  photoresist  layer, 
corresponding  to  the  highest  spatial  frequency  of  2NA/2  available  from  the  optical  system.  Resolution  enhancement 
techniques  (RETs)8  including:  phase  shift  masks  (PSM),  off-axis  illumination  (OAI),  optical  proximity  correction 
(OPC),  and  imaging  interferometric  lithography  (IIL)  can  be  used  to  reduce  the  value  of  k1?  approaching  the  theo¬ 
retical  limit  of  0.25.  These  RETs  are  not  mutually  exclusive  and  can  be  combined  to  address  the  image  specific  is¬ 
sues.  For  water  immersion  this  gives  a  limiting  spatial  frequency  response  of  37  nm,  validating  the  claim  that  the  45- 
nm  half  pitch  node  should  be  accessible  for  these  parameters.  As  shown  below  for  a  substantial  extrapolation  to  an 
index  of  2,  simple  scaling  gives  a  limiting  pitch  of  26  nm,  allowing  sufficient  bandwidth  for  the  32-nm  half-pitch 
node.  Continuing  this  scaling,  an  index  of  2.9  is  required  for  the  22-nm  half-pitch  node.  These  are  clearly  aggressive 
goals  for  the  liquid  index,  and  alternative  techniques  for  enhancing  resolution  need  to  be  investigated  in  parallel. 

In  this  paper,  we  present  an  approach  to  extend  optical  lithography  beyond  the  22-nm  node  by  combining  process 
nonlinearities  with  a  spatial  frequency  doubling  step.9,10  The  basic  concept  is  based  on  the  fact  that  the  fundamental 
single-image  limit  in  optical  lithography  is  on  pattern  period  (e.g.  on  pattern  density),  not  on  the  individual  CDs 
(linewidths).  As  a  result  of  nonlinear  processes  in  the  exposure,  development  and  process  transfer  steps,  CDs  can  be 
much  smaller  than  half  the  smallest  available  period.  As  an  example,  oxygen  plasma  thinning  of  photoresist  lines  is 
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already  an  accepted  process  in  advanced  microprocessor  manufacturing.  If  the  results  of  such  a  nonlinear  imaging 
process  are  stored  in  a  sacrificial  layer  on  the  sample,  and  a  second  similar  imaging  step  is  performed  and  interpo¬ 
lated  with  the  first  image,  a  factor  of  two  extension  in  the  maximum  available  spatial  frequency  is  achieved.  In  prin¬ 
ciple,  this  nonlinear  interpolation  can  be  carried  out  multiple  times,  leading  to  the  conclusion  that  there  are  no  fun¬ 
damental  limits  to  optical  lithography.  In  practice,  both  the  image  fidelity  demands  and  the  image  placement  re¬ 
quirements  are  increased,  leading  to  the  obvious  corollary  that  there  are  increasingly  challenging  process  and  manu¬ 
facturing  issues  that  need  to  be  addressed. 

Simulations  were  carried  out  with  Prolith™  Version  8  (KLA-Tencor)  in  full  vector  electromagnetic  mode  and  with 
the  full  physical  model  for  the  photoresist  (PEB  diffusion,  3D  etching,  etc.).  Substrates  were  bare  silicon  with  a  bot¬ 
tom  anti-reflection  coating  (BARC)  ( n  =  1.7059  +  i0.335)  and  a  positive  photoresist  ( n  =  1.703  +  i0.0172;  Dill  pa¬ 
rameters  A  =  0,  B=  1.122  pm"1,  C  =  0.068  cm'2  /mJ).  Standard  post-exposure  bake  (110°C  for  60  s)  and  develop 
conditions  (60  s)  were  used  for  all  simulations. 


2  IMMERSION  IMAGING  INTERFEROMETRIC  LITHOGRAPHY  (I3L)  TO  THE  32-NM 

HALF-PITCH  NODE 

Imaging  interferometric  lithography  (IIL)  is  a  relatively  new  concept11  for  accessing  the  fundamental  linear  systems 
resolution  limits  of  optics,  which  uses  off-axis  illumination  at  the  limits  of  the  pupil  along  with  pupil-plane  filters  to 
control  the  image  fidelity.  I3L  at  193-nm  with  water  as  immersion  fluid  ( NA  =  1.3)  will  access  the  45-nm  half-pitch 
node. 

New  liquids  are  also  being  studied  to  extend  the  refractive  index  to  1.6  or  beyond,  compared  with  1.44  for  water.  If 
we  take  the  limit  of  practical  optical  systems  as  sin 6=  0.9  and  therefore  NA  =  nflui(N 0.9,  it  is  relevant  to  ask  what  NA 
is  required  to  reach  the  next,  32-nm  half-pitch,  node  at  193  nm.  PROLITH  simulation  results  of  arbitrary  dense  pat¬ 
tern  with  CDs  down  to  32-nm  using  IIL  exposures  at  193  nm  assuming  NAs  up  to  1.8  and  corresponding  spatial  fre¬ 
quency  coverage  are  shown  in  the  Lig.  1.  Lig.  1(a)  shows  the  resulting  spatial  frequency  coverage  of  five-exposure 
IIL  with  the  limiting  frequency  of  IN  AIN  The  polarization  is  controlled  in  each  of  the  exposures  as  indicated  in  the 
figure  to  ensure  a  dominant  TE  polarization,  especially  for  the  highest  spatial  frequencies.  The  scaling  is  as  ex¬ 
pected,  it  takes  a  factor  of  1.4  improvement  in  NA  to  print  a  factor  of  0.7  smaller  CD.  Continuing  this  scaling  to  the 
22  nm  node  requires  a  probably  unrealistic  liquid  index  of  2.9. 


Lig.l  Modeling  of  IIL.  (a)  shows  the  pupil  filters  with  frequency-space  coverage.  The  gray  shad¬ 
ings  are  simply  to  denote  the  frequency  coverage,  the  polarization  of  the  individual  exposures  is 
indicated,  (b),  (c),  and  (d)  show  the  final  real-space  images. 


3  EFFECT  OF  LITHOGRAPHIC  PROCESS  NONLINEARITIES  ON  THE  RESOLUTION 

The  minimum  half  pitch  available  to  a  lithography  tool  is  given  by  the  well-known  equation  A  min  half  =  2/  4^7 sin 0, 
where  2  is  the  exposure  wavelength,  n  is  the  refractive  index  of  the  immersion  medium  and  0  is  the  interfering  angle 
of  the  two  beams.  These  limits  are  on  the  half  pitch,  not  on  the  CD.  Using  nonlinear  properties  of  the  photoresist 
and  post-development  processing,  which  is  already  standard  industry  practice,  much  smaller  CDs  can  be  realized9. 


The  higher  harmonics  added  by  the  process  nonlinearities  extend  optical  lithography  beyond  the  linear  system  limits 
imposed  by  diffraction.  However,  process  nonlinearities  do  not  increase  the  pattern  density.  Since  the  harmonics  are 
in  phase  with  the  fundamental  components,  they  do  not  lead  to  increased  density  in  the  printed  pattern  and  the  ef¬ 
fects  are  limited  to  the  reduction  of  linewidth  and  to  improved  sidewall  slopes.  Figure  2  shows  a  sinusoidal  aerial 
image  and  corresponding  developed  photoresist  pattern. 


* 


Developed  pattern: 

y 

V 

/ 

ULIULL  _L 

_L 

■3p  -2p  -p  o  P  2p  3p 


CD=50  nm,  period=100  nm 


Fig.  2  Resist  nonlinearity  adds  higher  harmonic  spatial  frequencies.  The  developed  photo¬ 
resist  pattern  (immersion  result  at  213  nm  courtesy  of  A.  Raub;  paper  26  this  volume)  ex¬ 
hibits  nearlv  vertical  sidewalls,  corresponding  to  this  higher  frequencv  content. 


A  one  dimensional  example  of  the  spatial  frequency-doubling  concept  is  illustrated  in  Fig. 3.  The  top  row  of  Fig. 3 
shows  a  photoresist  pattern  on  top  of  a  sacrificial  layer  above  a  substrate.  The  period  of  the  pattern  is  limited  by  op¬ 
tics  to  A/2 NA;  the  CD,  on  the  other  hand,  is  not  limited  by  the  optics  and  is  chosen  to  be  less  than  A/8 NA.  This  can 
be  achieved  by  manipulating  the  higher  harmonic  content  of  the  image  and  does  not  violate  any  optical  limits.  In 
Fig.  3b  the  photoresist  pattern  has  been  transferred  into  the  sacrificial  layer  using  standard  pattern  transfer.  A  second 
photoresist  layer  has  been  spun,  exposed  and  developed  as  shown  in  the  Fig.  3c. 
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Fig.  3  Sequence  of  steps  to  double  the  ID  pattern  density  by  spatial  period  division 


The  period  and  CD  are  the  same  as  in  the  first  case,  but  the  pattern  has  been  shifted  by  half  the  pitch  (k/4NA)  to  in¬ 
terpolate  the  lines  of  the  second  pattern  midway  between  the  lines  of  the  first  pattern.  Finally,  this  pattern  is  also 
transferred  into  the  sacrificial  layer  (Fig. 3d).  The  sacrificial  layer  with  the  combined  pattern  is  then  used  as  a  etch 
mask  to  transfer  the  total  pattern  into  the  substrate.  Subsequently  the  sacrificial  layer  is  removed  leaving  a  final  pat¬ 
tern  with  a  pitch  of  A/4  NA,  twice  as  dense  as  the  single  exposure  optical  limit. 


For  this  simple  case,  the  mathematics  is  trivial.  The  aerial  image  for  a  two-beam  interferometric  exposure  at  a  period 
p  is  simply: 


7(jc)  =  2 


1  +  COS 


r  2nx^ 


\  f  y 


(2) 


Following  the  nonlinear  processes  in  the  exposure,  development  and  pattern  transfer  the  result  is  a  simple  Fourier 
cosine  series: 
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where  the  nonlinear  transformation  is  indicated  by  JV  and  a  is  the  pattern  CD.  The  final  amplitude  coefficients  de¬ 
pend  on  the  process  variables.  The  addition  of  the  second  pattern,  at  the  same  period  as  the  first  but  with  a  half¬ 
period,  p/2,  lateral  shift  is  just: 


Pl&2=.N[/l(*M[/2(*)]  =  I 


sin 


nna 


nna 


p  , 


cos 


2  7rnx 


■(-i  TL 


sin 


nna 


-|A 


nna 


cos 


2nnx 


sm 


2nn'a 


(4) 


2nn'a 


cos 


4  nnyx 


where  the  (-1)"  factor  arises  from  the  lateral  shift  of  the  second  pattern.  As  a  result  of  this  factor,  the  odd  terms  in  the 
original  sirvc/x  summations  cancel,  the  even  terms  survive  and  the  result  is  a  sirvc/x  expansion  at  twice  the  pitch  - 
e.  g.  spatial-frequency  doubling.  Schematically,  this  result  is  shown  in  Figure  4.  The  Fourier  coefficients  are  shown 
for  a  simple  grating  with  lines  parallel  to  the  x-axis.  The  Fourier  coefficients  of  the  second  exposure  are  identical 
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Figure  4:  Graphical  representation  of  a  simple  frequency  doubling  process. 


except  for  the  phase  shift  associated  with  the  lateral  translation.  Unfilled  circles  indicate  negative  values.  Finally  the 
summation  gives  a  grating  at  twice  the  spatial  frequency.  The  circles  with  radius  2NA/X  indicate  the  frequency  space 
limits  of  optics.  Note  that  the  final  grating  period  cannot  be  produced  by  a  single  optical  exposure  since  its  funda¬ 
mental  frequency  is  greater  than  the  optical  system  bandpass. 


4  COMBINATION  OF  PROCESS  NONLINEARITIES  WITH  RETs 

The  general  concept  of  utilizing  nonlinear  processes  in  the  multiple  exposure  and  transfer  method  is  to  print  a  dif¬ 
fraction  limited  image  in  a  single  step;  transfer  this  image  to  a  sacrificial  layer,  and  repeat  this  sequence  one  or  more 
times  accumulating  a  composite,  interpolated  image  in  the  sacrificial  layer  which  is  subsequently  used  as  an  etch 
mask  to  transfer  the  complete  pattern  into  the  underlying  circuit  layers.  A  Prolith  simulation  of  this  process  for  the 
test  pattern  introduced  above  for  a  dense  CD  of  22  nm  (A/6n)  with  a  water  immersion  NA  of  1.3  and  an  exposure 
wavelength  of  193  nm  is  shown  in  Fig. 5.  Two  different  masks  are  used  with  each  having  every  other  feature  of  the 
composite  pattern.  The  smallest  period  for  each  exposure  is  90  nm,  corresponding  to  the  single  exposure  frequency 
space  limit  at  these  parameters.  In  the  modeling,  the  resist  exposure  dose  is  increased  to  give  a  CD  of  22  nm,  rather 
than  the  45 -nm  CD  for  a  equal  line: space  pattern.  The  mask  CD  is  unchanged  in  this  process,  so  the  mask  making 
requirements  are  not  more  difficult  than  for  the  45 -nm  node.  In  the  simulation,  the  sacrificial  layer  was  simply  the 
final  photoresist  pattern  that  was  stored  in  the  computer.  In  an  actual  process,  a  thin  sacrificial  layer  such  as  a  nitride 


Fig.  5  Simulation  for  interpolating  two  images,  each  near  the  single  exposure  limit  of  optics  to  provide  a  final  pat¬ 
tern  at  twice  the  usual  optics  limits.  The  modeling  is  for  22-nm  dense  CD  at  a  wavelength  of  193  nm  with  an  im¬ 
mersion  NA  of  1.3. 


or  polymer  layer  would  be  required.  This  layer  must  be  robust  to  removal  of  the  photoresist  and  recoating  and  must 
have  sufficient  resolution  to  store  the  multiple,  interpolated  images. 

It  is  critical  that  the  nonlinearities  to  be  applied  separately  to  each  exposure.  If  multiple  exposures  are  accumulated 
in  a  single  resist  layer  before  the  nonlinearity  is  applied  [equivalent  to  JV(/i+/2)  rather  than  JV(/i)+JV(/2)in  Eq.  4],  the 
patterns  merge  and  enhanced  resolution  is  not  realized.  The  top  row  shows  the  results  of  two  exposures  and  their 
combination  using  an  straightforward  pair  of  WYSIWYG  masks  without  any  OPC.  The  line  edge  ripple  is  increased 
substantially  from  the  45  nm  node  case  shown  in  Fig.  lc).  This  is  due  to  the  increased  exposure  uniformity  require¬ 
ments  associated  with  the  overexposure  as  discussed  below.  The  second  row  shows  improved  results  with  the  addi¬ 
tion  of  some  optical  proximity  correction  (OPC)  mask  features  (see  discussion  below).  The  two  bottom  left  figures 
show  the  aerial  image  frequency  content  for  the  first  exposure,  limited  to  2NA/A  by  the  transmission  limit  of  optics, 
and  the  final  frequency  content  of  the  image  stored  in  the  sacrificial  layer  after  the  nonlinear  exposure,  development 
and  pattern  transfer  steps.  The  middle  column  shows  similar  results  for  the  second  exposure  (top:  WYSIWYG  im¬ 
age;  middle:  image  with  OPC;  and  bottom:  Fourier  content  in  aerial  image  and  following  nonlinearity  and  pattern 
transfer.  The  rightmost  column  shows  the  combined  image  results  (top:  WYSIWYG;  middle:  with  OPC;  bottom: 
Fourier  content  of  composite  image.)  Note  that  just  as  was  the  case  for  the  simple  grating  example,  the  fundamental 
frequency  components  are  out  of  phase  and  largely  cancel  leaving  the  higher-frequency  second-harmonic  compo¬ 
nents  as  the  dominant  spectral  features  in  the  composite  image.  Note  again,  that  these  second  harmonic  frequencies 
are  not  transmitted  by  the  optics,  but  arise  solely  from  the  nonlinearities  in  the  exposure,  develop,  and  pattern  trans¬ 
fer  steps. 

These  results  imply  that  the  process  control  issues  will  be  more  significant  at  these  high  exposure  levels  as  the 
linewidth  variation  for  a  small  change  in  intensity  becomes  more  severe.  This  is  explained  in  the  Fig. 5  which  shows 
the  aerial  image  intensity  profiles  along  the  cuts  indicated  by  the  dotted  lines  in  the  left  hand  images  of  the  Fig.4, 
passing  through  two  vertical  "ell  bars". 


Fig.  6  The  aerial  image  intensity  profiles  along  the  solid  lines  in  the  lefthand  images  of  Fig.  4  (see  text). 


Because  the  amplitudes  of  the  two  lines  are  not  precisely  equal,  there  is  a  relative  narrowing  of  the  second  line  in  the 
final  developed  pattern.  This  is  more  pronounced  at  the  higher  exposure  level  (indicated  by  the  lower  dashed  hori¬ 
zontal  line  labeled  Threshold  (CD  =  A J6n))  than  at  the  lower  exposure  level  (A/3n)  that  is  used  for  single  exposure, 
half-pitch  lithography  (e.g.  Fig.  1).  This  sensitivity  is  the  reason  for  the  enhanced  sidewall  ripple  in  the  no  OPC  case 
in  fig.  5.  The  imbalance  between  the  exposures  for  these  lines  can  be  addressed  by  widening  of  the  mask  pattern  at 
this  point  as  shown  in  the  OPC  mask  (Fig.  6).  The  frequency  of  the  intensity  oscillation  is  set  by  the  cutoff  limit  of 
optics  and  is  the  same  for  both  lines,  so  the  intensity  (depth  of  the  null)  is  the  only  possible  adjustment,  each  line  is 
the  same  width.  If  the  threshold  is  moved  up  to  the  mid-intensity  levels,  as  it  shown  by  the  dotted  lines  labeled 
"threshold  (CD~?C3n)"  the  impact  of  these  variations  on  the  final  printed  image  is  much  less  pronounced.  For  this 
simulation,  the  definition  of  this  OPC  was  an  ad-hoc  process  of  adding  and  subtracting  bars  in  the  areas  that  exhib¬ 
ited  the  most  pronounced  deviations. 


Fig. 7:  OPC  additions  (black)  and  subtractions  (white)  to  the  masks  used  for  the  simulations. 


An  extensive  investigation  of  the  OPC  was  not  carried  out  and  a  better  result  is  certainly  available.  These  results  are, 
however,  sufficient  to  prove  the  concept.  The  message  is  that  mask  correction,  mask  CD  control,  and  process  control 
all  will  become  more  demanding  as  increasing  orders  of  nonlinear  interpolations  are  attempted. 

5  CONCLUSIONS 

There  are  no  fundamental  limits  to  optical  lithography.  There  are  only  process,  implementation  and  cost  issues  that 
will  have  to  be  weighed  against  the  similar  issues  in  new  technologies.  Spatial-frequency  doubling  after  applying 
process  nonlinearities  independently  to  each  sub-image  has  sufficient  flexibility  to  extend  immersion  193-nm  optical 
lithography  to  at  least  the  end  of  the  current  industry  roadmap.  Combining  multiple  nonlinear  exposures  requires 
transferring  each  image  to  a  sacrificial  layer  and  multiple  resist  layers.  This  represents  an  increased  processing  bur¬ 
den.  In  addition,  the  demands  on  the  mask  while  not  increasing  in  terms  of  resolution  do  increase  in  terms  of 
linewidth  control  and  the  need  for  OPC.  In  principle,  these  nonlinear  interpolation  techniques  can  be  applied  multi¬ 
ple  times.  In  practice,  manufacturing  considerations  will  impose  a  limit  that  can  only  be  adequately  evaluated  in 
comparison  with  alternative  technologies. 
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Liquid  immersion  lithography  (LIL)  extends  the  resolution  of  optical  lithography  to  meet  industry 
demands  into  the  next  decade.  Through  the  use  of  exposure  media  such  as  purified  water  ( n  of  1 .44 
at  193  nm),  it  is  possible  to  reduce  minimum  pitches  compared  with  traditional  air/vacuum 
exposures  media  by  a  factor  of  as  much  as  44% — a  full  technology  node.  Beyond  this  simple 
observation,  there  is  a  good  deal  of  work  necessary  to  fully  understand  the  impact  of  LIL  immersion 
lithography  on  a  lithography  processes.  This  article  addresses  the  impact  of  water  immersion  on  the 
imaging  capabilities  of  different  resist  formulations.  All  resists  were  evaluated  by  imaging  dense 
line- space  structures  at  a  65 -nm  half-pitch  both  in  air  and  with  water  immersion.  Studies  of  dense 
65 -nm  lines  made  by  immersion  imaging  in  HPLC  grade  water  with  controlled  variations  in  resist 
components  were  performed.  Significant  differences  were  observed  and  will  be  discussed.  ©  2004 
American  Vacuum  Society.  [DOI:  10.1116/1.1824951] 


I.  INTRODUCTION 

As  the  microelectronic  industry  pushes  towards  smaller 
feature  sizes,  ever  greater  demands  are  made  on  lithography. 
Current  lithographic  approaches  are  being  pushed  to  practi¬ 
cal  limits,  while  next  generation  lithography  technologies  are 
still  under  development.  Recently,  interest  has  grown  dra¬ 
matically  in  exploring  the  use  of  a  high-index  liquid- 
immersion  medium  between  the  exit  face  of  the  lens  and  the 
photoresist  surface  to  further  extend  optical  lithography.1-7 
The  use  of  immersion  media  to  extend  optical  resolution  is 
well  known  from  oil-immersion  optical  microscopy.  Through 
the  use  of  immersion  media,  such  as  deionized  water  with 
ft  —  1.44  at  193  nm  and  1.42  at  213  nm,7,8  smaller  minimum 
feature  sizes  can  be  achieved  than  are  possible  in  air.  This 
can  be  thought  of  either  as  an  effective  wavelength,  Xeff 
=  \0/n~  134  nm,  or  alternatively  as  an  increase  to  an  effec¬ 
tive  NAeff=ft  sin  0 ^  1.44.  Thus,  the  use  of  water  immersion 
lithography  could  potentially  extend  193-nm  wavelength  op¬ 
tical  lithography  by  44%,  twice  the  resolution  enhancement 
offered  by  the  problematic  transition  to  157  nm.  Liquid  im¬ 
mersion  lithography  provides  an  improvement  in  depth  of 
held  for  a  given  NA.  At  a  fixed  resolution,  DOF 
=  k2\0/ft  sin2(#).9  An  air  exposure  system  with  a  NA  of  0.80 
will  have  a  smaller  DOF  than  an  immersion  exposure  system 
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with  the  same  NA.  Application  of  immersion  techniques  at 
157  nm  would  have  a  similar  impact  should  a  suitable  liquid 
be  identified. 

There  are  some  issues  that  need  to  be  addressed  before 
immersion  lithography  can  be  used  in  manufacturing.  One 
concern  is  the  interaction  between  the  immersion  fluid  and 
the  last  lens  element.  For  instance  deionized  water  (DI  H20) 
will  etch  calcium  fluoride  (CaF2)  lens  elements,  so  a  protec¬ 
tive  coating  will  be  needed.  The  focus  of  this  paper  is  the 
interaction  between  the  immersion  fluid  and  the  resist-coated 
wafer.  With  liquid  immersion  lithography  there  are  two  new 
interactions  of  concern:  first  uptake  of  the  immersion  liquid 
into  the  resist  and  its  potential  impact  on  resolution;  and 
second  desorption  of  resist  components  into  the  immersion 
liquid  and  possible  contamination  of  the  lens  surface.  Both 
of  these  processes  can  change  the  photoresist  chemistry  and 
the  optical  properties  of  the  immersion  liquid. 

This  paper  presents  exposure  results  for  immersion  lithog¬ 
raphy  at  a  wavelength  of  213  nm,  close  to  the  current 
193-nm  exposure  wavelength.  A  maskless, 
interferometric-lithography  approach  that  gives  fringes 
with  essentially  ideal  contrast  was  used.10  Exposures  were 
carried  out  on  a  number  of  resists  with  controlled  varia¬ 
tions  in  resist  components.  Developed  resist  patterns  of 
dense  65-nm  lines  obtained  from  the  resist  exposed  in  air 
and  in  HPLC  grade  water  were  studied.  Significant  differ- 
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Fig.  1.  Michelson  interferometer  immersion  exposure  setup. 


Two  resist  manufacturers,  TOK  and  AZ  Clariant,  provided 
resists  for  which  the  chemical  components  and  makeup  were 
fully  disclosed.  TOK  provided  six  different  resists.  These 
resists  were  formulated  using  combinations  of  two  polymers 
and  three  bases.  AZ  Clariant  provided  eight  samples  of  resist 
formulated  using  combinations  of  two  polymers,  two  PAGs, 
and  two  bases. 


III.  EXPERIMENTAL  RESULTS 


ences  related  to  resist  formulations  were  observed  and  are 
discussed.  The  use  of  a  protective  coating  for  the  resist 
was  also  studied  and  is  discussed. 


II.  EXPERIMENTAL  SETUP 

The  light  source  was  a  Coherent  Infinity  40-100  Nd:YAG 
pulsed  (~3  ns)  laser  followed  by  nonlinear  crystals  to  gen¬ 
erate  a  fifth-harmonic  beam  at  a  wavelength  of  213  nm.  This 
source  has  a  much  higher  coherence  length  (—0.6  m)  than 
that  of  a  typical  193-nm  excimer  laser,  which  greatly  simpli¬ 
fies  the  experimental  arrangement  for  interferometric  lithog¬ 
raphy.  193-nm  resists  generally  have  comparable  absorption 
at  193-  and  213-nm  and  work  well  at  this  wavelength.  A 
Michelson  interferometer  with  a  coupling  prism  was  de¬ 
signed  and  constructed  for  performing  the  immersion  expo¬ 
sures.  Figure  1  shows  a  diagram  and  a  picture  of  the  immer¬ 
sion  exposure  setup.  The  light  was  coupled  into  the 
immersion  liquid  using  an  isosceles  prism.  The  top  faces  of 
the  prism  were  antireflection  coated.  A  drop  of  water  is 
placed  between  the  bottom  of  the  prism  and  the  resist  coated 
wafer  (thickness  —1  mm).  Both  laboratory  deionized  water 
and  HPLC  water  were  used  with  comparable  SEM  cross- 
section  resist  line  profile  results. 

The  period  of  the  grating  exposed  in  the  resist  is  given  by 
A  =  X/2-^1-sin(^1)  =  X/2*^2_sin(^2)-  Here,  nx  and  n2  de¬ 
note  the  indices  of  the  prism  and  immersion  liquid,  and 
and  02  are  the  corresponding  propagation  angles  with  respect 
to  the  wafer  normal;  as  drawn  nx  is  larger  than  n2.  The  grat¬ 
ing  period  is  independent  of  the  refractive  index  of  the  plane- 
parallel  liquid  layer:  the  change  in  wavelength  according  to 
X  =  X0/n  is  exactly  canceled  by  the  change  in  6  due  to  refrac¬ 
tion  at  both  the  glass-liquid  and  the  liquid-resist  interfaces. 
In  practice  the  resist  has  a  higher  index  than  the  lens,  and  it 
is  desirable  for  the  glass  and  liquid  indices  to  be  approxi¬ 
mately  equal  in  order  to  minimize  interface  reflections. 


A.  TOK  resist  liquid  immersion  imaging  results 

The  TOK  resists  were  based  on  acrylic  polymer  back¬ 
bones.  Three  of  the  resists  ILP01,  ILP02,  and  ILP03  were 
formulated  with  an  acrylic  ter-polymer,  and  the  other  three: 
ILP04,  ILP05,  and  ILP06  with  an  acrylic  tetra-polymer.  All 
six  resists  used  the  same  photoacid  generator  (PAG),  tri- 
phenylsufonium  nanofluorobutanesulfonate  (TPS-Nf).  Three 
different  base  quenchers  were  used.  Resists  ILP01  and  ILP04 
used  the  small  molecular- size  base,  tri-ethanolamine.  Resists 
ILP02  and  ILP05  used  the  medium  sized  base  tri- 
pentylamine.  Resists  ILP03  and  ILP06  used  the  large  sized 
base  tri-octylamine.  Table  I  summarizes  the  matrix  of  TOK- 
ILP  resist  formulations.  All  resist  were  spun  at  a  thickness  of 
100  nm  unless  noted  otherwise,  atop  a  32-nm  thick  Brewer 
Science  ARC28-4  antireflective  coating  optimized  to  give  a 
minimum  substrate  reflection  at  213  nm  for  65-nm  half-pitch 
dense  patterns. 

7.  Image  comparison  of  resist  formulations 

The  three  acrylic  ter-polymer  resists  (ILP01,  ILP02,  and 
ILP03)  gave  reasonable  patterns  in  both  air  and  water.11  The 
small  tri-ethanolamine  base,  used  with  the  acrylic  tetra- 
polymer  resists  (ILP04),  gave  good  pattern  results  only  in  air 
and  exhibited  microbridging  and  pattern  collapse  when  ex¬ 
posed  in  HPLC  H20.  This  suggests  that  the  base  has  a  more 
significant  interaction  with  the  immersion  water  when  incor¬ 
porated  into  the  acrylic  tetra-polymer  resist  than  in  the 
acrylic  ter-polymer  resist,  which  did  give  reasonable  line 
profiles.  The  other  two  resists  based  on  the  acrylic  tetra- 
polymer  however  did  result  in  good  patterns  in  the  immer¬ 
sion  exposures.11  When  the  samples  were  exposed  in  HPLC 
H20  they  exhibited  lessened  top  rounding  than  for  the  cor¬ 
responding  air  exposures,  and  the  smaller  the  base  the  less 
rounded  the  profile.  The  samples  using  the  acrylic  ter- 
polymer  gave  slightly  better  results  than  the  samples  made 
from  the  acrylic  tetra-polymer  when  exposed  in  HPLC  H20. 


Table  I.  TOK  resist  formation  matrix. 


Base  quencher 

Tri-ethanolamine  (Bs) 

Tri-pentylamine  (Bm) 

Tri-octylamine  (BL) 

Polymer 

backbone 

Acrylic  ter-polymer 
Acrylic  tetra-polymer 

TOK-ILP01 

TOK-ILP04 

TOK-ILP02 

TOK-ILP05 

TOK-ILP03 

TOK-ILP06 
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Fig.  2.  TOK-ILP03  resist  exposures  through  half-pitch. 

The  best  TOK  resist  for  immersion  lithography  was  TOK- 
ILP03  closely  followed  by  TOK-ILP06  both  containing  the 
large  tri-octylamine  base. 

2.  Evaluation  through  half -pitch 

Using  one  of  the  better  TOK  resists  (TOK-ILP03),  the 
resist  profiles  were  evaluated  as  the  half-pitch  was  decreased 
until  the  resist  was  unable  to  resolve  the  dense  line-space 
structure.  Figure  2  shows  the  results  as  the  half-pitch  was 
decreased  from  64  nm  to  45  nm  in  HPLC  H20.  The  dense 
line- space  profiles  of  the  images  are  square  and  well  defined 
down  to  a  ~50-nm  half-pitch.  As  the  half-pitch  was  reduced 

Table  II.  AZ  clariant  resist  formulation  matrix. 


to  45  nm,  the  dense  features  exhibited  severe  micro-bridging 
and  pattern  collapse.  The  usable  resolution  extends  to 
~50-nm  lines  and  spaces  for  the  TOK-ILP03  resist  under  the 
present  exposure  and  process  conditions. 

B.  AZ  Clariant  resist  liquid  immersion  imaging  results 

AZ  Clariant  provided  eight  resist  formulations.  Four  of 
the  resists  used  a  COMA  hybrid  polymer  T2030  backbone 
and  the  remaining  four  resists  used  an  acrylic  polymer  T518 
backbone.  Half  of  the  resists  used  a  large  molecule  TPS-NF 
PAG  similar  to  the  TOK  resist,  and  the  other  half  used  a 
small  molecule  TPS-Tf  PAG.  AZ  Clariant  also  used  two  base 
quenchers,  a  large  molecule  tri-octylamine  base  and  a  small 
di-ethanolamine  base.  Table  II  summarizes  the  matrix  of  AZ 
Clariant  resist  formulations.  All  resists  were  spun  on  a 
32-nm  thick  Brewer  Science  ARC28-4  antireflective  coating 
optimized  to  give  a  minimum  substrate  reflection  for 
65 -nm  half-pitch  dense  patterns. 

1.  Image  comparison  of  resist  formulations 

There  were  issues  of  pattern  collapse  in  the  hybrid  poly¬ 
mer  T2030  with  a  resist  thickness  of  130  nm,  and  without  the 
use  of  a  rinse  surfactant.  Beyond  the  pattern  collapse  issue, 
there  was  noticeable  T-toping  of  the  line  profiles  exposed  in 
air.  For  the  large-PAG,  small-base  resist  formulation  pattern¬ 
ing  issues  were  clearly  evident;  the  smaller  PAG  gave  better 
profiles  for  the  air  exposures.  In  the  HPLC  H20  exposure, 
the  small-PAG  resist  formed  a  skin  and  once  the  exposure 
dose  was  significant  enough  for  the  developer  to  clear  this 
surface  layer,  the  resist  lines  were  completely  overexposed 
and  developed  away.  This  suggests  that  the  small  PAG  is 
substantially  leaching  away  from  the  surface  of  the  resist  into 
the  water.  Again  ignoring  the  pattern  collapse  issues,  the  re¬ 
sist  with  the  large  PAG  and  large  base  gave  the  best  results  in 
HPLC  H20  for  the  hybrid  T2030  polymer  resists. 

As  with  the  T2030  polymer  resist  there  were  issues  of 
pattern  collapse  in  the  Acrylic  polymer  T518  resist,  as  the 
120-nm  thickness  exceeded  the  desired  100  nm.  The  AZ 
Clariant  acrylic  resists  all  imaged  well  in  air  similar  to  the 
TOK  acrylic  resist,  again  ignoring  pattern  collapse  issues. 
The  resist  with  the  smaller  PAG  had  reduced  T-toping  and 
gave  better  results  in  air.  In  the  HPLC  H20  exposures,  ig¬ 
noring  pattern  collapse,  the  large  PAG  had  slightly  more 
T-toping  than  the  small  PAG,  and  no  substantial  difference 
between  the  small  and  large  base  resist  images. 


PAG 

Base  quencher 

TPS-Tf  (Ps) 

TPS-Nf  (PL) 

Di-ethanolamine 

(Bs) 

Tri-octylamine 

(BL) 

Di-ethanolamine 

(Bs) 

Tri-octylamine 

(BL) 

Hybrid  polymer 

T2030  Ps  Bs 

T2030  Ps  BL 

T2030  PL  Bs 

T2030  PL  BL 

Polymer 

T2030 

backbone 

Acrylic  polymer 

T518  Ps  Bs 

T518  Ps  BL 

T518  PL  Bs 

T518  PL  BL 

T518 
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45  nm  (H20) 


Fig.  3.  AZ  Clariant  EXP  M3000-50  resist  exposures  through  half-pitch. 


Fig.  4.  Impact  of  soak  times  on  131-nm  pitch  TOK-ILP06  resist. 


Based  on  the  limited  sample  space,  it  seems  that  the 
acrylic  resists  are  more  suitable  for  liquid  immersion  lithog¬ 
raphy.  The  size  of  the  PAG  resulted  in  noticeable  differences 
for  both  the  hybrid  and  acrylic  polymer  resists.  The  best  AZ 
Clariant  resist  was  the  acrylic  polymer  T518  with  the  smaller 
PAG  and  large  base. 


2.  Evaluation  through  half-pitch 

Using  an  improved  acrylic  AZ  Clariant  resist  (EXP 
M3000-50A)  spun  100-nm  thick,  resist  profiles  were  evalu¬ 
ated  as  the  half-pitch  was  decreased  until  the  resist  was  un¬ 
able  to  resolve  the  dense  line  and  space  structure.  Figure  3 
shows  the  resist  profiles  as  the  half-pitch  is  decreased  from 
64  nm  to  45  nm  in  HPLC  H20.  Similar  to  the  TOK  acrylic 
resist,  the  AZ  Clariant  resist  dense  line-space  profiles  were 
also  square  and  well  defined  down  to  a  50-nm  half-pitch  in 
HPLC  H20.  However  the  line  edge  roughness  was  more  pro¬ 
nounced  for  the  AZ  Clariant  resist.  The  AZ  Clariant  resist 
also  exhibited  severe  micro-bridging  and  pattern  collapse  as 
the  half-pitch  was  reduced  to  45  nm. 


C.  Impact  of  resist  soak  times 

Using  one  of  the  best  performing  TOK  resists  (TOK 
ILP06),  the  impact  of  sample  soak- times  in  HPLC  water  was 
investigated.  First  a  sample  was  exposed  as  soon  as  it  was 
immersed  in  the  water  and  dried  off  right  after  the  exposure 
followed  by  a  postexposure  bake,  for  a  maximum  water  con¬ 
tact  time  of  less  than  30  s.  A  second  sample  was  soaked  in 
the  water  for  10  min  before  it  was  exposed  and  immediately 
dried  off  and  postbaked.  A  third  sample  was  exposed  as  soon 
as  it  was  immersed  in  the  water  and  then  soaked  in  the  water 
for  10  min  before  it  was  dried  off  and  postbaked.  Figure  4 
shows  the  exposure  results  for  these  three  samples. 

The  10-min  presoak  sample  had  slightly  better  exposure 
latitude  and  uniformity,  which  was  evaluated  across  the  full 
wafer  and  cannot  be  seen  in  the  figure.  Regardless  of  soak 
time  in  the  water  all  samples  gave  similar,  good  line  profiles. 
It  is  believed  that  from  this  and  other  cited  work  that  any 
interactions  between  the  resist  and  water  take  place  in  a  very 
short  time  (less  than  1  s). 

D.  Impact  of  top  surface  protection 

One  suggested  technology  to  isolate  the  resist  from  the 
immersion  liquid  is  the  use  of  a  top- surface-protection  (TSP) 


J.  Vac.  Sci.  Technol.  B,  Vol.  22,  No.  6,  Nov/Dec  2004 


3463 


Raub  et  al.:  Imaging  capabilities  of  resist  in  deep  UV  liquid 


3463 


barrier  layer.  The  barrier  layer  needs  to  be  transparent  to  the 
exposure  light  and  to  be  inert  to  both  the  immersion  liquid 
and  the  resist.  TOK  has  formulated  a  fluorocarbon  spin-on 
top-protective  coating  TSP-03A,  which  meets  these  require¬ 
ments.  Two  additional  steps  are  added  to  the  processing.  First 
after  resist  spin-coating  and  postapplication  bake,  an  addi¬ 
tional  spin-on  and  bake  sequence  is  used  to  apply  the  TSP- 
03 A  material  atop  the  resist.  Second  after  the  postexposure 
bake,  the  TSP-03A  coating  is  removed  using  a  special  sol¬ 
vent,  TSP-3  remover,  before  the  resist  development. 

Three  TOK  resists  (TOK-ILPOl,  TOK-ILP03,  and  TOK- 
ILP06)  were  exposed  in  air  with  and  without  the  TSP-03A 
coating.  For  the  samples  with  the  protective  coating,  the  de¬ 
veloped  resist  line  shapes  are  very  similar  to  those  for 
samples  without  the  coating  when  exposed  in  air.  Similarly 
the  same  three  resists  were  exposed  in  HPLC  H20  with  and 
without  the  TSP-03A  coating.  The  samples  with  the  protec¬ 
tive  coating  look  practically  the  same  as  the  samples  exposed 
in  air.  However  the  samples  exposed  in  HPLC  H20  without 
any  top  coating  have  noticeably  squarer  line  profiles  with 
reduced  rounding.  This  would  suggest  that  the  top  surface 
protection  barrier  is  preventing  an  interaction  between  the 
resist  and  water.  Other  experiments  suggest  that  the  photo¬ 
acid  generated  upon  exposure  is  leached  out  of  the  top  few 
monolayers  of  the  resist  into  the  water.12,13  This  would  also 
support  the  observed  squaring  of  the  line  profiles  for  the 
immersion  exposure,  as  the  top  of  the  resist  becomes  less 
sensitive  as  compared  to  the  bulk  of  the  resist. 

E.  Impact  of  a  rinse  surfactant 

One  of  the  issues  with  imaging  65 -nm  half-pitch  lines  is 
that  at  aspect  ratios  larger  than  2.5  there  is  frequently  com¬ 
plete  pattern  collapse.  This  is  due  to  the  impact  of  the  water 
surface  tension  as  it  dries  unevenly  between  the  lines  after 
the  development  rinse.  The  simplest  way  to  reduce  this  pat¬ 
tern  collapse  is  to  reduce  the  surface  tension  of  the  water  by 
adding  a  surfactant.  At  a  65 -nm  half-pitch  pattern  collapse 
due  to  adhesion  tension  will  not  occur  if  the  resist  thickness 
is  at  or  below  100  nm,  however  at  150  nm  and  larger  there  is 
often  pattern  collapse.  At  a  45 -nm  half-pitch  with  100-nm 
thick  resist,  the  pattern  collapse  was  severe  using  a  DI  H20 
rinse.  This  pattern  collapse  was  not  noticeable  with  a 
surfactant/water  rinse,  but  microbridging  was  still  observed 
due  to  the  resolution  limits  of  the  resist.  A  much  more  con¬ 
trolled  investigation  of  pattern  collapse  limits  at  these  scales 
is  needed  to  understand  the  limitations  and  control  strategies 
associated  with  pattern  collapse  issues. 

F.  Evaluation  of  contact  holes 

Using  the  same  acrylic  AZ  Clariant  resist  (EXP  M3 000- 
50A)  spun  at  a  100-nm  thickness,  dense  contact  holes  were 
exposed  in  the  resist  and  the  profiles  were  evaluated.  Contact 
holes  are  achieved  by  performing  two  sequential  exposures 
where  the  sample  is  rotated  by  90°  between  the  two  expo¬ 
sures.  This  technique  allows  for  using  TE  polarized  light  in 
both  the  v-  and  y-directions  of  the  contact  hole  pattern.  Fig¬ 
ure  5  shows  the  65 -nm  diameter  dense  contact  holes  array  in 


Exposed  in  DI  H20  (top-down) 


Fig.  5.  129-nm  pitch  contact  holes  in  AZ  Clariant  EXP  M3000  resist. 

the  AZ  Clariant  resist.  Work  continues  to  optimize  this  pro¬ 
cess  and  to  understand  and  reduce  the  hole-to-hole  variations 
evident  in  these  first  exposures. 

IV.  SUMMARY 

The  work  in  this  paper  has  demonstrated  that  45-nm  half¬ 
pitch  lines  can  be  patterned  into  resist  using  liquid  immer¬ 
sion  lithography  (LIL)  with  a  NA=  1.18  and  \  =  213  nm.  This 
would  allow  the  extension  of  193-nm  lithography  by  almost 
two  technology  nodes. 

Some  but  not  all  existing  193-nm  resists  work  well  in  LIL 
immersion  lithography.  Based  on  these  preliminary  results,  it 
seems  likely  that  193-nm  LIL  immersion  resists  can  be  for¬ 
mulated  with  fully  comparable  performance  to  today’s 
193-nm  air-exposed  resists.  The  acrylic  polymer  resist  plat¬ 
form  performed  well  in  LIL  immersion  lithography.  It  is  also 
clear  that  PAGs,  bases,  and  polymers  combinations  play  an 
important  role.  Studies  are  needed  to  find  an  optimal  set  of 
PAGs  and  bases  for  a  given  polymer  system  to  be  used  as  a 
193-nm  LIL  immersion  resist.  These  studies  will  need  to  be 
performed  for  contact  holes,  dense  lines,  and  isolated  lines, 
as  the  resist  performance  tends  to  differ  between  them. 

Resist  top  surface  protection  coatings  work  well  with 
193-nm  resist  and  water  immersion  liquid.  Although  a  top 
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surface  protection  layer  does  prevent  resist  water  interaction, 
there  can  be  interactions  between  resist  and  the  protection 
layer.  Still  there  are  resists  that  perform  well  when  exposed 
in  contact  with  water  and  provide  a  simpler  process  flow. 

As  LIL  liquid  immersion  lithography  enables  the  printing 
of  smaller  lines  and  spaces,  pattern  collapse  due  to  adhesion 
tension  during  post  develop  wafer  drying  will  become  in¬ 
creasingly  important.  Rinse  surfactants  should  reduce  pattern 
collapse  issues,  however  resist  thicknesses  likely  will  con¬ 
tinue  to  decrease,  posing  issues  for  subsequent  pattern  trans¬ 
fer  steps.  The  problem  of  pattern  collapse  needs  further  study 
to  quantify  the  limit  of  resist  aspect  ratios  for  specific  resist 
systems.  A  resist  with  a  larger  Young’s  modulus  will  help 
combat  these  issues.  Evaluation  of  rinse  surfactants  and  pos¬ 
sible  developer  surfactant  needs  to  be  performed  at  the  LIL 
immersion  lithography  technology  nodes. 

All  the  exposure  made  in  this  article  used  the  optimal  TE 
polarization.  This  might  not  be  the  case  for  an  imaging  im¬ 
mersion  lithography  system,  and  polarization  effects  are  go¬ 
ing  to  become  very  important  for  hyper-NA  imaging.  These 
effects  need  to  be  quantified  along  with  their  corresponding 
impacts  on  resist  profiles.  The  exposures  also  used  HPLC 
grade  water.  Although  purified  water  with  a  refractive  index 
of  1 .44  has  been  identified  as  the  preferred  liquid  immersion 
medium,  the  resolution  limits  of  193-nm  lithography  can  be 
further  improved  if  a  viable  liquid  is  found  with  an  even 
higher  refractive  index. 


No  major  issues  were  found  concerning  the  resist- liquid 
interaction  for  liquid  immersion  lithography.  There  are 
known  solutions  to  deal  with  the  interactions  between  the 
resist  and  the  water  immersion  liquid.  Resists  can  be  engi¬ 
neered  and  developed  for  45 -nm  half-pitch  dense  lines  in 
LIL  immersion  lithography  and  beyond. 
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Imaging  interferometric  lithography,  combining  off-axis  illumination,  multiple  exposures  covering 
different  regions  of  spatial  frequency  space,  and  pupil  plane  filters  to  ensure  uniform 
frequency- space  coverage,  is  a  relatively  new  imaging  concept  that  provides  an  approach  to 
accessing  the  fundamental,  linear- systems-resolution  limits  of  optics.  With  an  air  medium  between 
the  lens  and  the  wafer,  the  highest  spatial  frequency  available  with  244-nm  exposure  tool  with  a 
numerical  aperture  of  0.9  corresponds  to  a  half-pitch  of  68-nm.  Allowing  for  —  10%  subbands  above 
this  central  frequency,  this  suggests  that  — 75-nm  half-pitch  patterns  should  be  accessible.  A  22  X 
reduction  imaging  interferometric  lithography  testbed  demonstration  of  printing  a  non-periodic 
(arbitrary)  86-nm  half-pitch  pattern  is  reported.  This  result  was  achieved  with  a  simple 
chrome-on-glass  mask  without  the  use  of  any  mask-based  resolution-enhancement  techniques  such 
as  phase-shift  or  optical  proximity  correction.  Scaling  this  result  to  a  193  nm  wavelength  and  an 
immersion  numerical  aperture  of  1.3  directly  addresses  the  45-nm  half-pitch  node.  ©  2004 
American  Vacuum  Society.  [DOI:  10.1116/1.1821505] 


I.  INTRODUCTION 

The  tremendous  growth  of  the  microelectronics  industry 
has  enabled  today’s  digital  revolution.  A  driving  force  has 
been  the  steady  improvement  in  semiconductor  manufactur¬ 
ing  technology,  especially  in  lithographic  pattern  definition, 
enabling  the  continuous  increase  in  the  functionality  and 
speed  of  integrated  circuits.  Moore’s  Law  ’  has  both  pre¬ 
dicted  and  driven  the  remarkable  advances  in  increased  chip 
area,  decreased  feature  size  and  improved  circuit  design.  The 
National  Technology  Roadmap  for  Semiconductors,  devel¬ 
oped  by  the  Semiconductor  Industry  Association  (SIA), 
serves  as  the  industry  standard  version  of  Moore’s  Law.3  The 
current  industry  roadmap,  looking  forward  to  the  65 -nm 
technology  node  by  2007  and  45-nm  technology  node  by 
2010,  demands  a  decision  on  lithographic  technology  for 
these  nodes  in  the  near  future. 

A  number  of  new  or  next-generation  lithography  tech¬ 
nologies  are  being  developed  to  meet  the  SIA  roadmap  de¬ 
mands.  Some  of  the  technologies  being  considered  as  alter¬ 
natives  to  traditional  optical  lithography  include  extreme 
ultraviolet  (EUVL),  electron  projection,  and  nanoimprint 
lithographies.  While  these  NGL  technologies  hold  promise,  it 
is  unclear  which,  if  any,  will  become  the  prevailing  technol¬ 
ogy  for  mass  production  in  the  future,  especially  given  the 
near-term  requirements  of  the  roadmap.4 

Additional  attention  is  being  placed  on  extensions  of  op¬ 
tical  lithography.  One  particularly  promising  technique  is  liq¬ 
uid  immersion  lithography5  that  allows  an  increase  in  the 
resolution  of  up  to  a  factor  of  the  liquid  refractive  index 
(^h2o=1-44  at  193  nm).6  This  represents  a  44%  resolution 
improvement,  larger  than  the  28%  wavelength  reduction 
from  248-  to  193-nm,  or  the  problematic  23%  reduction  from 
193-  to  157-nm.  It  is  therefore  important  to  investigate  the 
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ultimate  resolution  capabilities  of  optical  lithography  and  to 
investigate  imaging  techniques  that  offer  practical  ap¬ 
proaches  to  reaching  those  ultimate  limits. 

Interferometric  lithography  provides  a  simple  technique 
that  allows  access  to  features  approaching  the  resolution  lim¬ 
its  of  optics.7,8  Integration  of  interferometric  lithography 
with  conventional  optical  lithography9  offers  an  optics-based 
lithographic  capability  for  at  least  the  45-nm  half-pitch 
generation.10 

The  minimum  half-pitch  feature  interferometric  lithogra¬ 
phy  can  access  is  given  by  the  well-known  equation, 
Ahaif-min =  k  / An  sin  0 ,  where  X  is  the  exposure  wavelength,  n 
is  the  refractive  index  of  the  immersion  medium  and  the  two 
beams  propagate  at  angles  of  ±0  with  respect  to  the  wafer 
surface  normal.  For  244-nm  exposures  in  air  (n=1.0)  and  a 
maximum  NA=rc  sin  6  of  0.9,  this  minimum  half-pitch  is 
68-nm.  For  an  imaging  system,  the  ultimate  resolution  does 
not  reach  this  level  as  a  result  of  the  need  to  capture  the 
information  in  the  pattern-dependent  sidebands  around  the 
fundamental  frequency.  Assuming  that  a  bandwidth  of 
—  10%  around  the  center  frequency  is  needed  for  a  typical 
pattern,  a  system  with  these  capabilities  will  be  able  to  print 
— 75-nm  half-pitch  patterns.  There  remains  the  issue  of  real¬ 
izing  this  resolution  in  the  face  of  the  nonlinear  properties  of 
optical  imaging;  optical  proximity  correction  and  phase-shift 
masks  are  well-studied  directions  for  enhancing  the  resolu¬ 
tion  of  an  lithographic  image. 

Imaging  Interferometric  Lithography  (IIL),  using  off-axis 
illumination  at  the  limits  of  the  pupil  along  with  pupil-plane 
filters,  to  ensure  a  uniform  frequency- space  coverage,  is  a 
relatively  new  imaging  concept  that  provides  an  approach  to 
accessing  the  fundamental  linear  system  resolution  limits  of 
optics.  The  resolution  limits  of  conventional  optical  lithogra¬ 
phy  reflect  the  low-pass  spatial-frequency  characteristies  of 
the  imaging  system.  IIL  eases  many  of  the  constraints  that 
limit  the  capabilities  of  conventional  optical  lithography  and 
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offers  a  path  to  the  ultimate  linear  systems  which  limits  of 
optics  extends  to  dense  CDs  of  —X/3NA  (NA -n  sin  6  for 
immersion)  for  arbitrary  patterns.  Significant  benefits  of  IIL, 
especially  for  high-NA  optics,  include  the  ability  to  optimize 
the  polarization  for  the  highest  spatial  frequencies,  and  to 
vary  relative  intensities  and  contrast  of  the  multiple  images 
separately  in  the  x-  and  y-directions  for  2D  mask  patterns. 

II.  IIL 

IIL  was  introduced9,11  as  a  technique  to  cover  the  ex¬ 
tremes  of  frequency  space  by  tilting  the  illumination  as  far  as 
possible  off-axis.  With  the  NAs  of  —0.6  available  at  that 
time,  this  required  tilts  that  were  beyond  the  limits  of  the 
imaging  pupil  and  consequently  reintroduction  of  a  zero- 
order  beam  on  the  image  side  of  the  lens.  The  first  experi¬ 
ments  showed  a  successful  result  for  the  2-pm  CD  using  X 
=  364  nm  and  NA=0.04  equivalent  to  a  Rayleigh  kx  of 
0.22. 12  The  next  step  towards  higher  NAs  and  smaller  fea¬ 
tures  was  made  using  \  =  364  nm  and  NA= 0.356  with  the 
principals  of  IIL  applied  in  only  one  direction  because  of  the 
difficult  optical  system  resulting  from  the  small  working  dis¬ 
tance  of  the  lens.  A  CD  of  250  nm  was  achieved,  equivalent 
to  a  —  0.25. 13 

With  today’s  very  high  NA  lenses  of  sin  0—0.9,  it  is  suf¬ 
ficient  to  set  the  offset  to  the  largest  value  allowed  by  the 
lens  NA  and  to  retain  the  zero-order  beam  within  the  lens, 
eliminating  the  need  for  an  around-the-lens  interferometer. 
Thus,  IIL  is  similar  to  quadrupole  off-axis  illumination  with 
a  small  partial  coherence  for  each  off-axis  illumination 
beam,  and  with  quadrupole  oriented  along  the  principal  (x,  y) 
axes  of  a  Manhattan  geometry  pattern  to  maximize  the  extent 
of  the  spatial  frequency  capture  (as  in  dipole  illumination, 
but  with  only  a  single  mask).  The  x-  and  y-axis  spatial  fre¬ 
quencies  are  covered  with  separate  off-axis  illumination  ex¬ 
posures.  As  NAs  are  increased  even  further  with  the  use  of 
liquid  immersion  techniques,  control  over  the  light  polariza¬ 
tion  will  become  a  critical  issue.  TE  polarization  retains  full 
contrast  while  the  contrast  of  TM  is  reduced  and  phase 
shifted  at  higher  angles.10  The  choice  of  proper  polarization 
for  the  high  spatial  frequency  components  in  orthogonal  di¬ 
rections  dramatically  improves  the  resist  patterns. 

Quadrupole  off-axis  illumination  inherently  emphasizes 
low  frequencies  at  the  expense  of  high  frequencies  because 
each  of  the  exposures  redundantly  covers  the  same  low- 
frequency  information,  while  the  high  frequencies  are  cov¬ 
ered  only  in  individual  exposures.  Pupil  plane  filters  can  be 
used  to  eliminate  these  multiple  coverages  and  thereby  pro¬ 
vide  a  uniform  transfer  function  and  an  improved  image.14 

In  this  paper,  we  present  our  initial  results  at  demonstrat¬ 
ing  IIL  at  high  NAs  and  at  feature  sizes  approaching  the 
ultimate  limits  outlined  above.  The  experimental  testbed  uses 
a  244-nm,  cw  Ar-ion  laser  source  and  a  0.9  NA  lens.  The 
optical  system  includes  a  relayed  imaging  system  pupil  plane 
on  the  mask  side  of  the  imaging  lens  to  allow  insertion  of 
pupil  plane  filters.  As  detailed  below,  our  first  results  dem¬ 
onstrate  resolution  for  arbitrary  patterns  to  at  least  an  86-nm 
half  pitch  (/r/— 0.32)  with  a  simple  binary  mask  without  the 


Fig.  1.  Schematic  layout  of  the  experimental  IIL  tool  with  244-nm  laser 
source,  0.9  NA  Cheetah  objective  lens  and  real-time  focusing  system. 

need  for  either  optical  proximity  correction  or  phase-shift 
mask-based  resolution  enhancement  techniques.  Scaling  this 
result  to  193-nm  and  to  an  immersion  NA  of  1.3  directly 
gives  the  45 -nm  half-pitch  node. 

III.  EXPERIMENTAL  SETUP 

A  0.5-W,  cw,  intracavity  doubled,  Ar-ion  laser  at  244  nm 
was  the  illumination  source  for  the  IIL  exposures.  A  cylin¬ 
drical  lens  was  used  to  correct  for  the  frequency-doubling- 
induced  astigmatism  and  to  get  a  spatial  distribution  close  to 
TEM00.  A  polarizer  and  a  half-wave  plate  were  used  to  con¬ 
trol  the  beam  intensity.  A  second  half-wave  plate  is  used  to 
control  the  polarization  of  the  beam  for  the  various  IIL  ex¬ 
posures.  A  pair  of  steering  mirrors  allows  control  of  the  off- 
axis  angle-of-illumination  of  the  mask  for  the  various  expo¬ 
sures.  A  schematic  of  the  experimental  layout  is  shown  in 
Fig.  L 

The  key  element  of  the  optical  system  is  the  0.9  NA 
“ cheetah ”  lens  (TROPEL).  Lens  specifications  are  presented 
in  Table  I.  The  imaging  pupil  plane  inside  the  lens  is  inac¬ 
cessible.  Therefore,  a  set  of  transform  lenses  is  used  to  relay 
the  pupil  plane  to  an  accessible  position  for  the  insertion  of 
pupil  plane  filters.  The  lens  is  catadioptric  with  central  ob¬ 
scuration  of  15%  of  the  NA.  Specifically,  this  means  that 
coherent  illumination  at  normal  incidence  is  not  possible  be- 

Table  I.  Specifications  for  the  0.9  NA  TROPEL  Cheetah  lens. 


Trade  name  Cheetah  244 


NA 

0.9 

Field  pm 

1.00 

Wavelength  (nm) 

244 

Pupil  diameter  (mm) 

3.6 

Tube  length 

Infinity  -  custom  tube  lens 

Working  distance  (mm) 

1.1 

Telecentric  (deg.) 

0.1  -  object  side 

Transmission  (%) 

75  -  normal 

Optical  material 

Excimer  Si02 

Obscuration  (%) 

15  -  linear 

Cemented  surfaces 

None 

Internal  foci 

None 

Diameter  (mm) 

72  -  maximum 

Length  (mm) 

72  -  maximum 

Mass  (g) 

1200 
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Fig.  2.  Schematic  layout  of  the  22 X  reduction  telecentric  imaging  geometry 
showing  imaging  and  illumination  rays  and  relative  positions  of  lenses  and 
relayed  planes. 


cause  the  lens  blocks  the  zero-order  beam.  Thus,  an  off-axis 
strategy  must  be  adopted  for  the  low-frequency  portions  of 
the  image  as  well  as  for  the  high  frequencies  in  the  IIL  spec¬ 
tral  decomposition. 

An  optical  schematic  of  the  22  X  reduction  imaging  sys¬ 
tem  is  shown  in  Fig.  2.  The  optical  system  was  simulated 

2 

with  standard  ray-tracing  software.  The  full  100  X  100  jmm 
field-of-view  at  the  wafer  plane  has  a  wavefront  flatness  of 
0.82  waves.  There  are  several  addition  elements  in  the  setup 
that  are  associated  with  visual  feedback  of  the  image  focus. 
The  depth  of  focus  for  the  high-frequency  IIL  exposures  is 
only  ~200-nm,  requiring  precise  z-positioning  of  the  wafer; 
since  only  a  single  level  is  printed,  there  is  no  x-y  position¬ 
ing  requirement.  Wafer  flatness  is  not  an  issue  for  the  small 
lens  field-of-view.  For  visual  feedback  of  the  image  focus  we 
built  an  optical  system  associated  with  the  beamsplitter  cube 
in  the  primary  optical  path  that  provides  a  real-time  view  of 
the  image,  i.e.,  magnifies  the  reflection  of  the  mask  image 
from  the  photoresist  surface  back  through  the  Cheetah  and 
projects  the  image  onto  a  CCD  camera.  Since  the  optical 
system  onto  the  camera  can  accommodate  an  out-of-focus 
image,  it  is  necessary  to  simultaneously  image  the  reflected 
pattern  as  well  as  artifacts  on  the  photoresist  surface. 

The  mask  patterns  are  Manhattan  structure  patterns 
[dense/isolated  nested  ells  with  a  large  box  (10  CD)  to 
present  a  range  of  spatial  frequencies].  The  binary  chrome- 
on-glass  mask  does  not  incorporate  any  optical  proximity 
correction  (OPC)  or  phase  shift  mask  (PSM)  features.  As  a 
result  of  the  22  X  reduction,  the  mask  fabrication  was  rela¬ 
tively  straightforward  and  posed  no  special  issues. 

IV.  EXPERIMENTAL  PROCESS 
A.  Pupil  plane  filters 

PROLITH™  simulations  were  used  prior  to  the  experi¬ 
ment  to  provide  insight  into  resolution  and  exposure  strate¬ 
gies.  PROLITH  can  simulate  the  IIL  image  process  using 
custom  sources  and  pupil  plane  filters  and  different  polariza¬ 
tions.  From  the  simulation,  the  optimal  parameters  can  be 
obtained  as  evaluated  from  the  areal  image  and  the  devel¬ 
oped  photoresist  patterns.  Figure  3(a)  shows  a  simulation 
result  for  a  3 -exposure  IIL  for  a  CD  of  86-nm.  Figure  3(b) 
shows  the  pupil  plane  filters  that  were  used  for  this  simula¬ 
tion.  The  filter  for  the  high  x- spatial  frequencies  includes  a 


(a)  18*  i 

O  •  Q 

Fig.  3.  PROLITH™  simulations  of  IIL  were  used  prior  to  the  experiment  to 
provide  insight  into  resolution  and  exposure  strategies,  (a)  3-exposure  IIL 
for  a  CD  of  —86  nm,  ends  of  the  lines  are  well  defined,  isolated  lines  are 
cleared;  and  (b)  the  pupil  plane  filter  used  in  the  off-axis  exposures  (left  and 
right  filters)  and  low-frequency  exposure  (center  filter). 


small  hole  at  the  edge  of  the  pupil  that  transmits  the  zero- 
order  beam.  In  the  experiment,  the  filter  is  put  in  place  and 
the  off-axis  tilt  is  adjusted  to  ensure  that  the  zero-order  beam 
is  cleanly  transmitted  through  the  0.5-mm  hole  (a=  0.003).  A 
very  small  size  is  required  in  order  to  maximize  the  spatial 
frequency  content  of  the  image.  The  maximum  spatial  fre¬ 
quency  transmitted  through  the  optical  system  with  this  filter 
in  place  is  2NA/X  and  corresponds  to  the  68-nm  half  pitch 
discussed  earlier.  The  black  regions  of  the  spatial  filter  block 
the  low-frequency  spectral  components  of  the  image  because 
these  components  would  otherwise  be  covered  in  both  the 
high  x-frequency  (left  filter)  and  high  y-frequency  exposures 
(right  filter)  and  would  therefore  be  overrepresented  in  the 
final  image.  The  “batwings”  at  45°  from  the  zero-order  ap¬ 
erture  represent  frequencies  that  are  covered  in  one  of  the 
high  frequency  exposures  and  blocked  in  the  other.  The  small 
circular  regions  around  each  zero-order  aperture  eliminate 
the  low  spatial  frequencies  that  are  covered  in  the  low- 
frequency  (center  filter)  exposure.  The  zero-order  beam  is 
directed  through  the  center  of  the  aperture  in  this  exposure. 
The  hole  is  offset  to  allow  for  the  central  lens  obscuration. 
The  same  coverage  could  be  accomplished  with  only  two 
exposures  by  extending  the  transparent  “vee”  regions  all  the 
way  to  the  zero-order  aperture.  There  are  two  reasons  that 
the  present  scheme  was  chosen.  First,  the  alignment  of  the 
two  apertures  becomes  critical  if  a  two-exposure  scheme  is 
chosen  since  there  are  strong  spectral  components  in  the  low 
frequency  region  that  must  be  captured  for  a  faithful  image. 
This  is  not  critical  in  the  present  scheme  since  there  are  no 
strong  spectral  components  at  the  transitions  from  blocked  to 
open  in  the  present  filters.  Second,  for  the  “vee”  geometry 
(and  for  conventional  imaging)  the  strong  spectral  compo¬ 
nents  at  low  frequencies  and  those  corresponding  to  the 
dense  lines  and  spaces  interfere  with  each  other  resulting  in 
strong  autocorrelation  (dark-field)  components,  which  can 
distort  the  final  image.  In  the  present  scheme  these  compo¬ 
nents  are  separated  into  different  exposures  and  the  un¬ 
wanted  interference  terms  are  completely  eliminated.  After 
simulating  many  different  pupil  plane  filters  geometries,  we 
found  that  this  set  of  filters  gives  the  best  imaging  results  for 
the  present  pattern. 
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Fig.  4.  SEM  of  PR  lines  with  a  CD  of  ~86-nm,  demonstrating  that  the 
optical  system  has  the  expected  spatial  frequency  bandwidth.  Those  lines 
were  exposed  using  a  mask  and  a  single  off-axis  exposure  without  any  pupil 
plane  filter. 

B.  Photoresist/ARC  stack 

Because  of  the  relatively  small  depth-of-focus  and  the 
small  pattern  CD,  the  ARC  and  PR  thickness  have  to  be  very 
carefully  optimized.  Optimal  thicknesses  as  obtained  from 
the  PROLITH  simulation  are:  ARC  (90+8)-nm,  PR 
(100+10)-nm.  The  bottom  ARC  was  DUV  64-3  from 
Brewer  Science.  The  photoresist  was  DUV  210-0.6  from 
Shipley  Corp.  The  DUV  210-0.6  photoresist  was  diluted  with 
the  ES  solvent  11  for  a  final  thickness  of  —  110-nm  after 
spinning. 

C.  Exposures 

The  exposure  process  starts  with  insertion  of  the 
v-direction  pupil  plane  filter  and  setting  the  focal  position 
using  a  reduced  intensity  beam  and  the  real-time  focusing 
system.  At  this  point  the  mask  and  wafer  are  fixed  relative 
to  each  over ;  no  further  adjustment  of  the  positions  is  al¬ 
lowed.  Then  the  first  exposure  is  carried  out,  the  pupil  plane 
filter  is  rotated  by  90°  (to  provide  the  y -pupil  filter)  and  the 
second  exposure  is  carried  out.  Then  the  low  frequency  pupil 
plane  filter  is  inserted  and  the  third  exposure  is  made.  A 
postexposure  bake  at  130  °C  for  1-min  on  a  vacuum  chucked 
hot  plate,  followed  by  a  45 -s  CD-26  puddle  develop  com¬ 
pletes  the  processing 

V.  EXPERIMENTAL  RESULTS 

A  preliminary  set  of  exposures  with  periodic  grating  pat¬ 
terns  was  carried  out  to  assess  the  optical  system  resolution 
capability.  Figure  4  shows  the  resulting  photoresist  pattern 
for  a  80-nm  half-pitch  grating,  demonstrating  that  the  optical 
system  has  the  expected  spatial-frequency  bandwidth. 

A  scanning  electron  micrograph  SEM  of  the  developed 
PR  pattern  for  3 -exposure  IIL  for  the  244-nm  wavelength 
with  0.9  NA  lens  at  CD  —  86-nm  is  shown  in  Fig.  5(a).  This 
was  done  using  the  1.9-pm  CD  mask.  The  hole  in  the  middle 
of  the  square  is  due  to  overexposure.  A  PROLITH™  simu¬ 
lation  for  an  overexposed  image  shows  similar  results  [see 
Fig.  5(b)].  The  disappearance  of  the  corner  of  the  fifth  line  is 
due  to  overexposure  as  well  as  possible  flare;  work  is  under¬ 
way  to  improve  the  pattern  definition.  Also  deviation  from 
the  optimum  ARC  and  PR  thicknesses  can  cause  similar 


Fig.  5.  (a)  SEM  of  developed  PR  for  3-exposure  IIL  for  a  CD  of  ~  86-nm  at 
a  244-nm  wavelength  and  0.9  NA.  Isolated  lines  and  the  square  edge  of  the 
box  are  well  defined,  dense  lines  are  cleared;  (b)  modeling  of  3-exposure  IIL 
for  a  CD  of  —86  nm  for  an  overexposed  condition;  shows  the  same  hole  in 
the  middle  of  the  square  implying  that  this  feature  on  the  pattern  is  due  to 
exposure  issues  not  to  the  imaging. 

problems.  The  image  has  well-defined  line  extensions  (iso¬ 
lated  lines);  the  dense  lines  are  clear  and  well  separated 
which  indicates  that  the  imaging  system  is  performing  well. 
The  defects  in  the  features  are  due  only  to  photoresist  and 
exposure  issues,  not  to  the  IIL  imaging  capabilities. 

VI.  CONCLUSION 

The  244-nm  IIL  setup  has  demonstrated  a  capability  of 
printing  patterns  with  a  CD  compatible  with  industrial  needs. 
We  have  achieved  —  86-nm  half-pitch  features  at  a  244-nm 
wavelength  with  0.9  NA  without  any  additional  mask-based 
RETs  (PSM  or  OPC).  The  implication  is  that  IIL  can  mini¬ 
mize  the  use  of  mask-based  RETs,  thereby  reducing  mask 
cost  and  improving  cost  of  ownership  (COO).  This  conclu¬ 
sion  is  supported  by  a  recent  simulation  analysis10  modeling 
the  printing  of  similar  pattern  at  the  45 -nm  half-pitch  node 
using  immersion  (NA— 1.3)  and  a  193-nm  source  without 
the  use  of  any  mask-based  RETs.  Comparing  the  simulation 
results  and  the  present  PR  patterns  shows  that  IIL  is  capable 
of  printing  small  patterns.  The  defects  on  the  photoresist  pat¬ 
tern  are  associated  with  photoresist  and  exposure  issues,  not 
with  the  resolution.  Introducing  the  real-time  focusing  tech¬ 
nique  was  a  critical  step  that  dramatically  improved  the  re¬ 
sults  of  the  experiment.  Control  over  the  polarization  was  not 
critical  at  these  dimensions  but  it  will  be  very  important  as 
the  CD  is  scaled  further  with  immersion  techniques. 

The  size  of  the  printed  pattern  can  be  scaled  using  the 
simple  relation  CDoc\/NA.  In  the  present  experiment  we 
have  achieved  —  86-nm  with  a  244-nm  source  and  a  0.9-NA 
lens.  Switching  to  a  193-nm  source  and  a  1.3  immersion  NA 
leads  directly  to  a  — 47-nm  half-pitch — again  without  the 
need  for  any  mask-based  RETs  such  as  phase  shift  and  OPC. 
Simulation  predicts  that  the  resolution  of  the  present  system 
extends  to  — 75-nm.  This  would  seales  to  a  —41 -nm  half¬ 
pitch  for  the  193-nm  immersion  system. 

In  the  future  we  plan  to  optimize  the  exposure  parameters 
for  the  3 -exposure  IIL  and  evaluate  the  resolution  limit  of  the 
system  using  IIL  using  both  simulation  and  experiment.  We 
would  like  to  print  different  patterns  on  the  current  setup 
such  as  via  holes.  Apodization  of  the  pupil  plane  filters  can 
be  used  to  improve  image  quality.15  Scaling  to  the  45-nm 
half-pitch  node  will  require  use  of  193-nm  source  and  an 
immersion  exposure  medium  in  order  to  achieve  1.15/1.3 
NA  systems.  With  this  extended  testbed,  polarization  and 
hyper-NA  effects  will  be  investigated. 
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In  situ  real-time  monitoring  of  profile  evolution  during  plasma  etching 
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We  have  employed  attenuated  total  reflection  Fourier  transforms  infrared  spectroscopy 
(ATR-FTIRS)  to  monitor  the  profile  evolution  of  patterned  mesoporous,  low-dielectric-constant 
Si02  films  in  situ  and  in  real  time  during  plasma  etching.  A  stack  of  patterned  photoresist, 
anti-reflective  coating,  and  mesoporous  Si02  is  etched  in  an  inductively  coupled  plasma  reactor, 
using  CHF3  and  Ar.  During  etching,  the  IR  absorbance  of  Si-O-Si  stretching  modes  near  1080  cm-1 
decreases,  and  the  rate  of  decrease  in  Si-O-Si  absorbance  translates  to  the  Si02  removal  rate.  When 
corrected  for  the  exponentially  decaying  evanescent  electric  field,  the  removal  rate  helps  monitor  the 
profile  evolution  and  predict  the  final  etch  profile.  The  predicted  profiles  are  in  excellent  agreement 
with  the  cross-sectional  images  taken  by  scanning  electron  microscopy.  In  a  similar  approach,  we 
calculate  the  absolute  total  number  of  C-F  bonds  in  the  sidewall  passivation  and  observe  its 
formation  rate  as  a  function  of  time.  Assuming  that  the  thickness  of  the  sidewall  passivation  tapers 
down  towards  the  trench  bottom,  we  deduce  that  C-F  formation  occurs  mostly  in  the  final  stage  of 
etching  when  the  trench  bottom  meets  the  Ge  ATR  crystal  and  that  a  critical  amount  of  C-F  buildup 
is  necessary  to  maintain  the  anisotropic  etch  profile.  ©  2005  American  Vacuum  Society. 

[DOI:  10.1116/1.1865154] 


I.  INTRODUCTION 

The  integrated  circuit  (IC)  manufacturing  has  witnessed 
continuous  device  miniaturization  giving  rise  to  numerous 
engineering  challenges.  According  to  the  2003  International 
Technology  Roadmap  for  Semiconductors  (ITRS),  the  IC  de¬ 
vice  dimension  will  reach  an  18  nm  node  by  201 8. 1  For 
advanced  microprocessors  and  logic  devices,  in  particular, 
such  miniaturization  requires  a  reduction  in  the  resistance- 
capacitance  (RC)  time  constant  associated  with  metal  inter¬ 
connects  and  intermetal  dielectrics.2  The  purpose  is  to  in¬ 
crease  the  device  operating  speed  despite  the  miniaturization. 
Two  primary  solutions  exist  today  to  reduce  the  RC  constant. 
One  is  copper  metallization,  and  the  other  is  low-dielectric- 
constant  materials  insulating  the  metal  interconnects.  A  vari¬ 
ety  of  materials,  such  as  polytetrafluoro  ethylene  (PTFE),3  4 
polyimides  (PI),5-7  silsesquioxanes,8  and  mesoporous  Si02,9 
have  been  considered  as  viable  candidates  to  replace  the  con¬ 
ventional  vapor-deposition-based  Si02.  In  this  study,  we  fo¬ 
cus  on  solgel-based  mesoporous  Si02  whose  dielectric  con¬ 
stant  ranges  from  1.5  to  2.0  in  the  1-40  GHz  range.10  We 
demonstrate  that  the  patterned  etch  profile  of  Si02  film, 
along  with  the  sidewall  passivation,  can  be  monitored  in  situ 
and  in  real  time,  using  attenuated  total  reflection  Fourier 
transform  infrared  spectroscopy  (ATR-FTIRS).  The  ATR 
technique  has  been  previously  used  to  study  surface  reac¬ 
tions  during  plasma  enhanced  processes11-14  as  well  as  “re¬ 
setting”  the  wall  condition  of  commercial  plasma  reactors  for 
process  reproducibility.15  In  ah  cases,  the  ATR  technique 
provides  sub-monolayer  sensitivity  to  examine  heteroge¬ 
neous  reactions  occurring  on  semiconductor  and  insulator 
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films.16,17  We  take  advantage  of  the  high  sensitivity  to  probe 
the  patterned  surface  of  mesoporous  Si02  films  during  etch¬ 
ing.  This  nondestructive  method  eliminates  the  need  for 
cross-sectional  scanning  electron  microscopy  (XSEM)  to  ex¬ 
amine  the  etch  profile.  We  expect  that  the  ATR-FTIRS  tech¬ 
nique  and  our  analytical  approach  can  be  applied  to  other 
dielectric  films  such  as  Si3N4  and  SiOJMy  during  etching  for 
the  purpose  of  process  development.  The  wafer-level  use  of 
the  ATR  technique  in  production  tools,  however,  might  be 
limiting,  since  the  multilevel,  intermetal  dielectric  architec¬ 
ture  contains  metal  interconnects.  These  metal  lines,  whose 
pitch  is  smaller  than  the  wavelength  of  IR,  would  prevent  the 
evanescent  wave  from  propagating  through  the  dielectric 
film. 

II.  EXPERIMENT 

Figure  1  illustrates  a  top  cross-sectional  view  of  our  ex¬ 
perimental  setup.  The  stainless  steel,  tubular  plasma  chamber 
is  25  cm  in  diameter  with  multiple  view  ports.  A  turbomo- 
lecular  pump  (Alcatel  5900CP)  with  1000  //  s  pumping 
speed  maintains  the  base  pressure  at  6.7  X  10-4  Pa.  The  tur- 
bomolecular  pump  is  assisted  by  a  14  //s  double-stage  rotary 
vane  mechanical  pump  (Edwards  E2M40).  A  gate  valve  con¬ 
trols  the  conductance  to  the  turbomolecular  pump  and  main¬ 
tains  the  chamber  pressure  at  1.3  Pa  during  etching,  indepen¬ 
dent  of  reactant  gas  flow  rates.  A  mixture  of  CHF3  and  Ar  is 
introduced  to  the  chamber  through  a  10  cm  diam  circular  gas 
distribution  ring  located  12.5  cm  above  the  substrate.  During 
etching,  mass  flow  controllers  (UNIT  URS  100-5)  maintain 
the  flow  rates  of  CHF3  and  Ar  constant. 

A  “stove-top-coil”  inductive  plasma  source  is  mounted  on 
a  2.54  cm  thick,  16  cm  diam  Pyrex  window  that  separates  the 
plasma  source  from  vacuum.  A  radio  frequency  (rf)  potential 


347  J.  Vac.  Sci.  Technol.  A  23(2),  Mar/Apr  2005  0734-21 01  /2005/23(2)/347/8/$1 9.00  ©2005  American  Vacuum  Society  347 


348 


Gerung  et  a!.:  In  situ  real-time  monitoring  of  profile  evolution 


348 


Fig.  1.  Schematic  diagram  of  plasma  etch  chamber 
equipped  with  ATR-FTIRS  setup. 


at  13.56  MHz  is  applied  to  the  plasma  source  through  an 
impedance  matching  network  to  maximize  the  power  deliv¬ 
ery  to  the  plasma.  An  independent  rf  potential  at  13.516 
MHz  is  applied  to  the  substrate  holder  to  control  the  ion 
energy.  The  slight  frequency  offset  from  13.56  MHz  is 
meant  to  avoid  interference  with  the  plasma  source  fre¬ 
quency.  The  rf  power  delivered  to  the  plasma  source  is  main¬ 
tained  at  400-800  W,  while  the  rf  power  delivered  to  the 
substrate  is  maintained  at  100-150  W.  Although  the  reactor 
geometry  departs  from  commercial  reactors,  we  expect  the 
plasma  conditions  (e.g.,  density  of  reactive  neutrals  and  ions, 
ion  energy  distribution,  and  ion  angular  distribution)  to  be 
comparable  to  those  of  commercial  reactors  based  on  the 
aforementioned  rf  power  consumption  and  independent  bias. 

For  ATR-FTIRS,  a  trapezoidally  shaped  Ge(lll)  crystal  is 
mounted  on  a  metal  substrate  platen  cooled  by  chilled  water. 
The  choice  of  the  ATR  substrate  (e.g.,  GaAs  and  KRS-5)  is 
immaterial,  provided  that  the  substrate  is  IR-transparent  in 
the  mid-IR  region.  We  also  expect  the  effect  of  the  substrate 
on  etch  chemistry  to  be  minimal  prior  to  the  endpoint,  since 
reactive  radicals  and  energetic  ions  impinge  almost  entirely 
on  the  film  surface  rather  than  on  the  underlying  ATR  sub¬ 
strate  during  etching.  Figure  2  illustrates  that  the  Ge  crystal 
is  coated  with  mesoporous  Si02,  anti-reflective  coating 
(ARC),  and  fully  patterned  photoresist  (PR).  The  patterned 
PR  serves  as  an  etch  mask.  The  preparation  steps  for  these 
films  will  be  discussed  in  the  following  section. 

The  Ge  crystal  is  5  cm  long,  2  cm  wide,  and  2  mm  thick 
with  two  ends  beveled  at  45°  with  respect  to  the  substrate 
normal.  The  IR  beam  from  a  FTIR  spectrometer  (Nicolet 
Nexus  670)  is  focused  onto  a  beveled  edge.  Upon  entering 
the  crystal,  the  IR  beam  undergoes  —12  reflections  from  the 
crystal  top  surface  and  exits  the  opposite  beveled  edge.  The 
above  arrangement  renders  the  Ge  crystal  transparent  to  IR 
above  650  cm-1.  The  multiple  internal  reflections  also  sig¬ 
nificantly  enhance  the  IR  absorption  that  stems  from  the 


changes  in  the  film  on  top  of  the  ATR  substrate,  especially  in 
comparison  to  the  IR  absorption  that  stems  from  the  changes 
on  the  beveled  edges.  In  addition,  the  flat  crystal  surface  area 
is  — 10  times  greater  than  that  of  the  two  beveled  edges.  Note 
also  that  the  beveled  edges  face  away  from  the  plasma  source 
and  that  the  oxide  film,  deposited  on  the  longer  side  of  the 
ATR  crystal,  faces  the  plasma  source  (Figs.  1  and  2).  These 
factors  ensure  that  the  absorption  signal  from  the  film  is  at 
least  two  orders  of  magnitude  greater  than  that  from  the  bev¬ 
eled  edges.  The  ATR  technique  is  also  robust  against 
scratches  inadvertently  introduced  during  sample  transfer  to 
and  from  the  plasma  chamber.  We  have  previously  demon¬ 
strated  that  the  surface  corrugation,  whether  introduced  natu- 


Sidewall 


Fig.  2.  Conceptual  layout  of  a  partially  patterned  Si02  film  on  a  Ge  ATR 
crystal.  The  ATR  technique  is  used  to  measure  the  removal  rate  of  Si02  as 
well  as  the  formation  rate  of  C-F  in  the  sidewall  passivation  layer.  Patterned 
photoresist  and  anti-reflective  coating  are  stacked  on  top  of  mesoporous 
Si02  film.  The  magnified  cross-sectional  view  outlines  the  etch  front  at  the 
trench  bottom. 
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rally  or  artificially,  does  not  cause  IR  to  scatter  upon  internal 
reflection,  provided  that  the  corrugation  dimension  is  less 
than  the  IR  wavelength.22 

The  internal  reflections  create  an  evanescent  electromag¬ 
netic  field  whose  strength  decays  exponentially  away  from 
the  film-crystal  interface.  Infrared- active  dipole  moments  in 
the  etched  mesoporous  Si02  film  (e.g.,  C-F  and  Si-O-Si 
stretching  vibrational  modes)  absorb  the  evanescent  field  at 
their  characteristic  vibrational  frequencies.  For  instance,  the 
IR  absorbance  peaks  of  C-Fx  (jc=1,  2,  and  3)  and  Si-O-Si 
stretching  vibrational  modes  appear  at  1200-1700  and 
890-1195  cm-1,  respectively.14,23,24  Because  the  evanescent 
field  strength  decays  exponentially  away  from  the  film- 
crystal  interface,  the  IR  absorption  by  dipole  moments  near 
the  film-crystal  interface  is  more  pronounced  than  that  away 
from  the  interface.  The  mathematical  correction  for  the  ex¬ 
ponential  decay  will  be  described  in  Result  and  Discussion. 
The  exiting  IR  is  collimated  by  a  KBr  lens  and  then  focused 
onto  a  HgCdTe  mid-range  IR  detector  by  an  off-axis  parabo¬ 
loid  mirror.  Prior  to  etching,  a  background  spectrum  is  taken 
with  4  cm-1  resolution  averaged  over  50  scans.  During  etch¬ 
ing,  a  series  of  sample  spectra  are  collected  with  4  cm-1 
resolution.  Each  absorbance  spectrum  is  averaged  over  20 
scans  to  maintain  the  peak-to-peak  noise  level  below  0.016, 
while  achieving  the  time  resolution  of  12  s. 

Figure  2  depicts  partially  etched  parallel  trenches  created 
in  the  mesoporous  Si02  film  during  etching.  The  magnified 
view  of  a  trench  shows  how  PR,  ARC,  and  mesoporous  Si02 
film  are  stacked  on  top  of  the  Ge  ATR  crystal.  To  create  this 
stack  of  films,  the  Ge  crystal  is  first  cleaned  in  100%  ethanol 
(EtOH)  to  dissolve  organic  contaminants  on  the  surface.  The 
substrate  is  then  treated  in  air  plasma  for  2  min  to  increase 
the  adhesion  of  a  sol  that  contains  Si02  film  precursors.  The 
sol  consists  of  EtOH,  H20,  tetraethyl  orthosilicate  (TEOS), 
polyoxyethylene(lO)  cetyl  ether  (Brij  56),  and  a  trace  amount 
of  HC1.  We  prepare  the  sol  by  a  2-step  process  before  spin¬ 
coating  it  on  the  Ge  crystal  at  2000  rpm  for  20  s.  The  first 
step  is  to  create  a  stock  solution  (A2**)  by  mixing  EtOH, 
H20,  TEOS,  and  HC1  at  a  molar  ratio  of  3.8:  1:  1:  5.1 
X  10-5  and  heating  the  solution  at  333  K  for  90  min.  The 
purpose  of  heating  is  to  expedite  the  hydrolysis  of  TEOS, 
replacing  ethoxy  groups  with  hydroxyl  groups.  The  second 
step  is  to  dilute  the  prepared  A2**  stock  solution  with  addi¬ 
tional  EtOH,  H20,  and  HC1.  Brij  56  is  also  added  to  set  the 
molar  composition  of  EtOH,  H20,  TEOS,  Brij  56,  and  HC1 
at  20:  5:  1:  0.06:  4X  10-3.  The  HC1  concentration  is  chosen 
to  minimize  premature  condensation  of  TEOS  while  promot¬ 
ing  the  self-assembly  of  Brij  56  surfactant  to  form  either  a 
hexagonal  or  cubic  phase  template.25  This  template  forma¬ 
tion  occurs  concurrently  with  evaporation  of  EtOH  and  H20; 
hence,  the  template  formation  is  named  evaporation  induced 
self-assembly  (EISA).26  The  prepared  sol  is  then  spin-coated 
on  the  Ge  crystal  and  allowed  to  dry  in  the  ambient  air  (298 
K  and  30%  relative  humidity).  The  TEOS  oligomers  in  the 
sol  subsequently  condense  around  the  template  upon  calcin¬ 
ing  the  spin-coated  film  in  an  oven  filled  with  the  ambient 
air.  The  oven  temperature  is  ramped  at  1  K/min  to  673  K, 


Fig.  3.  Time-series  of  IR  absorbance  spectra  taken  during  a  2  min  etch 
process  show  both  C-F  and  Si-O-Si  stretching  vibrational  modes.  Si-O-Si 
absorbance  becomes  increasingly  negative,  indicating  its  loss,  whereas  C-F 
absorbance  becomes  increasingly  positive,  indicating  its  gain  during  the 
etch. 

maintained  at  this  temperature  for  180  min,  and  then  cooled 
down  to  the  room  temperature  (298  K)  at  the  same  rate.  The 
removal  of  organic  materials  by  calcination  results  in  meso- 
pores  Si02  matrix.  As  control  samples,  we  also  prepare  the 
mesoporous  films  on  Si(100)  under  the  same  condition. 
These  control  samples  are  used  to  measure  the  etch  rate  and 
the  post-etch  profile,  using  XSEM.  The  Si02  film  thickness 
is  measured  prior  to  coating  ARC  and  PR,  using  variable- 
angle  spectroscopic  ellipsometry  (Woolam  WVASE32).  The 
Si02  film  thickness  is  —2600  A  for  all  samples. 

The  anti-reflective  coating  (Brewers  Science  XHRi-16) 
and  photoresist  (Shipley  i300)  are  sequentially  spin-coated 
on  top  of  the  Si02  film  at  4000  rpm  for  30  s.  The  thickness  of 
ARC  and  PR  is  1600  and  8000  A,  respectively.  The  photo¬ 
resist  is  patterned  by  interferometric  lithography  (IL)  de¬ 
scribed  elsewhere.27-30  The  photoresist  is  exposed  to  the  355 
nm  line  of  a  Nd:YaG  laser  operated  at  55  mJ/pulse  for  20  s, 
baked  at  383  K,  and  then  immersed  in  a  developer  solution 
(Shipley  MF702)  for  1  min.  The  pitch  (P)  and  width  (W)  of 
the  fully  developed  PR  pattern  are  0.3  and  0.15  pm,  respec¬ 
tively.  The  underlying  ARC  is  etched  either  in  an  02-Ar 
plasma  or  in  a  CHF3-Ar  plasma,  depending  on  the  desired 
etch  profile.  The  02-Ar  etch  results  in  an  anisotropic  profile, 
whereas  CHF3-Ar  plasma  results  in  a  profile  where  the  mid¬ 
section  bows  out.  The  rf  power  delivered  to  the  plasma 
source  and  substrate  platen,  pressure,  and  flow  rates  of  02 
and  Ar  are  100  W,  50  W,  1.3  Pa,  4  seem,  and  25  seem, 
respectively. 

III.  RESULTS  AND  DISCUSSION 

Figure  3  shows  a  typical  set  of  time-series  absorbance 
spectra  taken  during  a  2  min  etch  to  completely  remove  the 
exposed  2600  A  thick  Si02  under  the  etch  mask.  The  absor¬ 
bance  by  Si-O-Si  stretching  vibrational  modes  from  890  to 
1195  cm-1  decreases  continuously  in  the  negative  direction 


JVST  A  -  Vacuum,  Surfaces,  and  Films 


350 


Gerung  et  al.:  In  situ  real-time  monitoring  of  profile  evolution 


350 


(Fig.  3).  This  observation  is  consistent  with  the  fact  that  Si02 
is  removed  during  etching.  The  pronounced  noise  level  near 
1050  cm-1  is  due  to  the  fact  that  Si-O-Si  stretching  mode  is 
a  strong  IR  absorber  and  that  the  IR  intensity  of  the  reference 
background  in  the  said  region  is  minimal  with  a  2600  A  thick 
Si02  layer. 

In  contrast  to  the  Si-O-Si  absorbance,  the  absorbance  by 
C-Fx  vibrational  modes  near  1200-1700  cm-1  increases  con¬ 
tinuously,  indicating  that  the  total  amount  of  fluorocarbon 
(FC)  deposit  increases  throughout  the  etching  process.  This 
increase  may  stem  from  fluorocarbon  accumulation  on  the 
sidewalls  ’  and/or  on  the  trench  bottom.  The  work  of  Rue- 
ger  et  al.  and  Standaert  et  al. ,  using  angle-resolved  x-ray 
photoelectron  spectroscopy  (AR-XPS),  demonstrates  that  a 
thin  layer  of  FC  exists  on  Si02  surfaces  even  under  heavy 
ion  bombardment.  Thus,  we  expect  the  FC  to  exist  both  on 
the  trench  bottom  as  well  as  on  the  sidewalls.  However,  the 
heavy  ion  bombardment  on  an  independently  biased  sub¬ 
strate  is  believed  to  maintain  the  FC  accumulation  at 
minimum.33,35  The  fluorocarbon  would,  therefore,  accumu¬ 
late  mostly  on  the  sidewalls.32  In  addition,  the  trench  bottom 
area  remains  fairly  constant  throughout  the  etch,  whereas  the 
sidewall  area  continues  to  increase.  Note  also  that  we  have 
discounted  the  FC  deposit  on  trench  tops  from  the  possible 
cases  discussed  above.  We  have  determined  that  the  IR  ab¬ 
sorption  contribution  from  the  FC  deposition  on  trench  tops 
is  minimal  when  the  Si02  thickness  is  comparable  to  or  sig¬ 
nificantly  exceeds  the  depth  of  penetration  of  the  evanescent 
wave.  That  is,  when  the  FC  deposit  on  trench  top  is  placed 
outside  or  at  a  distance  comparable  to  the  probing  depth  of 
the  evanescent  wave,  the  observed  IR  absorption  pertains  to 
either  the  sidewalls  or  the  trench  bottom.  Therefore,  we  de¬ 
duce  that  the  observed  increase  pertains  mostly  to  the  side- 
walls. 

Two  mechanisms  may  contribute  to  the  increase  in  fluo¬ 
rocarbon.  (1)  The  fluorocarbon  deposits  in  the  sub-surface 
pores  near  the  exposed  Si02  sidewalls.36  For  porous  Si02 
with  2  nm  wide  pores  and  30%  porosity,  similar  to  our  me- 
soporous  Si02,  the  sub- surface  pore  filling  may  reach  a  depth 
of  ~3  to  4  nm.36  As  the  etching  exposes  progressively  in¬ 
creasing  Si02  sidewall  area  with  concurrent  pore  filling,  the 
total  amount  of  fluorocarbon  increases;  (2)  the  fluorocarbon 
deposits  externally  on  top  of  the  exposed  Si02  sidewalls,  and 
the  total  amount  increases  with  increasing  Si02  sidewall 
area.  We  assume  that  the  rate  of  diffusion  of  fluorocarbon 
precursors  through  the  mesopores  (~2  nm)  far  exceeds  the 
rate  at  which  the  exposed  Si02  surface  increases  and  that  the 
precursor  diffusion  is  self-limiting  due  to  the  fluorocarbon 
film  sealing  the  mesopores.32,36  Thus,  we  expect  that  the  in¬ 
crease  in  C-Fx  absorbance  is  primarily  due  to  the  FC 
build-up  on  top  of  the  exposed  Si02  sidewalls. 

For  a  quantitative  analysis  to  obtain  the  absolute  amount 
of  Si02  removal  and  C-Fx  accumulation  during  etching,  the 
IR  absorbance  spectra  in  Fig.  3  need  to  be  corrected  for  the 
exponentially  decaying  evanescent  field.  A  similar  correction 
procedure  has  been  previously  described.  ’  To  summarize, 
we  employ  the  effective  thickness  ( de )  approximation38,39 


where  the  IR  absorbance  (A)  is  normalized  by  de.  The  absor¬ 
bance  A  corresponds  to  the  Si02  film  remaining  at  any  given 
moment  during  etching.  A  is  approximated  by  the  sum  of  two 
absorbance  spectra: 

A=A0  +  Aexp,  (1) 

where  A0  is  the  absorbance  spectrum  representing  the  pre¬ 
etch  Si02  film  taken  with  respect  to  the  clean  Ge  ATR  crys¬ 
tal,  and  Aexp  is  the  experimentally  measured  absorbance 
spectrum  taken  with  respect  to  a  pre-etch  background  where 
the  patterned  PR  and  ARC  are  stacked  on  top  of  yet-to-be- 
etched  Si02.  A  and  de  are  related  to  the  extinction  coefficient 
( k )  of  the  mesoporous  Si02  film  by 


*(5)  = 


Mv) 
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where  v  and  NR  denote  the  wave  number  in  cm-1  and  the 
number  of  internal  reflections  from  the  Si02-Ge  interface, 
respectively.  NR  is  12  in  this  case.  Since  de  is  yet  unknown, 
the  pre-etch  Si02  film  thickness  is  used  as  the  initial  value  of 
de.  k  is  related  to  the  real  part  (n)  of  the  complex  refractive 
index  (n=n  +  ik)  for  the  Si02  film  by  the  Kramers-Kronig 
dispersion  relation: 
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where  nt  is  the  refractive  index  of  the  film  at  wave  number 
vb  and  P  denotes  the  Cauchy  principle  value  of  the  integral 
from  zero  to  infinity.  However,  the  experimentally  collected 
spectrum  has  a  finite  wave  number  range  within  the  mid-IR 
region,  and  the  integral  cannot  be  evaluated  from  zero  to 
infinity.  This  limitation  can  be  circumvented  by  measuring  a 
real  part  (nr)  of  the  complex  refractive  index  at  a  known 
wave  number  vr  and  subtracting  nr  from  nr  The  inherent 
assumption  is  that  the  difference  between  nt  and  nr  is  close  to 
zero  in  the  domains  outside  the  experimentally  accessible 
wave  number  range:  i.e.,  0^  v<  vx  and  v2<  i^00-  This  as¬ 
sumption  leads  to 
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where  nr  is  measured  at  vr=\5  800  cm-1  by  spectroscopic 
ellipsometry  (SE).13  The  experimentally  measured  value  of 
nr  is  1.26  for  the  mesoporous  Si02  film.  The  limits  (vx  and 
v2)  of  integration  in  the  principle  integral  are  650  and 
4000  cm-1,  respectively. 

The  value  of  nt  from  Eq.  (4)  is  then  used  to  calculate  the 
two  components  of  the  evanescent  field:  Em  that  is  parallel  to 
the  substrate  surface  and  E01  that  is  perpendicular  to  the 
substrate  surface.  The  strength  of  these  two  components  de¬ 
pends  on  n{  of  the  mesoporous  Si02;  nx(v),  the  real  part  of 
the  complex  refractive  index  for  Ge  ATR  crystal;  and  n3(v), 
the  real  part  of  the  complex  refractive  index  for  vacuum. 
n3(v)  approximates  the  refractive  index  of  high  vacuum  in 
the  etch  chamber.  Denoting  nt  as  n2(v)  for  convenience,  Em 
is  expressed  as 
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2  cos  #[(1  +  ^32)sin2  0-n^]112 

£°"  =  (i-4)1/2[(i+4)sin20-4]1/2’  5 

where  nim  ( 1=3  and  m- 1  or  2)  denotes  nrto-nm  ratio.  The 
perpendicular  component,  E0±,  is  expressed  as 

2  cos  6 

£»i  =  a (6) 

The  value  of  nt  from  Eq.  (4)  is  also  used  to  calculate  the 
characteristic  penetration  depth  ( dp )  of  the  evanescent  wave 
by 


dp  =  27r(sin2  9- n\\)vl  ’  (?) 

where  X  is  the  IR  wavelength  in  vacuum,  and  6  is  the  inci¬ 
dent  angle  of  the  IR  on  the  top  surface  inside  the  ATR  crys¬ 
tal.  The  effective  thickness  ( de )  is  then  expressed  as 

de  =  "2lE° -d  [l  - exp(-  2 S/dp)],  (8) 

2  cos  6  F 

where  8  is  the  remaining  Si02  film  thickness.  8  is  calculated 
by 


8=  80-  (ER  X  t) ,  (9) 

where  8(),  ER ,  and  t  represent  pre-etch  Si02  film  thickness, 
etch  rate  measured  on  a  blank  Si02  film,  and  etch  duration, 
respectively.  Since  the  Si02  film  is  partially  masked  during 
the  pattern  etch,  8  is  an  underestimation  of  remaining  Si02. 
Equation  (9)  also  assumes  that  the  vertical  etch  rate  remains 
constant  throughout  etching.  To  account  for  possible  loading 
effect  and  time-dependent  etch  rate,  in  situ  spectroscopic 
ellipsometry 14,40  or  interferometry41,42  can  be  used  for  more 
reliable  estimate  on  8.  Note  that  Eq.  (8)  results  in  two  effec¬ 
tive  thickness  values  (de y  and  del ),  depending  on  whether  Em 
or  E0±  is  used  in  place  of  E0.  As  an  approximation,  the 
arithmetic  average  of  de y  and  del  obtained  from  Eq.  (8)  is 
used  for  de  in  Eq.  (2)  to  calculate  k.  The  above  procedure 
using  Eqs.  (2)-(8)  is  repeated  until  nt  and  k  converge.  To  test 
their  convergence,  the  real  part  (e^  of  the  complex  dielectric 
constant  (e)  is  first  calculated  by 

s{=  n2  -  k2,  (10) 

where  the  subscript  i  for  nt  is  omitted  for  convenience.  The 
imaginary  part  (e2)  is  then  calculated  by 


s2  =  Ink. 


(ID 


The  magnitude  of  s  is  then  expressed  as 

8  =  Ve?  +  e|.  (12) 

We  employ  an  absolute  convergence  criterion  where  the  root 
mean  square  (rms)  value  defined  in  Eq.  (13)  is  less  than  10-2: 
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rms  = 


-ekJ)2/N, 


(13) 


where  k  is  the  kth  wave  number,  l  is  the  /th  iteration,  and  N 
is  the  number  of  experimental  data  points  in  the  absorbance 
spectrum. 

After  de  is  self-consistently  calculated  by  the  procedure 
described  above,  the  absolute  amount  of  Si02  remaining  at 
each  moment  during  etching  is  approximated  by 

2.60  X  1012  cm-1 1  Asi_0_si(P)dv  w  o 
ASi_0_si-  _  Xd 

nr  f  hx  de(y) 

XAtop,  (14) 

where  A^Si_0_si  is  the  total  number  of  Si-O-Si  removed,  /  is 
the  oscillator  strength  of  Si-O-Si  stretching  vibrational 
mode,  and  Atop  is  the  top  surface  area  of  the  Ge  ATR  crystal. 
The  integration  is  evaluated  from  1^  =  984  cm-1  to  v2 
=  1195  cm-1.  The  top  surface  area  remains  constant  at  5  cm2, 
while  8  decreases  according  to  Eq.  (9).  The  oscillator 
strength  is  7.38  X  10-5  calculated  from  43 


f=  1.29  X  1017 


aW 


K 


mesoporous 


nlR 

(niR  +  2)2’ 


(15) 


where  a,  W ,  A^mesoporous’  and  ^ir  denote  the  peak  absorption 
coefficient  (1.72  cm-2),  the  full  width  at  half  maximum 
(100  cm-1)  for  the  Si-O-Si  absorbance  band,  the  concentra¬ 
tion  of  Si02  units  (—1.92  X  1022  cm-3)  in  the  mesoporous 
film,  and  the  real  part  (1.84)  of  the  complex  refractive  index 
near  the  1050  cm-1  region.  We  approximate  Afmesoporous  by 


mesoporous  ' 


^mesoporous^y 
^thermal 


thermal  * 


(16) 


where  ^mesoporous  an(i  ^thermal  denote  the  real  part  of  the  com¬ 
plex  refractive  index  in  the  visible  range  for  mesoporous 
Si02  at  1.26  and  that  of  thermal  Si02  at  1.46,  respectively. 
The  experimentally  measured  ftmeSoporous  is  identical  to  a  re- 
ported  value,  thermal  is  2.2  X  10  cm-J  based  on  the  spe¬ 
cific  gravity  and  molecular  weight  of  thermally  grown 
Si02.44  ?zIR  of  mesoporous  Si02  is  similarly  approximated  to 
be  1.84  by  Eq.  (16),  based  on  n1R  of  thermal  Si02  at  2.12. 

For  C-F,  the  total  number  of  C-F  (A^c_f)  created  during 
etching  is  calculated  from 


45 


Ac_f: 


1  1  p2  Ac_F(v)dv 

= - c  —  X  £c_f  X  (j ER  Xt)XL , 

NrctC-vJ~Vx  vde(y) 


(17) 


where  crc_F  is  the  IR  cross-section  of  C-F  stretching  vibra¬ 
tional  mode  (2.84  X  10-20  cm2),46  £C_F  is  the  thickness  of  the 
fluorocarbon  sidewall  passivation,  and  L  is  the  length  of  the 
etched  trenches  (5  cm).  The  integration  is  evaluated  from 
^  =  1200  cm-1  to  v2- 1700  cm-1.  For  comparison  purpose 
only,  we  assume  that  £C_F  remains  constant32  at  1  nm,47  con¬ 
trary  to  the  reported  observation  that  £C_F  tapers  down  to¬ 
wards  the  bottom  of  the  etched  trenches.31  This  assumption 
helps  visualize  how  the  sidewall  passivation  thickness 
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Fig.  4.  Integrated  absorbance  of  Si-O-Si  is  plotted  vs  time  for  (a)  a  sample 
with  relatively  anisotropic  etch  profile  and  (b)  a  sample  with  highly  non¬ 
ideal,  bell-jar  shaped  etch  profile.  The  corresponding  cross-sectional  SEM 
images  are  shown  in  the  insets.  The  scale  bar  is  200  nm.  The  IR  absorbance 
corrected  for  the  exponentially  decaying  evanescent  field  is  used  to  calculate 
the  total  loss  of  Si-O-Si  bonds.  A  nonlinear  slope  of  total  Si-O-Si  loss 
(Asi-o-Si)  during  plasma-on  period  foretells  a  deviation  from  perfect  aniso¬ 
tropy.  The  nonideal  profile  is  particularly  pronounced  for  the  profile  shown 
in  (b). 


changes  as  a  function  of  time.  If  Nc_¥  deviates  from  a  linear 
line,  then  the  slope  of  AC_F  describes  a  number  of  scenarios, 
as  to  whether  the  actual  SC_F  tapers  down  towards  the  trench 
bottom,  thickens  towards  the  trench  bottom,  or  thickens  over 
the  entire  passivation  layer  while  maintaining  the  general 
tapered  down  cross-section. 

The  SEM  images  in  Fig.  4  illustrate  two  contrasting  ex¬ 
amples  of  profile  evolution:  (a)  anisotropic  profile  and  (b) 
highly  nonideal,  bell-jar  shaped  profile.  The  lateral  opening 
of  the  Si02  trench  is  —0.15  jum  for  both  profiles.  The  CHF3 
and  Ar  flow  rates,  rf  power  to  the  plasma  source,  and  rf 
power  to  the  substrate  are  maintained  at  22  and  45  seem,  800 
W,  and  150  W,  respectively,  for  the  anisotropic  profile  and  at 
11  and  50  seem,  400  W,  and  100  W,  respectively,  for  the 
bell-jar  shaped  profile.  Another  difference  is  that  the  ARC  is 
etched  in  an  02-Ar  plasma  for  5  min  for  the  anisotropic 
profile,  whereas  the  ARC  is  etched  under  the  identical  con¬ 
ditions  used  in  Si02  etch  for  the  bell-jar  shaped  profile.  The 


etch  is  stopped  30  s  (—10%  overetch)  after  the  presumed 
endpoint  based  on  the  etch  rates  observed  on  identically  pat¬ 
terned  samples.  Despite  the  overetch,  we  have  determined 
that  the  underlying  Ge  etch  is  below  detection  limit. 

Corresponding  to  the  two  etch  profiles,  Fig.  4  shows  the 
integrated  absorbance  of  Si-O-Si  (/s i_o-si)  before  correcting 
for  the  exponentially  decaying  evanescent  wave.  /si-o-Si 
qualitatively  indicates  that  Si02  is  removed  approximately  at 
a  constant  rate  for  the  anisotropic  profile  and  at  a  rate  that 
varies  throughout  the  etch  for  the  bell-jar  shaped  profile.  The 
detailed  quantitative  information  on  profile  evolution  is  ex¬ 
tracted  from  ASi_o-si  based  on  Eq.  (14).  Afsi_0_si  accounts  for 
the  exponential  decay,  and  its  time  derivative  conveys  the 
information  on  profile  evolution.  We  employ  a  fifth-order 
polynomial  function  ( — )  to  fit  ^/Vsi-o-Si  (•)•  The  smoothness 
of  the  polynomial  fit  reduces  the  noise  in  the  calculated  rate 
of  decrease  ( — ).  The  rate  of  decrease  is  equivalent  to  the 
removal  rate  of  Si02.  The  removal  rate  accounts  for  both 
lateral  and  vertical  losses;  therefore,  the  removal  rate  is  dif¬ 
ferent  from  the  blank  film  etch  rate.  For  instance,  the  dotted 
line  (— )  in  Fig.  4(a)  indicates  that  the  Si02  removal  rate 
remains  nearly  constant  until  the  very  end  of  the  etch  process 
between  100  and  120  s.  The  reduced  removal  rate  translates 
to  narrowing  profile,  and  the  SEM  image  in  Fig.  4(a)  shows 
a  slight  footing  near  the  bottom  of  the  trench. 

For  the  bell-jar  shaped  profile  in  Fig.  4(b),  we  demon¬ 
strate  that  the  profile  evolution  of  ARC  and  Si02  can  be 
successively  monitored.  To  this  end,  the  IR  absorbance  of 
characteristic  vibrational  modes  of  ARC,  which  appear  in  the 
1080  cm-1  region,  is  included  in  the  calculation  of  AfSi_0_si- 
Thus,  Afsi_o_si  partially  reflects  the  removal  rate  of  ARC  in 
addition  to  that  of  Si02.  The  nonlinearity  in  the  ARC  and 
Si02  removal  rate  is  consistent  with  the  bulging  profile  of 
ARC  and  the  lateral  erosion  of  Si02.  One  can  trace  Points 
1-5  on  the  removal  rate  curve  (— )  and  match  them  with  the 
corresponding  points  in  the  SEM  image.  Note  that  the  re¬ 
moval  rate  is  negative  with  its  corresponding  axis  on  the 
right  hand  side.  Thus,  the  absolute  removal  rate  increases 
with  increasingly  negative  quantity.  The  observed  removal 
rate  suggests  that  the  ARC  etching  lasts  —350  s,  after  which 
Si02  etch  begins  at  a  comparatively  accelerated  pace.  In  Fig. 
4(b),  /si-o-Si  at  350  s  is  —50,  matching  that  of  initial  /si-o-Si 
in  Fig.  4(a).  Since  the  Si02  thickness  of  the  sample  in  Fig. 
4(a)  is  the  same  as  that  in  Fig.  4(b),  this  equivalence  vali¬ 
dates  the  demarcation  of  when  ARC  etch  ends,  and  Si02  etch 
begins,  while  supporting  the  positional  accuracy  of  assigned 
numbers  (Points  1-5)  along  the  etch  profile.  In  the  future,  we 
plan  to  compare  the  etch  profiles,  that  are  predicted  and  in¬ 
dependently  verified  by  XSEM,  with  experimentally  mea¬ 
sured  ion  energy  and  angular  distribution  functions  in  Si02, 
Si3N4,  and  SiOJNTy  cases  where  the  etch  is  strongly  driven  by 
ion  bombardment. 

The  absolute  amount  of  C-F  formation  during  etching, 
calculated  from  the  absorbance  of  C-F  stretching  vibrational 
mode  by  Eq.  (17),  is  shown  in  Fig.  5.  Figure  5(a)  corre¬ 
sponds  to  Fig.  4(a),  and  Fig.  5(b)  corresponds  to  Fig.  4(b).  In 
contrast  to  AfSi_0_si,  NC_F  increases  throughout  the  etch.  The 


J.  Vac.  Sci.  Technol.  A,  Vol.  23,  No.  2,  Mar/Apr  2005 


353 


Gerung  et  a!.:  In  situ  real-time  monitoring  of  profile  evolution 


353 


Fig.  5.  IR  absorbance  of  C-F  stretching  vibrational  mode  near  1300  cm-1 
corrected  for  the  exponentially  decaying  evanescent  field  strength  is  used  to 
calculate  the  total  number  of  C-F  formed  during  etching  for  (a)  a  sample 
with  relatively  anisotropic  etch  profile  and  (b)  a  sample  with  highly  non¬ 
ideal,  bell-jar  shaped  etch  profile.  The  inset  in  (a)  conceptually  describes  the 
shape  and  thickness  of  sidewall  passivation  in  the  beginning  (B),  interme¬ 
diate  (I),  and  final  (F)  stage  of  etching.  Note  that  the  inset  is  not  drawn  to 
scale.  The  C-F  buildup  is  pronounced  towards  the  end  point,  and  a  critical 
buildup  is  necessary  to  maintain  the  anisotropic  profile. 


C-F  accumulation  rate  is  calculated  from  the  slope  of  /VC_F, 
in  the  same  way  the  Si02  removal  rate  is  calculated.  The 
accumulation  rate,  which  rises  sharply  towards  the  end  of  the 
etching  process,  suggests  that  the  most  of  C-F  accumulation 
occurs  towards  the  end,  when  the  trench  bottom  meets  the 
underlying  Ge  ATR  crystal.  If  the  sidewall  passivation  profile 
is  assumed  to  maintain  its  wedge  shape  tapering  down  to¬ 
wards  the  trench  bottom  during  etching,  the  sharp  rise  in 
Nc_ F  translates  to  a  profile  evolution  schematically  drawn  in 
the  inset  of  Fig.  5(a).  B,  I,  and  F  in  the  inset  represent  be¬ 
ginning,  intermediate,  and  final  stage  of  etching,  respec¬ 
tively.  The  profile  maintains  its  downward  wedge  shape,  but 
its  overall  thickness  rises  sharply  near  the  endpoint.  Both 
anisotropic  and  bell-jar  shaped  samples  commonly  share  this 
trend.  The  only  dramatic  difference  between  the  two  samples 
is  the  absolute  amount  of  C-F  accumulation.  The  anisotropic 
profile  exhibits  a  high  level  of  C-F  bond  accumulation 
throughout  the  etch  (8  X  1015  total  when  the  etch  is  com¬ 


plete),  whereas  the  bell-jar  shaped  profile  exhibits  a  com¬ 
paratively  low  level  of  C-F  bond  accumulation  (2  X  1015  to¬ 
tal  at  the  end).  Given  the  same  Si02  thickness  and  trench  top 
opening  width,  we  infer  that  a  critical  amount  of  C-F  exists, 
below  which  a  substantial  lateral  loss  in  Si02  occurs.  That  is, 
the  sample  corresponding  to  Figs.  4(b)  and  5(b)  does  not 
have  enough  fluorocarbon  buildup  on  the  sidewalls  and, 
therefore,  not  enough  passivation  thickness  to  prevent  the 
lateral  loss. 

IV.  CONCLUSIONS 

We  have  demonstrated  the  use  of  ATR-FTIRS  to  monitor 
the  etch  profile  evolution  of  mesoporous,  low-dielectric- 
constant  Si02  in  situ  and  in  real  time.  The  technique  is  non- 
intrusive  as  well  as  nondestructive,  eliminating  the  necessity 
to  cleave  the  etched  samples  for  SEM  imaging.  We  rely  on 
the  IR  absorbance  of  Si-O-Si  stretching  vibrational  mode 
during  etching  to  calculate  the  Si02  loss.  When  the  absor¬ 
bance  spectra  are  corrected  for  the  exponentially  decaying 
strength  of  the  evanescent  wave,  using  the  algorithm  de¬ 
scribed  in  Results  and  Discussion,  the  total  amount  of  Si02 
loss  can  be  monitored  in  real  time,  and  the  removal  rate  is 
calculated  from  the  slope  of  total  Si02  loss.  The  calculated 
removal  rate  and  the  corresponding  etch  profile  show  good 
agreement,  further  validating  that  the  ATR  technique  can  be 
used  to  predict  the  profile  evolution  during  etching.  In  a 
similar  approach,  we  have  investigated  the  fluorocarbon  ac¬ 
cumulation  during  etching.  The  total  accumulation  of  C-F 
and  the  corresponding  profile  suggest  that  a  critical  amount 
of  C-F  buildup  is  needed  to  maintain  an  anisotropic  profile 
and  that  the  most  of  C-F  buildup  occurs  towards  the  end  of 
etching. 
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Island  coalescence  during  Volmer- Weber  thin  him  growth  is  generally  accepted  to  be  a  source  of 
tensile  stress.  However,  the  stochastic  nature  of  unpattemed  him  nucleation  and  growth  prevents 
meaningful  comparison  of  stress  measurements  taken  during  growth  to  that  predicted  by  theoretical 
models.  We  have  overcome  this  by  systematically  controlling  island  geometry  using  selective 
growth  of  Ni  hlms  on  patterned  substrates  via  electrodeposition.  It  was  determined  that  the 
measured  power-law  dependence  of  mean  stress  on  island  size  agreed  well  with  theory.  However, 
our  data  clearly  demonstrates  that  the  majority  of  the  tensile  stress  associated  with  coalescence 
actually  occurred  after  the  initial  contact  of  neighboring  islands  as  the  him  planarizes  with 
additional  deposition.  ©  2005  American  Institute  of  Physics.  [DOI:  10.1063/1.1870109] 


I.  INTRODUCTION 

The  effective  mechanical  properties  of  a  material  are 
highly  dependent  upon  the  residual  stress  state  introduced 
during  fabrication.  This  is  particularly  true  for  thin  hlms 
where  the  residual  stress  can  be  a  signihcant  portion  of  the 
yield  strength.  Unfortunately,  the  evolution  of  intrinsic  stress 
during  deposition  of  Volmer- Weber  thin  hlms  is  quite  com- 
plex  and  still  highly  debated.  The  only  region  where  there 
is  a  consensus  as  to  the  mechanism  is  during  island  coales¬ 
cence,  where  tensile  stresses  have  been  observed  to  exceed 
one  gigapascal.4  Conceptually,  tensile  stress  is  generated 
when  adjacent  islands  initially  touch  one  another,  and  then 
elastically  deform  to  contact  over  a  hnite  area  in  order  to 
reduce  the  overall  surface  energy.  Existing  theoretical  mod¬ 
els  for  coalescence  stress  ah  determine  the  mean  tensile 
stress  in  mechanical  equilibrium,  as  a  function  of  island  size 
and  geometry  at  the  moment  of  coalescence.1,4-8  However, 
computational  necessity  requires  use  of  highly  simplihed  is¬ 
land  geometries  with  uniform  sizes  and  simultaneous  coales¬ 
cence.  In  real  hlms,  coalescence  events  are  stochastically  dis¬ 
tributed  in  time  and  occur  among  islands  with  a  broad  range 
of  sizes  and  shapes.  Furthermore,  multiple  mechanisms  for 
stress  generation  can  be  operating  at  the  same  time.  As  a 
result,  it  is  not  possible  to  quantitatively  equate  theoretical 
predictions  for  the  tensile  coalescence  stress  with  existing 
measurements  in  stochastically  nucleated  hlms  and  a  mean¬ 
ingful  comparison  is  only  obtainable  when  island  geometries 
are  systematically  controlled. 

We  obtain  both  the  functional  dependence  of  mean  ten¬ 
sile  stress  on  island  radius,  and  the  absolute  magnitude  of  the 
stress,  by  measuring  stress  changes  during  electrodeposition 
of  Ni  islands  where  the  coalescence  process  was  constrained 
via  lithographically-defined  island  nucleation  sites  and 
selective-area  growth  (Fig.  1).  We  demonstrate  that  the  ex¬ 
perimentally  measured  coalescence  stress  is  in  good  agree¬ 
ment  with  the  predictions  from  the  Hertzian  contact  model  of 


Freund  and  Chason7  and  with  two-dimensional  hnite  element 
(FE)  analysis.  We  have  also  determined  that  the  initial  coa¬ 
lescence  stress  was  the  minority  component  of  the  total  stress 
created  during  the  coalescence  and  planarization  of  the  him. 
This  determination  lays  the  groundwork  for  future  analysis 
of  the  postcontact  stress  generation  process,  which  is  consid¬ 
erably  more  complex  (and  beyond  the  scope  of  this  paper). 


FIG.  1 .  (a)  Schematic  of  an  idealized  cylindrical  structure,  (b)  ion  channel¬ 
ing  focused  ion  beam  (ICFIB)  cross-section  image  of  actual  structure  prior 
to  coalescence,  and  (c)  ICFIB  image  of  the  actual  structure  after  coales¬ 
cence.  Note,  the  islands  were  coated  after  electrodeposition  with  Pt  for 
processing  in  the  FIB. 
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II.  EXPERIMENTAL  TECHNIQUES 

Motivated  by  recent  models  for  coalescence  stress  by 
Freund  and  Chason7  and  by  Nix  and  Clemens,6  we  chose  to 
experimentally  evaluate  the  “two-dimensional”  coalescence 
geometry,  where  the  islands  were  idealized  as  an  array  of 
parallel  half-cylinders,  as  shown  in  Fig.  1(a).  In  order  to 
accomplish  this,  periodic  trench  arrays  were  patterned  into  a 
photoresist  layer  to  expose  a  1500  A  thick  Au  nucleation 
layer.  This  Au  film  was  grown  on  a  Ti  adhesion  layer  on  a 
thermally  oxidized  Si  (001)  substrate.  Island  size  was  con¬ 
trolled  by  varying  the  spacing  of  the  trenches,  while  keeping 
the  nominal  trench  pitch-to-width  ratio  (dpitch/dtrench)  =  2.65. 
The  effective  cylinder  radii,  R ,  ranged  from  0.3  jmm  to 
26  jmm.  Ni  films  were  potentiostatically  electroplated  from 
an  additive-free  Ni-sulfamate  bath  at  40  °C.  Prior  to  elec¬ 
trodeposition  of  the  coalescence  samples,  the  bath  was  con¬ 
ditioned  at  10  mA/cm2  for  4  h  to  remove  trace  ionic  con¬ 
taminants  such  as  Fe  and  Co.  Additionally,  ultrahigh  purity 
N2  was  bubbled  through  the  bath  for  a  minimum  of  24  h, 
which  increased  run-to-run  reproducibility.  The  plating  effi¬ 
ciency  was  determined  by  comparing  the  Faradic  current  to 
thicknesses  measured  using  Rutherford  backscattering  spec¬ 
trometry.  During  deposition,  the  substrate  curvature  induced 
by  thin-film  stress  was  measured  using  laser  deflectometry,3 
with  the  wafer  clamped  in  a  cantilever  configuration.  Be¬ 
cause  film  thicknesses  were  often  an  appreciable  fraction  of 
the  substrate  thickness,  the  usual  Stoney’s  equation  did  not 
hold,  and  the  following  relation  for  curvature  (ac)  was  used:9 

k-«M _ _ 1  (1) 

h]  [l+HY(4  +  6H  +  4H2)+H4Y2y  K 

where  H  is  the  ratio  of  the  film  thickness  (h-f),  to  the  sub¬ 
strate  thickness  (hs),  cr  is  the  mean  film  stress,  and  Y  is  the 
ratio  of  the  film  modulus  to  the  substrate  modulus.  Young’s 
modulus  was  used  rather  than  the  biaxial  modulus  due  to  the 
uniaxial  loading  geometry  used  here. 


FIG.  2.  Stress-thickness  vs  time  plot  for  a  4.2  jmm  radius  coalescence 
sample  showing  a  measured  change  in  stress*thickness  during  coalescence 
of  96  GPa  A. 


coalescence  there  must  be  a  corresponding  maximum  in  the 
plating  current  at  the  same  moment.  To  quantify  this  we  de¬ 
veloped  a  geometric  model  that  predicts  the  evolution  of  the 
surface  area  and  in  turn,  the  deposition  current  of  a  cylindri¬ 
cal  geometry  as  shown  in  Fig.  3.  Prior  to  coalescence,  the 
surface  is  given  by  the  sum  of  two  quarter  cylinders  and  the 
trench  that  grow  radially  outwards  due  to  the  growth  direc¬ 
tion  being  normal  to  the  growth  surface  during  electrodepo¬ 
sition  [Fig.  3(a)].  This  results  in  the  total  current  (7tot)  of  the 
wafer  prior  to  coalescence  being 


dr  dq 

hot=(^r+wtt)  —  —ntIws, 


(2) 


where,  r  is  the  island  radius,  niv  is  the  number  of  trench 
across  the  samples,  ws  width  of  the  sample,  wtr  width  of  the 
trench,  dr/  dt  is  the  deposition  rate,  and  dq/dV  is  the  plating 
efficiency  measured  to  be  3.7  X  107  mC/cm3  from  RBS 


III.  RESULTS 

Figures  1(b)  and  1(c)  demonstrate  the  selective  lateral 
growth  that  can  be  obtained  using  electrodeposition.  At  coa¬ 
lescence,  the  islands  were  not  true  half-cylinders,  as  shown 
in  Figs.  1(b)  and  1(c),  and  we  will  account  for  this  in  our 
analysis.  Figure  2  shows  the  measured  evolution  of  “stress- 
thickness,”  crhf ,  calculated  from  curvature  using  Eq.  (1).  In 
the  figure,  Ni  was  electrodeposited  into  an  array  of  trenches 
that  yielded  nominally  4.2  jmm  radius  cylinders  [with  the  ra¬ 
dius  as  defined  in  Fig.  1(b)].  The  first  rise  in  the  tensile  stress 
was  the  result  of  random  Ni  nucleation  and  coalescence 
within  the  trenches  themselves  to  form  the  Ni  lines.  The 
second  tensile  rise  was  due  to  the  coalescence  of  adjacent 
cylindrical  islands,  and  was  the  stress  change  that  we  ana¬ 
lyzed.  In  order  to  do  this,  we  found  it  necessary  to  differen¬ 
tiate  between  the  stress  created  during  the  initial  coalescence 
event  and  the  subsequent  stress  created  as  the  film  evolved 
from  cylinders  towards  a  planar  surface. 

During  potentio static  deposition  plating  current  is  lin¬ 
early  proportional  to  the  surface  area.  Therefore,  since  the 
surface  roughness  of  a  film  is  maximum  at  the  moment  of 


(a) 


^ _ island 


substrate 


FIG.  3.  Idealized  cross-sectional  geometry  used  in  current  evolution  model, 
(a)  prior  to  coalescence,  (b)  postcoalescence. 
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FIG.  4.  Overlay  of  current  predicted  from  geometric  model  (dashed  line) 
with  the  current  measured  during  deposition  (solid  line). 


thickness  measurement  of  unpattemed  films.  After  contact, 
the  islands  asymptotically  approached  a  planar  surface  due  to 
the  overlapping  of  the  cylinder  [Fig.  3(b)]  with  the  total  cur¬ 
rent  being  equal  to 


hot  — 


.  R  . 

2 r  •  arcsin[  —  )  +  wtr 


dr  dq 

Jtdvn  trWs 


(3) 


where  R  is  the  radius  of  the  islands  when  they  coalesced. 

Figure  4  is  an  overlay  of  the  predicted  (dashed  line)  and 
measured  (solid  line)  currents  for  a  single  representative  ex¬ 
periment.  The  only  fitting  parameter  was  the  position  of  the 
maximum  current,  which  was  required  because  the  plating 
rate  is  larger  than  the  steady-state  value  during  the  formation 
of  the  diffusion  zone.  This  initial  increase  in  the  deposition 
rate  causes  the  peak  to  occur  earlier  than  was  predicted  by  a 
constant  deposition  rate.  In  comparing,  the  modeled  and 
measured  current  two  main  features  are  apparent,  rounding 
of  the  measured  peak  current  and  a  higher  asymptotic  cur¬ 
rent.  Rounding  of  the  peak  in  the  measured  current  was 
likely  the  results  of  variation  in  the  thickness  of  the  photo¬ 
resist,  variation  in  the  trench  width  across  the  samples,  and 
surface  roughness,  where  as  the  difference  between  the  mea¬ 
sured  and  predicted  asymptotic  currents  is  likely  the  result  of 
the  increased  surface  area  due  to  surface  roughness.  For  the 
sample  used  in  Fig.  4  the  difference  between  the  asymptotic 
currents  was  —12%,  where  as  AFM  measurements  of  the 
surface  indicated  the  surface  area  was  —6%  larger  than  a 
planar  film,  which  is  reasonable  considering  the  reduction  in 
the  measured  surface  roughness  due  to  the  finite  radius  of 
curvature  of  the  AFM  tip. 


IV.  DISCUSSION 

Figure  4  demonstrates  that  the  moment  of  initial  coales¬ 
cence  corresponds  to  the  maximum  surface  area  of  the  film. 
Therefore,  the  change  in  stress-thickness  due  to  the  initial 
coalescence  of  the  islands  was  determined  as  the  deviation  in 
stress-thickness  from  the  background  level  to  the  point  of 
maximum  current,  as  shown  in  Fig.  2.  The  remainder  of  the 
tensile  rise  was  the  result  of  continual  coalescence  of  the 
islands  during  subsequent  deposition  and  planarization.  Our 
data  demonstrates  that  the  majority  of  the  tensile  stress  was 
actually  the  result  of  postcontact  coalescence  processes. 
Similar  behavior  has  been  observed  during  chemical  vapor 
deposition  of  unpatterned  diamond  films  and  was  attributed 
to  the  continual  coalescence  of  faceted  grains.10  Our  results 


FIG.  5.  RMS  roughness  surface  roughness  of  planar  films  measured  using 
ex  situ  AFM.  The  %  RMS  roughness  decreased  significantly  and  stabilized 
at  ~1  micron  of  thickness. 


support  the  generality  of  that  observation.  The  detailed  un¬ 
derstanding  of  continual  coalescence  requires  significant  ad¬ 
ditional  experimentation  and  analysis,  which  will  be  reported 
in  future  work. 

Figure  5  summarizes  our  results  for  the  measured  stress 
associated  with  initial  coalescence  as  a  function  of  the  island 
contact  radius.  This  volume- average  stress  was  obtained  by 
dividing  the  total  change  in  stress-thickness  due  to  coales¬ 
cence,  as  defined  above  and  in  Fig.  2,  by  the  mean  thickness 
determined  from  the  plating  current.8,11  On  a  log-log  plot  the 
data  exhibits  a  roughly  linear  increase  in  mean  stress  with 
decreasing  island  radius.  There  is  some  negative  deviation 
from  linear  behavior  at  smaller  radii,  which  was  typically 
associated  with  depositions  where  the  very  poor  signal-to- 
noise  ratio  created  unacceptably  large  error  in  the  measured 
stress.  Additionally,  AFM  measurements  indicate  that  the 
relative  roughness  on  surfaces  of  the  half-cylinders  was  sig¬ 
nificantly  larger  for  cylinder  radii  below  1  jam  (Fig.  6).  Sur¬ 
face  roughness  decreases  the  contact  area  between  islands, 
which  reduces  the  measured  stress.  Therefore,  in  the  folio w- 


FIG.  6.  Plot  of  measured,  FC,  and  FE  calculated  island  coalescence  stress  as 
a  function  of  radius.  Both  models  assume  perfect  cylinders  coalescing  and 
therefore  should  overestimate  the  coalescence  stress. 
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in g  analysis  we  shall  only  consider  samples  with  radii  larger 
than  1  jmm  for  quantitative  comparison  with  the  analytical 
and  FE  models. 

The  Freund  and  Chason  model  (FC)  (Ref.  7)  uses  Hert¬ 
zian  contact  theory  to  determine  the  volume  average  stress 
(o-avg)  created  within  the  islands  as  a  function  of  the  initial 
coalescence  geometry  of  the  island,  e.g.,  blocks,  cylinders,  or 
hemispheres.  Our  experiments  used  cylinders  with  their  axes 
parallel  to  the  substrate,  whereas  FC  used  cylinders  with 
their  axis  of  symmetry  normal  to  the  substrate.  This  required 
a  slight  modification  to  the  FC  model  as  follows. 

According  to  FC  the  total  contact  force  per  unit  length 
(P)  for  the  general  case  for  cylinder  contacting  under  plane 
strain  along  their  axis  of  symmetry  is  given  by 

P  =  p(irRE)(gR)213,  (4) 

where  p  the  normalized  contact  force  for  cylinders  and  was 
determined  by  FC  to  equal  0.3,  E  is  the  plane  strain  modulus, 
^=2yl  ttE,  and  y  is  the  difference  between  the  surface  en¬ 
ergy  and  half  the  grain  boundary  energy,  i.e.,  the  energy  of 
the  interface  between  the  coalesced  islands.  The  volume  av¬ 
erage  stress  is  found  by  dividing  the  contact  force  per  unit 
area  by  the  mean  film  height  for  the  modified  geometry, 
which  for  a  half  cylinder  is  given  by  ttR/4.  Therefore,  the 
volume  average  stress  for  a  cylinder  with  its  axis  parallel  to 
the  substrate  is 

cravg=1.2E^-^-j  =0.888£1/3(^)  .  (5) 

This  volume  average  stress,  for  half  cylinders  coalescing 
with  their  axis  of  symmetry  parallel  to  the  substrate,  is  twice 
that  of  cylinders  with  their  axis  perpendicular  to  the  sub¬ 
strate.  However,  the  power-law  dependence  of  the  predicted 
stress  with  island  radius  is  the  same  for  both  geometries  and 
equal  to  -2/3. 

As  shown  in  Fig.  1,  the  true  geometry  of  the  coalescing 
islands  used  in  these  experiments  was  more  complex  than  the 
idealized  analytical  model  due  to  the  additional  block  of  ma¬ 
terial  in  and  above  the  trench.  This  had  two  effects  on  the 
volume  average  stress:  First,  it  decreased  the  number  of 
boundaries  formed  per  unit  area  causing  an  overall  decrease 
in  the  magnitude  of  the  volume  average  stress.  Second,  the 
material  above  the  trench  acted  as  a  “shear-lag  zone”  that 
transmitted  the  stress  to  the  substrate,  which  acted  to  increase 
the  volume  average  stress.  2D  FE  modeling  was  used  to 
quantify  the  effect  of  the  material  in  and  above  the  trench  on 
the  functional  dependence  of  the  volume  average  stress  on 
island  radius.  The  details  of  using  FE  to  model  island  coa¬ 
lescence  have  been  presented  elsewhere.12  Two  geometries 
were  compared  using  FE  modeling:  ideal  half  cylinders  [Fig. 
1(a)]  and  the  actual  island  structure  (Fig.  7).  We  found  no 
significant  variation  in  the  functional  dependence  of  stress  on 
island  radius  (slope  of  -0.76  to  -0.74)  as  a  result  of  the 
different  geometries  and  only  a  slight  decrease  in  the  magni¬ 
tude  of  the  stress  with  the  addition  of  the  trench  and  photo¬ 
resist.  This  result  supports  the  use  of  a  simplified  geometry 
(pure  cylinders)  in  the  FC  model.  In  calculating  the  theoret¬ 
ical  stress  shown  in  Fig.  5  no  fitting  parameters  were  used 


FIG.  7.  In-plane  stress  (ayy)  fields  in  the  full  structure  as  calculated  from  FE 
model.  Darker  color  indicates  stronger  tensile  stress. 

and  both  the  FE  and  FC  models  were  calculated  assuming 
y=1.85  J/m2,13  and  E- 200  GPa  (measured  in  our  films  us¬ 
ing  nanoindentation).  It  should  be  noted  that  both  the  FE  and 
the  analytical  models  predict  the  maximum  possible  stress 
created  during  initial  coalescence  and  any  error  in  y  or  E 
would  result  in  a  shift  in  the  magnitude  of  both  curves  and 
cannot  be  used  to  explain  the  difference  between  the  results. 
Mechanisms  such  as  incomplete  coalescence  due  to  surface 
roughness  or  localized  yielding  will  always  act  to  reduce  the 
observed  stress.  Additionally,  the  slight  difference  in  the  ex¬ 
ponents  between  the  FE  (-3/4)  and  the  Hertzian  models 
(-2/3)  was  likely  the  result  of  the  lack  of  an  exact  solution 
for  the  force  distribution  in  contacting  cylinders  used  in  the 
Hertzian  model.7,14 

Figure  5  compares  the  calculated  coalescence  stress  for 
the  FE  (dashed  line)  and  the  FC  (solid  line)  models  with  the 
experimentally  measured  initial  coalescence  stress.  In  com¬ 
paring  theory  vs  data,  we  consider  the  power-law  behavior 
and  the  absolute  stress  magnitudes  independently,  because 
the  magnitude  depends  on  materials  properties  whereas  the 
power-law  behavior  is  solely  a  function  of  geometry.  The 
observation  that  the  stress  magnitudes  of  the  measured  data 
and  FE  results  are  similar  might  appear  fortuitous,  because 
there  is  considerable  uncertainty  in  the  surface  and  grain¬ 
boundary  energies  used  in  the  models  and  because  of  poten¬ 
tial  surface  roughness  effects.  However,  since  the  observed 
stress  was  of  a  similar  magnitude  to  the  predicted  stress, 
which  is  an  upper  bound,  it  is  reasonable  to  conclude  that 
any  reduction  in  the  initial  coalescence  area  by  surface 
roughness  was  minimal.  More  definitively,  Fig.  5  shows  that 
the  power-law  behavior  predicted  by  both  the  FE  and  the  FC 
models  is  within  the  experimental  variation  of  the  measured 
coalescence  stress  versus  island  radius  for  samples  larger 
than 

1  /zm  (-0.70  +  0.25).  Furthermore,  the  data  lie  outside  the 
predicted  behavior  for  both  slope  and  magnitude  for  the  ID 
and  3D  coalescence  geometries  in  the  FC  model.  Therefore, 
the  analytical  models  capture  the  dominant  physical  mecha- 
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nisms  underlying  the  generation  of  tensile  stress  during  is¬ 
land  coalescence. 

V.  CONCLUSION 

In  conclusion,  using  lithographically  defined  nucleation 
and  selective  area  growth  via  electrodeposition,  we  have  ob¬ 
tained  the  first  meaningful  comparison  of  experimentally 
measured  stresses  due  to  island  coalescence  with  the  stresses 
predicted  by  theory.  These  experiments  yielded  two  main 
results:  First,  the  measured  initial  coalescence  stress  had  a 
functional  behavior  similar  to  that  of  the  FC  and  FE  models 
for  coalescence  stress.  From  this  we  conclude  that  both  mod¬ 
els  correctly  account  for  ah  of  the  dominant  physical  mecha¬ 
nisms  active  during  the  initial  coalescence  of  islands.  Sec¬ 
ond,  the  majority  of  the  stress  associated  with  island 
coalescence  occurred  during  planarization  of  the  films,  not 
during  the  initial  contact  of  the  islands.  This  striking  result 
had  not  been  previously  recognized  in  unpatterned  metal 
films  due  to  the  inability  to  differentiate  the  stress  created  at 
the  initial  contact  from  that  created  during  planarization. 

ACKNOWLEDGMENTS 

The  authors  would  like  to  acknowledge  fruitful  discus¬ 
sions  with  Ben  Freund,  Thomas  Buchheit,  Brian  Sheldon, 
Eric  Chason,  and  Harley  Johnson.  This  work  was  supported 
by  the  DOE  office  of  Basic  Energy  Science  and  by  the  ARO/ 


MURI  in  Deep  Sub  wavelength  Optical  Nanolithography  at 
the  University  of  New  Mexico.  Sandia  is  a  multiprogram 
laboratory  operated  by  Sandia  Corporation,  a  Lockheed  Mar¬ 
tin  Company,  for  the  United  States  Department  of  Energy’s 
National  Nuclear  Security  Administration  under  Contract 
No.  DE-AC04-94AL85000. 

lR.  Koch,  J.  Phys.:  Condens.  Matter  6,  9519  (1994). 

2J.  A.  Floro,  S.  J.  Hearne,  J.  A.  Hunter,  P.  Kotula,  E.  Chason,  S.  C.  Seel, 
and  C.  V.  Thompson,  J.  Appl.  Phys.  89,  4886  (2001). 

3J.  A.  Floro,  E.  Chason,  and  S.  R.  Lee,  Mater.  Res.  Soc.  Symp.  Proc.  405, 
(1996)  (381). 

4R.  Abermann  and  P.  Martinz,  Thin  Solid  Films  115,  185  (1984). 

5R.  W.  Hoffman,  Thin  Solid  Films  34,  185  (1976). 

6W.  D.  Nix  and  B.  M.  Clemens,  J.  Mater.  Res.  14,  3467  (1999). 

7L.  B.  Freund  and  E.  Chason,  J.  Appl.  Phys.  89,  4866  (2001). 

8Allison  Suh,  Ning  Yu,  Ki  Myung  Lee,  Andreas  A.  Polycarpou,  and  H.  T. 
Johnson,  J.  Appl.  Phys.  96,  1  (2004). 

9L.  B.  Freund,  J.  A.  Floro,  and  E.  Chason,  Appl.  Phys.  Lett.  74,  1987 
(1999). 

10B.  W.  Sheldon,  K.  H.  A.  Lau,  and  A.  Rajamani,  J.  Appl.  Phys.  90,  5097 
(2001). 

nL.  B.  Freund  and  S.  Suresh,  Thin  Film  Materials  (Cambridge  University 
Press,  Cambridge,  2003),  Vol.  1,  p.  206. 

12S.  C.  Seel,  C.  V.  C.  V.  Thompson,  S.  J.  Hearne,  and  J.  A.  Floro,  J.  Appl. 
Phys.  88,  7079  (2000). 

13J.  P.  Hirth  and  J.  Lothe,  Theory  of  Dislocations,  2nd  ed.  (Wiley,  New  York, 
1975),  p.  839. 

14K.  L.  Johnson,  Contact  Mechanics,  1st  ed.  (Cambridge  University  Press, 
Cambridge,  1985),  Chap.  4.2,  p.  99. 


Downloaded  18  Apr  2005  to  134.253.26.12.  Redistribution  subject  to  AIP  license  or  copyright,  see  http://jap.aip.org/jap/copyright.jsp 


Demonstration  of  imaging  interferometric  microscopy 


Christian  J.  Schwarz,  Yuliya  Kuznetsova  and  S.  R.  J.  Brueck 
Center  for  High  Technology  Materials,  University  of  New  Mexico,  MSC04  2710,  1313  Goddard  SE, 

Albuquerque,  NM,  USA  87106 


ABSTRACT 

We  demonstrate  imaging  interferometric  microscopy  (IIM)  for  binary  objects  in  two  dimensions.  Combining  multiple 
exposures  with  different  off-axis  illumination  configurations  together  with  interferometric  restoration  of  the  zero-order 
beam  during  dark- field  conditions,  IIM  provides  high  lateral  resolution  at  low  numerical  apertures  ( NA ).  It  retains  the 
large  field-of-view,  long  working  distance  and  large  depth-of-field  of  a  low-W4  imaging  system.  Also  we  include  a  first 
demonstration  of  imaging  of  a  phase  mask.  All  these  properties  are  increasingly  important  for  in  semiconductor  mask 
metrology.  IIM  relies  on  image  processing  to  reconstruct  the  image;  we  present  the  processes  necessary  to  obtain  the 
combined  image.  Finally  we  compare  the  experiment  with  a  simple  Fourier  optics  model. 

Keywords:  Semiconductor  metrology,  optical  high  resolution  microscopy,  phase  sensitive  metrology,  defect  analysis, 
holography,  synthetic  aperture  radar,  Fourier  optics,  frequency  space  synthesis,  interferometry,  off-axis  illumination 

1.  INTRODUCTION 

Sub-wavelength  resolution  in  the  optical  regime  is  of  increasing  importance,  both  for  lithographic  printing  of  features  in 
integrated  circuits1  and  for  microscopy,  particularly  of  biological  specimens.2  The  specific  technological  requirements  of 
mask  inspection  for  microlithography  include:  a  large  field-of-view  (FOV)  in  order  to  cover  as  large  of  an  area  of  mask 
in  one  image  as  possible,  a  long  working  distance  (WD)  in  order  to  be  able  to  inspect  masks  that  are  protected  by  a  pelli¬ 
cle  mounted  several  millimeters  away  from  the  mask  surface,  and  high  lateral  image  resolution,  on  the  order  of  scaled 
critical  dimensions  (CD)  or  smaller,  in  order  to  detect  defects.  Further  a  large  depth-of-focus  (DOF)  and  phase  sensitiv¬ 
ity  are  advantageous  for  a  general  inspection  tool.  Imaging  interferometric  microscopy  (IIM)  offers  all  these  properties  at 
the  same  time.  As  an  optical  technology  IIM  allows  for  high  throughput,  standard  environmental  conditions  and  is  both 
noncontact  and  nondestructive,  which  makes  it  attractive  not  only  for  mask  inspection,  but  also  to  other  fields  of  applica¬ 
tion,  such  as  in  vivo  investigations  of  cells. 

IIM  takes  advantage  of  the  fact  that  information  about  an  object  is  contained  in  diffraction  directions  that  are  not  col¬ 
lected  using  conventional  illumination,  but  can  be  recovered  by  extreme  off-axis  illumination  (so  that  the  zero-order  of 
the  mask  is  outside  of  the  lens  collection  cone)  along  with  interferometric  reintroduction  of  a  zero-order  reference  beam 
on  the  low-W4  side  of  the  microscope.  IIM  developed  out  of  an  effort  aimed  at  improved  high-resolution  printing, 
namely  imaging  interferometric  lithography  (IIL).3  Both  IIL  and  IIM  approach  the  problem  of  high  resolution  imaging 
from  the  point  of  view  of  linear  systems  response  in  which  a  lens  or  any  optical  system  is  characterized  by  its  linear  re¬ 
sponse  function  in  frequency  space,4  and  a  lens  is  characterized  as  a  low  pass  filter.  The  question  of  high-resolution  im¬ 
aging  then  becomes,  how  to  transmit  as  many  spatial  frequencies  as  possible  through  the  lens  using  multiple  exposures 
and  different  illumination  conditions,  and  can  be  associated  with  the  problem  of  frequency  multiplexing  in  telecommu¬ 
nication.  The  difference  of  IIM  and  IIL  is  in  the  recording  technique.  IIM,  with  electronic  image  capture,  allows  for  im¬ 
age  subtraction,  IIL,  with  chemical  image  capture  in  the  resist  latent  image,  does  not.  Also  the  underlying  mathematical 
problem  is  different  due  to  the  difference  in  applications.  In  IIL  we  know  what  we  want  to  print  and  the  question  be¬ 
comes  (how  do  we  print  it  the  best,  most  efficient  way)  leading  to  an  optimization  problem.  In  IIM,  we  ultimately  don’t 
know  what  we  are  looking  at  leading  us  to  a  difficult  inverse  reconstruction  problem.  For  the  mask  inspection  applica¬ 
tion,  the  mask  database  is  known,  greatly  simplifying  the  inverse  problem. 

In  this  paper  we  first  review  the  principle  of  IIM  and  explain  our  experimental  arrangement,  then  present  experimental 
data  for  two-dimensional  objects,  introduce  a  Fourier  optics  computational  model,  compare  our  experiment  with  the 
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model,  and  with  data  from  a  conventional  high-W4  microscope  image.  Finally  we  discuss  the  presented  material  and  its 
future  possibilities  and  implications. 


2.  PRINCIPLE  OF  IIM 

Any  given  two-dimensional  object  can  be  described  by  the  Fourier  transform  representing  its  spatial  frequency  content. 
The  linear  systems  analysis  of  the  impact  of  the  optical  system  on  that  spectral  content  provides  insight  into  IIM.  The 
spatial  frequency  response  of  an  optical  system  depends  on  both  the  illumination  and  imaging  optics.  The  simplest  case 
is  coherent  illumination,  e.g.  plane  wave  illumination  from  a  specific  direction.  Scattering  by  the  mask  generates  a  range 
of  spatial  frequencies,  corresponding  to  a  band-limited  Fourier  transform  of  the  object.  The  band  limit  is  the  result  of  the 
finite  range  of  propagating  wave  vectors.  Scattering  from  higher  spatial  frequency  components  of  the  object  that  results 
in  non-propagating  evanescent  waves  cannot  be  captured  by  any  optical  system  relying  on  propagating  waves.  The  high¬ 
est  frequency  that  can  be  scattered  corresponds  to  MX  for  normal  incidence  illumination  (e.g.  to  light  propagating  along 
the  plane  of  the  object).  For  grazing  incidence  illumination,  the  highest  spatial  frequency  scattering  corresponds  to  back 
reflection,  or  to  a  spatial  wavelength  of  2/X.  This  simple  example  serves  to  illustrate  the  higher  spatial  frequencies  acces¬ 
sible  with  off-axis  illumination. 


transmitted 


reduce 

resolution 


through  the  lens 
and  interfere  with 
the  zeroth  order 


Figure  2:  For  extreme  off-axis  illumination,  the  optical  system 
captures  higher  frequencies.  Re  introduction  of  the  zero-order  on 
the  low  NA  side  of  the  optical  system  restores  the  image  compo¬ 
nents. 

Introducing  a  lens  after  the  object  further  restricts  the  Fourier  coefficients  that  will  be  transmitted  through  the  lens  (see 
Figs.  1  and  2).  Optical  systems  can  be  characterized  for  imaging  purposes  as  linear  low-pass  filters  as  mentioned  in  the 
introduction.4  The  restricted  spectral  bandpass  leads  to  a  larger  depth-of-field  (DOF)  as  compared  with  a  hypothetical 
lens  with  an  NA  of  unity.  In  addition,  because  of  the  aberrations  associated  with  high-W4  geometric  optics,  a  lower  NA 
lens  typically  has  a  longer  working-distance  (WD)  from  the  lens  to  the  object  and  images  a  larger  field-of-view  (FOV) 
than  a  higher-W4  lens.  The  strategy  of  IIM  is  to  use  a  low-W4  objective  with  desirable  DOF,  WD  and  FOV  and  to  extend 
the  range  of  captured  spatial  frequencies  by  using  extreme  off-axis  illumination  to  capture  additional  parts  of  the  spatial 
frequency  spectrum  as  illustrated  in  Fig.  2.  The  “extreme”  is  to  designate  that  the  off-axis  illumination  angle  is  so  steep 
that  the  zero-order  transmission  of  the  mask  is  not  captured  by  the  lens,  but  is  instead  interferometrically  reintroduced 
around  the  lens.  The  recording  medium,  a  CCD  camera  or  film,  is  a  square  law  device.  The  recorded  spatial  frequencies 
correspond  both  to  the  desired  image  information  (the  interference  between  the  zero-order  and  each  of  the  scattered  or¬ 
ders)  and  to  dark-field  components  (the  autocorrelation  of  the  scattered  fields)  that  can  be  subtracted  from  the  final  im¬ 
age.  In  principle  IIM  allows  us  to  capture  the  entire  available  frequency  space  up  to  the  propagation  limit  of  2/X  noted 
above. 

For  the  case  of  a  thin  one-dimensional  grating,  illustrated  in  Figs.  1  and  2,  where  we  have  two  positive  and  two  negative 
orders  within  the  optical  system  limit,  we  extract  the  linear  image  terms  by  subtracting  the  dark  field  image  (no  zero 


Figure  1:  For  normal  incidence  illumination,  the  imaging 
optical  system  captures  frequencies  up  to  NA/X. 
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order)  from  the  total  intensity  image.  First,  for  an  optical  system  that  captures  all  of  the  spectral  components  in  a  single 
image: 


I totaljin  I  total  ^  total, dark 


-  \E0  +2 Ex  cos(2^r/j)+2JE'2  cos(4;zx/r/)|2  -|2 Ex  cos(2^x/j)+2JE'2  cos(4;zx/<i)|2 
fsf +4\e,ex\  cos(2;r  x/d)+  4\E0EX  |  cos(4;r  x/d) 


(1) 


These  linear  image  terms  can  also  be  recovered  by  taking  a  low-frequency  image  (Fig.  1)  that  only  captures  the  zero  and 
first  orders  in  our  example  and  a  high  frequency  image  that  captures  the  second  order.  This  gives  us: 


I  low  ,Un  1 low  I  low, dark 


-  \E0  +2 Ex  cos(2;rx/<i)|2  -|2 Ex  cos(2;rx/<i)|2 
=  \Eq  |2  +  4\EqEx  |  cos(2;r  x/d) 


(2) 


and 


I  high  Jin  ^ high  ^  high, dark  ^  ref 


=  \2E0  +  E2ei4m/d\  -\E2\2 -4\E0\ 
=  4|  E0E2 1  cos(4;zx  /  d) 


(3) 


where  the  reference  beam  has  been  increased  by  a  factor  of  two  to  compensate  for  the  single-sideband  detection  implicit 
in  off-axis  illumination.  Together  we  have: 


1 total,, tin  =  11 lowMn+highM  =  \E()\  +  *\E0E\\  COs(l  7t  x/ d)  +  A\E0Ex\  COs(4^x/fi?) 

The  same  principle,  works  for  the  two  dimensional  continuous  case,  i.  e.  for  arbitrary  images. 
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Fig.  3:  Illustration  of  the  addition  of  spectral  components  from  different  parts  of  frequency  space,  corresponding  to  different  extreme 

off-axis  illumination  conditions. 
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3.  THE  EXPERIMENTAL  SETUP 


In  order  to  demonstrate  IIM,  we  built  an  experimental  setup.  As  an  illumination  source  we  use  a  He-Ne  laser  at  a  633-nm 
wavelength.  Beamsplitters  allow  for  an  on-axis  illumination  and  two  off-axis  illuminations,  in  the  x-  and  y-  directions, 
respectively,  and  two  reference  beams  for  the  two  extreme  off-axis  images.  The  angles  of  the  off-axis  illumination  and 
reference  beams,  their  polarizations  and  intensities  can  be  adjusted  individually.  The  shape  of  the  beam  is  Gaussian  and 
its  diameter  is  about  ~2  mm  compared  to  a  FOV  of  32x24  pm2  giving  essentially  uniform  illumination  across  the  field. 
After  the  mask,  the  diffracted  light  from  the  mask  is  captured  by  a  20x,  0.4  NA  microscope  objective.  Since  the  pupil 
plane  of  the  objective  is  not  accessible,  we  relay  out  a  conjugate  pupil  plane,  using  two  20-cm  focal  length  achromats  in 
a  confocal  configuration.  Also  we  use  an  additional  lOx,  0.1  NA  microscope  objective  in  order  to  magnify  the  image 
onto  to  our  CCD  sensor,  which  is  a  14”  diameter  sized  640x480  pixel  sensor  where  each  pixel  is  about  10x10  pm2. 

The  reference  beam  is  passed  through  a  second,  matched  20x,  0.4  NA  microscope  objective,  in  order  to  mode-match  the 
reference  beam  to  the  scattered  beams.  After  passing  through  this  objective,  the  beam  is  split  in  two,  one  for  the  x-  and  a 
second  for  the  y-direction  reference  beams.  They  are  reintroduced  in  the  first  image  plane  of  the  imaging  path  (after  the 
first  microscope  objective,  see  Fig.  3  using  two  beam  splitters  and  a  half- wave-plate  in  order  to  match  the  polarization  of 
the  vertical  beam.  The  illumination  angle  is  chosen  to  be  2sin_1(W4)  with  NA  =  0.4,  the  angles  of  the  reference  beam 
have  to  be  matched  to  2sin_1(W4)/20,  where  20  is  the  magnification  of  the  first  microscope  objective.  This  gives  illumi¬ 
nation  angles  of  53°  and  reference  beam  angles  of  2.3°.  In  order  to  control  the  relative  phase  between  the  diffracted  light 
from  the  mask  and  the  respective  reference  beam,  we  installed  piezo-actuators  on  two  of  the  steering  mirrors.  The  im¬ 
ages  were  acquired  with  an  8-bit  video  camera  and  an  analog  video  card  and  processed  using  LabView™. 


Figure  4:  Experimental  layout.  The  lower  part  of  the  figure  shows  the  five  input  beams,  1 -on-axis,  2-  and  3-  extreme  off-axis  in  y-  and 
x-,  respectively,  and  4-  and  5-  the  two  reference  beams  reintroduced  after  the  first  microscope  objective. 

The  illumination  angle  for  the  extreme  off-axis  illumination  beams  is  adjusted  by  using  the  back  reflection  of  the  mask 
with  the  steering  mirror  mounted  on  a  rotary  stage.  This  allows  angle  adjustments  of  about  0.5°  on  the  illumination  side. 
Observation  of  the  interference  frequency  between  the  on-axis  illumination  beam  and  each  of  the  reference  beams  is 
used  to  set  the  angles  for  the  reference  beams.  The  focus  of  the  images  is  mainly  controlled  by  the  distance  of  the 
mask/object  from  the  first  microscope  objective  and  is  checked  best  with  an  object  containing  as  many  spatial  frequen¬ 
cies  as  possible.  We  typically  use  one  of  our  Manhattan  structures.  The  phase  of  the  two  reference  beams  has  to  be  cali¬ 
brated  on  a  known  object;  here  again  we  use  one  of  our  Manhattan  structures.  The  intensities  of  the  different  beams  are 
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controlled  by  variable  density  filters  and  are  set  most  precisely  by  using  a  global  average  function  over  the  whole  CCD 
array. 


4.  EXPERIMENTAL  DATA 

After  all  these  parameters  are  adjusted  we  take  a  complete  set  of  images  as  described  above,  which  results  in  the  nine 
intensity  distributions  listed  in  Table  1,  where  the  numbering  of  the  beams  is  as  in  Fig.  3. 


Table  1.  Overview  of  the  different  image  acquisition  configurations 


Open 

Blocked 

Resulting  image 

Name 

1 

2, 3, 4, 5 

Conventional  on-axis 

flow 

1-0  order 

2, 3, 4, 5 

Dark-field  of  on-axis 

flow, dark 

1:  clear 

2, 3, 4, 5 

Background  on-axis 

Ilow,bgd 

2,5 

1,3,4 

High-frequency  H  (horizontal  structures) 

Ihigh,y 

2 

1,3, 4, 5 

Dark-field  of  H 

Ihigh,y,dark 

5 

1,2, 3, 4 

Reference  beam  for  H 

. f  high,  y,  ref . 

3,4 

1,2,5 

High-frequency  V  (vertical  structures) 

Ihigh,x 

3 

1,2, 4, 5 

Dark-field  of  V 

Ihigh,x,dark 

4 

1,2, 3, 5 

Reference  beam  for  V 

Ihigh,x,ref 

The  major  experimental  results  are  shown  in  Figs.  5  to  7  for  a  Manhattan  geometry  (x-y  lines)  object  with  a  dense  critical 
dimension  of  0.5  pm.  Figure  5  shows  the  sequence  of  images  used  to  obtain  the  low  frequency  image.  The  left  panel 
shows  the  as-observed  low  frequency  image,  the  middle  panels  show  the  dark-field  (not  normalized)  and  the  background 
obtained  by  a  slightly  shifted  mask  so  that  there  are  no  objects  in  the  FOV.  The  background  speckle  pattern  arises  from 
multiple  reflections  between  the  optical  components. 


CD  ~  0.5  jjiii 
NA  =  0.4 
A,  =  633nm 


linear 

low-frequency 

image: 


f!ow,lin 


Figure  5:  Low  frequency  image.  The  left  panel  is  the  measured  image.  The  middle  panels  are  the  measured  dark- field  (obtained  by 
blocking  the  zero-order  in  the  relayed  pupil  plane)  and  the  background  with  no  object.  The  interference  fringes  are  the  result  of 
speckle  from  multiple  reflections  among  the  optical  surfaces.  Subtracting  the  dark-field  and  background  images  from  the  measured 

image  obtain  the  final  low-frequency  image. 
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Figure  6  shows  a  similar  sequence  for  the  y-directed  extreme  off-axis  exposure.  Finally,  figure  7  shows  the  result  of 
combining  all  of  these  images  to  get  a  composite  image  with  all  of  the  spatial  frequencies  included.  The  reconstructed 
image  was  subject  to  a  linear  Fourier  filter  in  order  to  eliminate  the  spatial  frequencies  that  don’t  make  it  through  the 
system  and  therefore  must  be  noise. 


Figure  6:  High  frequency  images  for  the  y-directed  extreme  off-axis  case  (see  table  1). 


Figure  7:  Reconstruction  of  an  image  using  IIM.  The  left  panel  shows  the  on-axis,  low-frequency  image.  Note  that  the  individual 
lines  are  not  resolved.  The  two  middle  images  show  the  x-  and  y-  directed  high  frequency  images,  respectively.  Here  only  the 
high  frequency  edges  of  the  large  box  are  observed.  Finally,  the  right  panel  shows  the  combined,  reconstructed  image.  The  defect 
on  the  lowest  horizontal  line  is  a  true  representation  of  the  object  as  shown  below. 
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Also,  we  acquired  data  using  a  standard  white-light,  incoherent-illumination  microscope  with  a  lOOx  NA  0.9  objective. 
The  image  was  taken  in  reflection  so  the  distortion  on  the  bottom  line,  which  is  due  to  corrosion  of  the  mask  Cr  layer 
stands  out  brightly  relative  to  the  rest  of  the  image.  More  discussion  of  this  image  along  with  a  comparison  of  the  inten¬ 
sity  crosscuts  for  both  IIM  and  conventional  imaging  will  be  presented  below  after  the  Fourier  optics  model  is  discussed. 


If 


Figure  8:  White-light,  incoherent-illumination,  reflection  image  of  the  same  object  taken  with  a  lOOx,  NA  =  0.9 

microscope  objective. 

Also  we  obtained  an  image  from  a  phase  object,  in  this  case  a  simple  grating.  One  of  the  advantages  of  IIM  is  that  it  is 
sensitive  to  phase  objects  for  the  off-axis  exposures  since  the  zero-order  reference  beam  does  not  experience  the  phase 
shifts  that  the  diffracted  beam  does.  As  expected,  the  image  is  simply  a  grating,  since  one  diffracted  order  was  collected 
and  interfered  with  the  reference  beam  to  give  a  simple  grating.  More  work  on  phase-shift  masks  has  to  be  undertaken  to 
demonstrate  the  full  advantages  of  IIM. 


5.  SIMULATION 

In  order  to  gain  a  better  understanding  of  the  imaging  process  of  IIM  we  implemented  a  simple,  scalar  Fourier-optical 
simulation  of  the  optical  system.  Starting  from  the  Manhattan  structure  we  are  imaging,  we  compute  its  Fourier  trans¬ 
form.  Then  linear  filters  are  applied  specific  to  each  exposure.  After  application  of  these  filters  the  absolute  square  is 
taken  in  order  to  simulate  the  intensity  response  of  the  detector.  This  model  allows  us  to  simulate  the  low-  and  high-  fre¬ 
quency  images  and  their  dark  fields  for  different  aperture  sizes  and  off-axis  angles  in  both  directions.  Fig.  10  shows  the 
results. 
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Figure  10:  Fourier-optics  simulation  of  IIM.  Row  A)  Low-frequency  images,  RowB)  High-frequency  images  withy-offset  illumina¬ 
tion,  RowC)  High-frequency  images  withx-offset  illumination,  Col.l)  Raw  image  including  the  autocorrelation  terms,  Col.2)  Dark 
field  images  (autocorrelation),  Col.3)  linear  images  (after  subtraction  of  dark  field),  RowD)  Linear  addition  of  images  A,B,C  for  each 
column,  respectively,  leading  to  the  simulated  image  of  IIM  (D3)  that  should  be  compared  with  the  experimental  result.  Finally, 
rowE)  provides  images  in  the  respective  columns  assuming  coherent,  normal-incidence  illumination  with  a  full  aperture  of  NA=1.2, 

covering  the  same  spectral  range  as  the  IIM  experiment. 


6.  COMPARISON  OF  IIM,  STANDARD  MICROSCOPY  AND  THE  FOURIER  MODEL 


We  can  now  compare  these  simulations  with  the  experiment.  Figure  1 1  shows  the  simulations  and  the  experiment  for 
both  IIM  (NA  =  0.4)  and  conventional,  incoherent-illumination  imaging  (NA  =  0.9).  A  more  quantitative  comparison  can 
be  made  by  comparing  intensity  cross  sections  (taken  along  a  horizontal  line  that  cuts  across  the  large  square  and  all  of 
the  individual  fingers  as  shown  in  Fig.  12.  Note  that  the  high-frequency  contrast  (amplitude  of  the  intensity  swing  across 
the  fingers)  in  the  IIM  image  is  better  than  in  conventional  imaging.  This  is  a  result  of  the  effective  unity  transfer  func¬ 
tion  of  IIM  out  to  the  edges  of  its  frequency  coverage  compared  with  the  decreasing  contrast  of  incoherent  image  as  the 
spatial  frequency  is  increased.  This  effect  becomes  more  pronounced  as  the  frequency  content  of  the  image  is  increased. 


Figure  11:  Comparison  of  the  Fourier  optics  simulation  (top)  and  the  measurement  (bottom)  for  IIM  (NA  =  0.4,  left)  and  conventional 

incoherent-illumination  microscopy  (NA  =  0.9,  right). 


Figure  12:  Comparison  of  the  cross-sections  of  AT  =  0.4  IIM 
(A:  simulation,  C:  reconstructed  experiment)  and  the  NA  =  0.9  incoherent-illumination  microscope 

image  (B:  simulation,  D:  experiment). 
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7.  DISCUSSION 


These  initial  experiments  serve  to  illustrate  the  potential  of  IIM.  The  mask  test  structure,  at  a  dense  CD  of  0.5  pm,  is  just 
small  enough  to  demonstrate  the  possibilities  of  IIM,  the  dense  CD  resolution  limit  of  the  present  optical  arrangement  is 
~  260  nm.  This  is  illustrated  in  Fig.  13,  which  shows  the  frequency  coverage  of  the  present  experiment. 


Figure  13:  Frequency  coverage  of  present  experiment. 

The  large  circle  with  radius  2/A,  is  the  frequency  space  limit  imposed  by  optical  propagation.  The  small  circle  in  the  cen¬ 
ter  of  radius  NA/X  with  NA  =  0.4  is  the  frequency  space  coverage  of  the  normal  incidence  coherent  exposure.  The  first 
order  diffraction  of  the  fingers  is  located  at  the  black  dots  inside  the  offset  circles  that  correspond  to  the  two  extreme  off- 
axis  exposures  offset  by  2NA/X.  Smaller  structures  could  be  imaged  with  the  existing  system.  The  limit  is  to  have  the 
first  order  spectral  frequencies  just  at  the  extreme  edges  of  the  offset  exposures,  for  this  633-nm  wavelength  this  corre¬ 
sponds  to  a  CD  of  264  nm  (CD  -  X/6NA). 

The  dotted  circles  in  Fig.  13  indicate  that  additional  frequency  space  coverage  is  available.  Accessing  this  frequency 
coverage  requires  both  additional  off-axis  tilt,  and  tilt  of  the  mask  relative  to  the  optical  axis  of  the  microscope.  This  will 
introduce  astigmatism  and  require  a  much  more  sophisticated  optical  system.  Just  increasing  the  tilt  of  the  illumination 
beams  to  grazing  incidence  extends  the  resolution  of  this  system  to  a  CD  of  -226  nm  [~  XH(\+NA)\  compared  with  the 
ultimate  resolution  of  ~  A/4  -  158  nm.  These  minimum  CD’s  all  scale  with  wavelength. 

8.  CONCLUSIONS  AND  FUTURE  DIRECTIONS 

We  have  introduced  and  demonstrated  the  principle  of  imaging  interferometric  microscopy  (IIM).  The  reconstructed 
image  shows  the  potential  of  higher  contrast  at  high  spatial  frequencies  as  compared  to  standard  microscopy.  Also  we 
achieve  the  same  resolution,  with  higher  contrast  at  a  higher  magnification  factor  and  with  a  larger  FOV,  a  longer  WD, 
and  a  larger  DOF  using  the  IIM  approach.  We  used  a  20 x  NA  =  0.4  objective  with  coherent  illumination  and  compared  it 
with  a  100x  NA  =  0.9  objective  with  incoherent,  white-light  illumination.  The  FOV,  DOF  and  WD  of  the  lower-AT  ob¬ 
jective  were  retained  in  these  experiments,  but  at  a  resolution  comparable  to  that  of  the  higher  NA  objective.  Future  work 
will  explore  the  resolution  limits  as  well  as  explore  a  shift  to  shorter  wavelengths.  In  order  to  achieve  the  full  potential  of 
IIM,  significant  improvements  in  both  the  optics  and  the  electronics  are  necessary.  Work  is  underway  to  provide  a  more 
quantitative  assessment  of  IIM. 
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We  introduce  and  demonstrate  a  new  microscopy  concept:  imaging  interferometric  microscopy  (IIM),  which  is 
related  to  holography,  synthetic-aperture  imaging,  and  off-axis -dark-field  illumination  techniques.  IIM  is  a 
wavelength-division  multiplex  approach  to  image  formation  that  combines  multiple  images  covering  different 
spatial-frequency  regions  to  form  a  composite  image  with  a  resolution  much  greater  than  that  permitted  by 
the  same  optical  system  using  conventional  techniques.  This  new  type  of  microscopy  involves  both  off-axis 
coherent  illumination  and  reinjection  of  appropriate  zero-order  reference  beams.  Images  demonstrate  high 
resolution,  comparable  with  that  of  a  high-numerical-aperture  (NA)  objective,  while  they  retain  the  long  work¬ 
ing  distance,  the  large  depth  of  field,  and  the  large  field  of  view  of  a  low-NA  objective.  A  Fourier-optics 
model  of  IIM  is  in  good  agreement  with  the  experiment.  ©  2003  Optical  Society  of  America 
OCIS  codes :  180.0180,  180.3170,  110.0180,  100.2000,  090.2880,  100.6640. 


Microscopy  is  among  the  oldest  applications  of  optical 
science1  and  remains  one  of  the  most  widely  used 
optical  technologies.  In  recent  years  there  have  been 
several  efforts  to  exceed  the  diffraction  limit  of  optical 
microscopy.  Especially  in  fluorescence  microscopy, 
approaches  have  been  reported  that  permit  better 
axial  as  well  as  lateral  resolution.2  In  particular, 
these  techniques  take  advantage  of  a  larger  set  of 
angles  about  the  sample  to  collect  the  light3  and  explore 
different  illumination  schemes4  as  well  as  nonlinear 
processes5  such  as  multiphoton  absorption.6  These 
techniques  are  enhanced  further  by  growing  capabili¬ 
ties  in  computational  image  processing.7 

Improvements  in  microscopy  will  have  significant 
effects  in  many  fields.  Subwavelength  resolution 
imaging  has  been  of  intense  interest,  particularly 
for  lithographic  application  in  semiconductor  manu¬ 
facturing.8  The  concept  of  imaging  interferometric 
microscopy  (IIM)  grew  out  of  imaging  interferomet¬ 
ric  lithography,  a  recently  introduced  resolution- 
enhancement  technique.9 

Many  applications  present  conflicting  requirements 
for  high-resolution  imaging  over  large  fields  at  a  large 
working  distance  (WD)  from  the  specimen.  For  tra¬ 
ditional  optical  designs,  high-numerical-aperture  (NA) 
and  hence  high  resolution  are  associated  with  a  short 
working  distance,  a  small  field  of  view  (FOV),  and 
a  small  depth  of  focus  (DOF).  Mask  inspection  for 
the  semiconductor  industry  provides  one  example  of 
these  requirements.  The  dimensions  of  mask  features 
are  decreasing  apace  with  Moore’s  law  and  are  cur¬ 
rently  smaller  than  360  nm  (and  even  smaller  for  sub¬ 
resolution  assist  features),  assuming  a  4X  reduction 
lithographic  system.  High-speed  inspection  demands 
an  imaging  solution  in  place  of  the  traditional  serial, 
raster-scan  approach.  A  lithographic  mask  has  a  pro¬ 
tective  pellicle  mounted  several  millimeters  above  it; 
the  imaging  optics  must  stand  off  from  the  mask,  re¬ 
quiring  long-WD  optics.8 

IIM  approaches  this  problem  by  using  a  low-NA 
objective  that  inherently  provides  a  long  WD,  a  large 
DOF,  and  a  large  FOV.  To  overcome  the  resolution 

0146-9592/03/161424-03$15.00/0 


limits  associated  with  the  lower  optical  frequency 
cutoff  of  this  optical  system,  multiple  coherent  images 
with  different  off-axis  illumination  directions  are 
recorded.  For  illumination  directions  beyond  the  NA 
of  the  imaging  lens,  a  zero-order  illumination  beam 
is  reinjected  into  the  optical  system  on  the  low-NA 
side  of  the  lens.  Effectively,  the  off-axis  illumination 
downshifts  the  high-spatial-frequency  content  of  the 
image  such  that  the  image  can  pass  through  the  lens, 
and  interference  with  the  reinjected  reference  beam 
upshifts  the  image  content  back  to  the  proper  spatial 
frequencies.  The  result  is  a  spatial  wavelength- 
division  multiplex  approach  that  captures  all  the 
available  spatial-frequency  content.  Signal  process¬ 
ing  is  then  used  to  reconstruct  the  complete  image. 
IIM  is  related  to  synthetic-aperture  imaging,  in  which 
one  achieves  different  channels  by  sending  radar 
pulses  from  an  airplane  as  it  flies  over  the  target. 
Each  radar  subimage,  corresponding  to  a  dark-field 
image  with  a  relatively  low  NA  appropriate  to  the 
antenna  size,  is  stored,  and  the  full  image  is  recon¬ 
structed  by  combination  of  these  subimages.10’11 

In  IIM,  spatially  coherent  light  is  used  to  illuminate 
the  object.  For  normal  incidence  illumination,  only 
the  beams  diffracted  from  the  mask  that  correspond 
to  spatial  frequencies  of  <NA/A,  including  the  zero- 
order  transmitted  beam,  are  transmitted  through 
the  optical  system.  At  the  image  plane  the  inter¬ 
ference  between  the  diffracted  beams  and  the  zero- 
order  beam  reconstructs  a  band-limited  image.  If 
the  illumination  direction  is  shifted  off  axis,  the  lens 
accepts  higher  spatial  frequencies  in  the  direction 
opposite  the  offset  angle.  For  illumination  at  an 
angle  corresponding  to  2NA,  only  the  diffracted  beams 
that  correspond  to  a  circle  of  radius  NA/A  centered 
on  2NA/A  are  accepted  by  the  optical  system  (Fig.  1). 
If  these  diffracted  beams  alone  are  incident  upon  the 
detector,  the  result  is  a  dark-field  image  with  frequen¬ 
cies  up  to  2NA/A.  The  original  frequencies  can  be 
restored  by  the  introduction  of  a  reference  beam  at 
the  appropriate  angle  on  the  low-NA  side  of  the  lens, 
much  as  in  holography,  that  interferes  with  the  image 
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Fig.  1.  Frequency  space  coverage  of  three-exposure  IIM 
with  a  NA  of  0.4  and  of  conventional  incoherent  illumina¬ 
tion  imaging  with  a  NA  of  0.9.  The  experiment  is  illus¬ 
trated  by  the  figure  at  the  left  with  one  offset  illumination 
in  each  of  the  x  and  y  directions.  Additional  exposures  can 
enhance  the  resolution  in  the  x—y  direction,  as  indicated 
by  the  short-dashed  circles,  or  oriented  in  other  directions 
(e.g.,  45°  lines)  as  indicated  by  the  dashed-dotted  circles. 

diffraction  beams  to  reset  the  spatial  frequencies. 
The  square  law  (power)  detection  process  results  in 
recording  the  real  part  of  the  interference  between  the 
reference  beam  and  the  image  beams,  i.e.,  to  the  two 
x-offset  circles  shown  in  Fig.  1.  As  in  single-sideband 
communications,  the  full  spectrum  is  re-created  by 
the  nonlinearity  of  the  detection  process  even  though 
only  one  sideband  is  transmitted  through  the  system. 
This  process  is  repeated  twice,  once  offset  in  the  x 
direction  once  in  the  y  direction.  Since  the  images 
are  stored  electronically,  the  dark-field  images  can  be 
separately  measured  and  subtracted  from  the  off-axis 
images.  Combining  these  off-axis  images  along  with 
the  on-axis  image  reconstructs  a  high-resolution  image 
with  spatial  frequencies  extending  to  —3NA/A.12  As 
is  clear  from  Fig.  1,  additional  parts  of  frequency 
space  can  be  covered  with  additional  exposures. 
Pupil  plane  filters  can  be  used  to  ensure  that  every 
region  of  frequency  space  is  weighted  uniformly. 

For  illumination  we  use  a  633-nm  wavelength 
He-Ne  laser  source.  The  object  is  a  binary  chrome- 
on-glass  mask  with  dense  0.5-yu-m  critical  dimension 
features.  In  this  Letter  we  demonstrate  the  imaging 
of  a  Manhattan-geometry  structure  composed  of 
rectangles  oriented  along  the  orthogonal  x  and  y 
axes.  The  mask  pattern  consists  of  a  large  square 
(5  jmm  X  5  /im)  and  5  L-shaped  nested  lines  (0.5  jmm 
wide  on  1-jmm  centers)  with  various  lengths. 

For  imaging  we  use  a  20X,  0.4-NA  objective.  Fol¬ 
lowing  the  image  plane  of  this  lens,  a  pair  of  confocal 
achromatic  lenses  allows  access  to  the  image  Fourier 
plane.  The  relayed  image  is  further  magnified  with 
a  10  X,  0.25-NA  objective  and  detected  with  a  CCD 
camera.  The  overall  system  has  a  magnification  of 
200X,  and  the  640  X  480  pixels  on  the  CCD  array 
correspond  to  a  32  jmm  X  24  jmm  FOV.  The  same  ob¬ 
ject  on  the  mask  can  be  illuminated  from  three  direc¬ 
tions:  normal  incidence,  off  axis  from  the  x  direction, 
and  off  axis  from  the  y  direction.  The  off-axis  illumi¬ 
nation  angles  are  sin_1(2NA),  or  53.1°  from  the  optical 
axis.  In  this  case  the  off-axis  zero-order  transmitted 
beams  are  not  collected  by  the  objective  lens,  and  refer¬ 
ence  beams  are  reintroduced  at  the  image  plane  at  an 


angle  matched  to  sin_1(2NA/20)  =  2.3°  from  the  opti¬ 
cal  axis  (see  Fig.  2).  Piezo  drivers  are  used  to  adjust 
the  phases  of  the  reference  beams.  A  collimated  beam 
is  used  for  illumination,  with  the  result  that  the  zero 
order  is  divergent  in  the  first  image  plane  in  this  non- 
telecentric  optical  system.  Matched  20 X,  0.4-NA  ob¬ 
jectives  are  used  to  mode  match  the  reference  beams. 

The  schematic  experimental  setup  for  the  x-directed, 
high-frequency  image  is  shown  in  Fig.  2.  A  second 
pair  of  beams,  not  shown  here,  permits  illumination 
of  the  y-directed  high-frequency  image.  By  sequen¬ 
tially  blocking  two  of  the  three  illumination  beams 
and  the  respective  reference  beams,  we  record  differ¬ 
ent  parts  of  the  spatial-frequency  space  of  our  image, 
as  shown  in  Fig.  1.  Off-axis  illumination  without  any 
reference  beam  gives  us  the  dark-field  images.  Mov¬ 
ing  the  mask  to  a  clear  spot  away  from  any  chrome 
features  provides  the  background  for  the  low-frequency 
image,  and  transmitting  a  single  reference  beam  with¬ 
out  any  illumination  serves  as  the  reference  for  the 
high-frequency  images.  Using  an  8-bit  camera  and 
analog-to-digital  image-acquisition  board,  we  digitally 
recorded  the  images. 

Figures  3  and  4  show  the  experimental  results. 
Figure  3A  shows  the  on-axis  image,  which  contains 
only  the  low-frequency  mask  information.  The 
square  and  the  nested  letters  L  are  merged  into  a 
single  unresolved  structure.  Only  the  extended  lines 
are  visible.  The  background  of  the  structure  is  clear 
on  the  mask  but  appears  nonuniform  in  the  image. 

The  high-frequency  components  are  imaged  sepa¬ 
rately.  One  image  captures  the  high-frequency  com¬ 
ponents  of  the  horizontal  structures  (y  direction, 
Fig.  3B);  another,  the  high-frequency  components  of 
the  vertical  structures  (x  direction,  Fig.  3C).  The 
background  of  the  image  is  rather  homogeneous  but 
not  completely  uniform;  the  spaces  between  the  lines 
show  higher  intensity  than  the  background.  The  line 
structures  and  the  high-frequency  edges  of  the  square 
are  seen  in  their  respective  images. 

Electronic  addition  of  the  low-  and  high-frequency 
images  gives  the  reconstructed  image  (Fig.  3D).  A 
digital-image-processing  low-pass  Fourier  filter  was 
applied  after  the  reconstruction  simply  to  eliminate 
any  electronically  induced  noise  at  spatial  frequencies 


Fig.  2.  Schematic  of  the  high-frequency  imaging  setup. 
The  incoming  beam  is  split  into  illumination  beam  A  and 
reference  beam  B;  beam  B  is  reintroduced  to  the  image 
plane  at  the  CCD,  replacing  the  zero  order  that  did  not 
pass  through  the  optical  system  (dashed  lines).  The  three 
beams  impinging  upon  the  optical  system  represent  the 
range  of  image  spatial  frequencies  collected.  Beam  B  is 
mode  matched  with  a  second  microscope  objective  (M).  FP 
is  the  relayed  Fourier  plane. 
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Fig.  3.  A,  On-axis  image  after  dark-field  and  background 
subtraction  and  renormalization.  Only  the  low-frequency 
components  of  the  mask  are  imaged.  B,  High- 
frequency  image  of  horizontal  structures  after  dark-field 
and  reference  image  subtraction  and  renormalization. 
C,  High-frequency  image  of  vertical  structures  after 
dark-field  and  reference  image  subtraction  and  renor¬ 
malization.  D,  Reconstructed  image.  E,  Incoherent 
illumination;  white-light  image  from  a  100 X,  0.9-NA  lens. 


Fig.  4.  Comparison  of  the  cross  sections  of  three-exposure, 
0.4-NA  IIM  (A,  simulation;  C,  reconstructed  experiment) 
and  the  0.9-NA  incoherent  illumination  aerial  images 
(B,  simulation;  D,  experiment). 


outside  the  optical  system  capabilities.  The  individ¬ 
ual  lines  are  now  separated  from  the  square  and  have 
their  appropriate  lengths.  The  edges  of  the  lines  are 
clearly  visible.  Irregularities  are  evident  at  the  left 
side  of  the  square  as  well  at  the  lowest  horizontal 
line.  Finally,  Fig.  3E  shows  an  image  of  the  same 
structure  taken  in  reflection  with  a  standard  micro¬ 
scope  using  a  100X,  0.9-NA  objective  with  incoherent, 
white-light  illumination  (A  ~  500  nm).  The  edges  of 
the  structure  are  not  as  sharp  as  in  the  reconstructed 
image.  The  FOV  was  adjusted  to  compensate  for  the 
different  magnifications  employed  in  the  two  images. 
Some  corrosion  of  the  mask  metallization  is  evident 
on  the  lowest  horizontal  finger.  This  defect  was  also 
resolved  in  the  IIM  image. 


To  compare  the  pictures  more  quantitatively,  we 
show  the  cross  sections  of  (Fig.  4A)  a  Fourier-optics 
simulation  of  IIM  (NA  of  0.4), 13  (Fig.  4B)  an  in¬ 
coherently  illuminated  image  (NA  of  0.9)  and  the 
experimental  data,  (Fig.  4C)  the  reconstructed  IIM 
image,  and  (Fig.  4D)  the  conventional  microscope 
image.  The  larger  contrast  of  IIM  at  high  spatial 
frequencies  than  for  the  standard  image  can  be  seen. 
This  difference  is  related  to  the  low  modulation 
transfer  function  of  incoherent  imaging  at  high  spatial 
frequencies,  whereas  the  effective  modulation  transfer 
function  of  IIM  is  unity. 

We  have  introduced  and  demonstrated  the  principle 
of  imaging  interferometric  microscopy.  The  recon¬ 
structed  image  shows  the  potential  of  higher  contrast 
than  that  of  standard  microscopy  at  high  spatial 
frequencies.  Also,  we  achieved  the  same  resolution, 
with  higher  contrast  at  a  higher  magnification  factor 
and  with  a  larger  FOV,  a  longer  WD,  and  a  larger 
DOF,  by  using  the  IIM  approach.  We  used  a  20  X, 
0.4-NA  objective  with  coherent  illumination  and  com¬ 
pared  it  to  a  100X,  0.9-NA  objective  with  incoherent, 
white-light  illumination.  The  FOV,  the  DOF,  and  the 
WD  of  the  lower-NA  objective  were  retained  in  these 
experiments,  because  addition  of  the  reference  beam 
does  not  affect  any  of  these  aspects  of  the  low-NA 
image,  but  at  resolution  comparable  to  that  of  the 
higher-NA  objective.  The  mask  test  structure,  at 
a  critical  dimension  of  0.5  jmm,  is  just  small  enough 
to  demonstrate  the  possibilities  of  IIM.  The  dense 
critical  dimension  resolution  limit  of  the  present  opti¬ 
cal  arrangement  is  —0.26  juli n.  We  intend  to  explore 
these  resolution  limits. 
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Abstract 

Self-calibration  is  one  of  the  main  approaches  to  obtain  greater  accuracy  in  precision  engineering.  By 
measuring  a  rigid  artifact  placed  at  different  locations  and  orientations  on  a  stage,  it  is  possible  to  separate 
the  stage  error  map  from  artifact  errors  by  comparing  the  different  views.  However,  it  is  impossible  in 
practice  to  realize  an  exact  90  rotation  about  an  exact  pivot  point,  and  achieve  an  exactly  known 
translation  with  absolutely  no  rotation.  The  key  issue  is  to  decouple  stage  error  when  there  are  artifact 
errors,  alignment  errors,  and  measurement  noise.  Previous  approaches  are  to  isolate  the  alignment  error 
first  by  using  least  squares,  which  is  effective  when  the  measurement  noise  has  a  normal  distribution. 
Then,  polynomial  fitting,  or  Fourier  series,  or  other  orthogonal  function  decompositions  are  used  to 
decouple  the  stage  error  from  three  measurement  views,  view  0,  view  1(90°  rotation)  and  view  2  (one 
interval  translation).  It  is  difficult  to  realize  these  three  measurement  views  in  practice. 


We  present  a  testbed  physical  realization  of  an  X-Y  stage  for  nanolithography.  This  stage  is  an  air¬ 
bearing  dc  motor  driven  capstan  drive  system  being  retrofitted  with  a  NanoGrid  XY-grid-based  encoder 
for  feedback.  The  NanoGrid  greatly  reduces  possible  effects  from  air  turbulence  (as  might  be  seen  with 
laser  interferometers)  and  Abbe  offsets  from  using  separate  X  and  Y  scales.  However,  the  metrology  is 
based  on  an  artifact:  The  accuracy  of  the  grid  itself.  We  will  present  a  circle-closure  based  self¬ 
calibration  method  for  grid  and  stage  calibration. 

The  principle  of  circle  closure  (the  sum  of  the  angles  around  any  point  in  a  plane  equals  360)  is  for  self¬ 
calibration  of  the  rotation  angle  of  the  rigid  artifact  (e.g.  XY  encoder  grid).  When  the  mark  position  of 
rigid  plate  is  measured  in  turn  with  about  90  rotation,  difference  between  each  rotation  and  an  unknown 
reference  angle  x  can  be  measured  using  a  differential  measurement  (which  can  achieve  better  resolution). 
Circle  closure  provides  a  constraint  that  the  sum  of  all  the  rotations  must  be  360  .  This  provides  sufficient 
information  for  the  solution  for  all  unknown  rotation  angles.  We  only  need  to  decouple  the  computation 
of  the  pivot  point  position  in  the  self-calibration  algorithm.  The  mathematical  model  is  simpler,  and  may 
be  easier  to  implement. 

Redundant  measurements  may  be  used  to  improve  calibration  accuracy.  Different  combinations  from  the 
4  measurement  views  (0,  90°,  180°,  270°)  can  be  used  to  calibrate  the  stage  errors.  The  final  calibration  is 
calculated  by  using  least  square  methods.  The  mathematical  model  and  algorithm  with  and  without 
additive  measurement  noise  are  simulated  using  MATLAB. 


Keywords:  self-calibration,  nanolithography,  calibration  algorithm,  two-dimensional  stage 


Introduction 

Two-dimensional  XY  stages  are  used  in  many  precision  engineering  applications.  The  specific 
application  that  we  are  interested  in  is  nanolithography,  where  it  is  desirable  to  maintain  nm  to  sub-nm 
accuracies  over  a  2-dimensional  workspace  spanning  a  wafer  (e.g.  300  mm).  There  is  generally  a 

deviation  between  the  measured  stage  position  and  its  true  position;  this  is  called  the  stage  position 
measurement  error. 

The  stage  position  measurement  error  is  the  sum  of  random  measurement  noise  and  systematic 
measurement  error,  which  is  a  function  of  the  stage  position.  Calibration  is  the  procedure  used  to 
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determine  the  stage  error  map,  to  map  the  measured  stage  position  to  its  position  in  a  Cartesian  reference 
frame.  In  conventional  calibration  method,  two  steps  are  needed:  First,  measure  a  standard  artifact;  i.e,  an 
artifact  plate  that  has  marks  with  known  positions.  The  difference  between  the  measured  position  and  the 
known  position  of  the  marks  is  the  stage  error  at  that  mark  location.  Second,  set  up  a  mapping  function  (a 
stage  calibration  function)  between  the  measured  coordinates  and  the  actual  coordinates;  e.g.,  using  a 
piece-wise  linear  function  or  a  polynomial.  The  stage  calibration  function  is  an  approximation  to  the  stage 
error  map.  The  accuracy  depends  on  the  traceability  of  the  artifact.  Precision  stages  are  tested  against 
artifacts  which  have  been  certified  by  a  calibration  service  whose  “grand  master”  are  traceable  to  national 
measurement  or  standard  institutes  which  have  realized  and  transferred  the  base  units. 

In  our  nanlithography  application,  there  is  no  standard  artifact  available  with  the  desirable  nm  accuracy. 
In  fact,  we  are  unable  to  fabricate  an  artifact  with  mark  positions  known  better  than  the  metrology  system. 
Here  we  run  into  the  chicken-or-egg  problem:  How  do  you  construct  an  accurate  measurement  standard 
without  an  accurate  standard  to  calibrate  the  machine  that  would  make  it?  Therefore  we  need  to  do  self¬ 
calibration. 

Previous  work  on  self-calibration  of  x-y  stages 

Self-calibration  has  a  long  history,  and  the  advent  of  e-beam  mask  pattern  generators  has  increased 
interest  in  its  use.  Raugh  (1984)  developed  a  mathematical  theory  for  2-dimensional  self-calibration; 
however,  the  algorithm  presented  is  not  numerically  robust,  and  is  also  sensitive  to  random  measurement 
noise.  Ye,  Pease  et  al.  (1997)  developed  a  discrete  Fourier  transform-based  algorithm,  based  on  Raugh’s 
theory,  which  is  numerically  robust.  This  approach  extracts  the  stage  measurement  error  from  the  artifact 
mark  position  measurements  by  comparing  three  different  views  of  a  measurement  artifact  plate.  These 
three  views  are  an  initial  set  of  measurements,  a  set  of  measurements  with  the  artifact  plate  rotated  90 
degrees  with  respect  to  the  stage,  and  a  set  of  measurements  with  the  artifact  plate  translated  by  one  grid 
interval  with  respect  to  the  stage.  The  problem  now  is,  how  do  you  achieve  an  exact  90  degree  rotation, 
and  how  do  you  translate  the  artifact  plate  one  grid  interval,  without  adding  rotational  motion.  Finally, 
the  discrete  Fourier-transform  based  algorithm  is  complicated. 

Testbed  physical  realization  of  x-y  stage  for  nanolithography 

Our  interest  is  to  develop  a  stage  for  imaging  interferometric  lithography  (ILL,  Brueck,  1998).  This 
technique  can  help  optical  lithography  approach  the  fundamental  linear  systems  limit  of  optics  of  X/2. 
Any  lithographic  technique  will  require  more  accurate  and  precise  stage  positioning  (and  dimensional 
stability)  to  enable  registration  across  multiple  levels.  Hence,  the  motivation  to  investigate  self¬ 
calibration.  Current  metrology  for  precision  and  ultraprecision  stages  is  based  on  heterodyne  laser 
interferometers;  with  a  HeNe  laser  at  632.8  nm  wavelength,  interpolation  of  210  to  211  (2^/1024  to  A72048) 
is  available  for  two-pass  interferometers.  This  corresponds  to  resolutions  of  0.6  to  0.3  nm.  Unfortunately, 
the  refractive  index  of  air  varies  with  C02  and  water-vapor  content,  and  air  turbulence.  This  is  the 
primary  limitation  on  the  accuracy  with  operation  in  air — while  the  resolution  of  the  interferometers  is 
nm,  the  actual  accuracy  may  only  be  sub-pm.  Other  metrology  systems  have  been  proposed  to  reduce  the 
errors  due  to  air,  such  as  holographic  encoders  and  two-color  interferometers;  neither  technology  has  yet 
been  proven  accurate.  For  metrology,  we  are  using  the  OPTRA  NanoGrid  XY  encoder  system,  which 
offers  nm-level  resolution,  but  with  accuracy  at  the  submicron  level.  The  objective  of  the  self-calibration 
research  is  to  develop  algorithms  to  improve  the  accuracy  of  the  encoder  to  its  resolution — from 
submicron  to  nanometer  accuracy,  and  to  implement  these  in  a  lithographic  stage.  We  have  obtained, 
courtesy  of  Sematech,  an  air-bearing,  capstan  drive  interferometric  lithography  stepper,  with  HP  laser 
interferometers  for  feedback  (shown  in  Figure  1).  This  stepper  will  serve  as  a  testbed  for  the  stage 
development,  which  will  primarily  investigate  self-calibration  and  thermal  compensation  methods. 


This  stage  is  configured  as  a  stacked- 
slide  X-Y  stage,  although  the  moving 
stage  itself  is  supported  by  air  bearings 
on  the  granite  surface  plate.  The  wafer 
chuck  is  mounted  on  a  piezo-driven 
flexure  stage.  The  existing  laser 
interferometer  will  be  retained  for 
calibration  purposes,  although  its  basic 
resolution  is  5  nm  (e.g.  coarser  than  the 
resolution  of  the  NanoGrid  metrology). 

The  X-Y  encoder  system  resolution  is 
0.3  nm;  it  is  insensitive  to  variations  in 
the  index  of  refraction  of  air — but  its 
basic  accuracy  is  on  the  order  of  1  pm, 
since  its  accuracy  is  based  on  the 
absolute  accuracy  of  the  encoder  artifact 
plate,  whereas  the  laser  interferometer 
accuracy  is  fundamentally  dependent  on 
the  wavelength  of  the  laser.  We  will  mount  the  NanoGrid  system  on  the  stage,  with  the  encoder  plate 
parallel  to,  and  in  the  same  plane  as  the  wafer  chuck.  The  encoder  plate  will  move  with  the  wafer  chuck, 
while  the  readheads  are  stationary.  This  will  minimize  Abbe  offsets.  At  the  same  time,  since  there  is  a 
large  planar  area  available  around  the  wafer  chuck,  multiple  encoder  plates  and  readheads  may  be 
installed  in  order  to  obtain  redundant  measurements. 

Stage  error  function  and  artifact  plate  error  function 

The  stage  error  function  S(x,y)  is  the  systematic  measurement  error  at  (x,y),  where  (x,y)  is  the  true 
location  of  the  sample  site  in  a  Cartesian  grid.  Assume  the  field  to  be  calibrated  is  LxL.  The  origin  of  the 
x-y  axis  is  chosen  at  the  center  of  this  area.  So, 

S  (x,  y)  =  Sx  (x,  y)ex  +  Sy  (x,  y)ev 

where  ex  and  ey  are  the  unit  vectors  of  the  stage  axis.  Sx(x,y)  and  Sy(x,y)  are  functions  on  the  continous 
2-D  field  LxL.  Let  the  sample  sites  be  in  a  NxN  square  array  covering  the  field  LxL 
(For  simplicity,  let  N  be  odd).  The  sample  site  locations  in  the  Cartesian  grid  are: 
xm  =  mA 

yn  =  nA 

where  m=  -  (N-l)/2-  (N-2)/2,  ...  ,0  ,...,  (N-2)/2  ,(N-l)/2,  n=  -(N-l)/2,-(N-2)/2,  ...  ,0  ,...,  (N-2)/2, 
(N-l)/2  and  A  =  (L/  N) ,  which  is  called  the  sample  site  interval.  So  the  discrete  stage  error  function  can 
be  expressed  as 

S  (Xm  >yn)  =  SX  ( Xm  ,  yn  K  +  Sy  (Xm  ,  yn  K 

The  artifact  plate  has  a  square  (NxN)  mark  array  of  the  same  size  as  the  stage  sample  site  array.  The 
origin  of  the  plate  x-y  axis  is  located  at  the  center  on  the  mark  array.  The  nominal  mark  locations  in  the 
plate  coordinate  system  are  the  same  as  the  sample  site  locations  in  the  stage  coordinate  system.  The  plate 
is  rigid;  i.e,  the  relative  positions  between  the  marks  on  the  plate  do  not  change  when  we  rotate  it  on  the 
stage.  However  it  is  not  perfect,  which  means  that  each  actual  mark  at  (xm,yn)  deviates  from  its  nominal 
location  by  A(xm,yn). 

A(x,n  ,y„)  =  Ax  (xm ,  yn  )ex  +  Ay  (xm ,  yn  )ey 

It  should  be  noted  that  every  mark  on  the  artifact  plate  has  an  identification  number  (m,n).  The  mark’s 
identification  number  does  not  change  during  the  plate’s  motions  on  the  stage. 


Decoupling  the  alignment  error  based  on  angle  calibration  using  circle  closure 

Circle  closure  is  used  to  measure  the  rotation  angle.  The  XY  grid  is  placed  on  a  rotation  table  which  is 
mounted  on  the  stage;  a  schematic  is  shown  in  Figure  2.  There  are  two  sensor  heads  above  the  XY  grid 
which  are  used  to  get  translation  information  in  each  measurement  view.  We  can  select  two  points  on  the 
four  sides  of  xy  grid  plate  to  determine  the  direction  of  each  side.  The  direction  change  of  each  side  can 
be  measured  by  an  autocollimator  or  other  high  resolution  sensor.  According  to  circle  closure,  the  sum  of 
4  inner  angles  is  equal  to  360  degrees.  The  principle  of  circle  closure  is  a  natural  conservation  law  for 
plane  angles  in  Euclidean  geometry.  Denoting  the  deviations  from  the  nominal  value  by  A9i.  (i=0, 1,2,3) 
and  the  actual  rotation  angle  9i  .We  have 


Sensor  Head  Rotati 

\  Vj 

ion 

A 

/  ► 

(  1 

_ 

T\ 

\V 

6 

i  0O 

i 

i  / 

p 

0i° 

APT) 

\ 

1 

1  i 

o 

_ r 

JkU 

@2  o 

/ 

Position  sensor  - 

XY  Nano 

Figure  2.  Decoupling  alignm 
calibration  via  circle  closure. 

At/ 

„  .  .  Piezoelectric  Actuator 

Grin 

ent  errors  based  on  angle 
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0,  =  90°  +  A0, 

A6>,  =x-90°  +a, 

j^AS,  =4(x-90")  +  i>, 
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With  the  constraint  of  the  circle  closure 

3  1  3 

^  A6j  =  9 ,  we  have  x  =  99°  4 - 


i= 0 


i= 0 


With  x,  we  can  calculate  the  actual 
rotation  angle  9i  with 


e  =9o°-iyv 

1  4  ^  1 


+  a, 


i= 0 


We  only  need  to  decouple  the  computation  of  the  pivot  point  position  in  the  self-calibration  algorithm.  In 
this  way,  only  two  90°-rotation  views  are  needed  to  exactly  calibrate  the  stage  errors  at  those  sites 
sampled  by  the  mark  array.  The  mathematical  model  is  simpler,  and  may  be  easier  to  implement. 

Summary 

The  mathematical  model  and  algorithm  with  and  without  additive  measurement  noise  are  simulated  using 
MATLAB;  the  results  of  the  simulation  are  shown  in  the  poster.  The  algorithm  gives  an  exact  self¬ 
calibration  on  the  discrete  sample  sites.  When  there  is  random  measurement  noise,  different  combinations 
from  different  rotations  and  two  sensor  head  (4  rotation  measurement  views  and  4  translation 
measurement  views)  are  used  to  calibrate  the  stage  errors.  The  final  calibration  is  calculated  by  using  least 
square  methods. 

References 

Brueck,  S.  R.  J.,  “Imaging  Interferometric  Lithography,”  Microlithography  World ,  Winter  1998. 

Evans,  C.  J.,  Hocken,  R.  J.,  and  Estler,  W.  T.,  “Self-Calibration:  reversal,  redundancy,  error  separation, 
and  ‘absolute  testing,’  ”  CIRP  Annals,  Vol45(2),  1996. 

Raugh,  M.R.  “Absolute  two-dimensional  sub-micron  metrology  for  electron  beam  lithography”  Prec  Eng 
vol.  7(1),  p.  3-13,  1985. 

Ye,  J.,  Takac,  M.,  Berglund,  C.  N.,  Owen,  G.,  and  Pease,  R.  F.,  “An  exact  algorithm  for  self-calibration  of 
two-dimensional  precision  metrology  stages,”  Prec  Eng  vol.  20(1),  p.  16-32,  1997. 

Carlson,  D.,  Dorval,  R.  K. ,  Gargas,  J.  A.,  Hercher,  M.,  Mansur,  D.,  and  Tran,  H.  D.,  “A  Versatile  XY 
Stage  with  a  Flexural  Six-Degree-of-Freedom  Fine  Positioner,”  Proc.  10th  ASP £  Annual  Meeting ,  p.  203- 
206,  October  1995. 


MULTIPOINT  TEMPERATURE  CONTROL  USING  THERMOELECTRIC 

MODULES 


Jian  Ouvang*,  Phuong-Thao  Ton-Nu+,  and  Hy.  D.  Tran* § 

*  Dept,  of  Mechanical  Engineering  +  Intel  Corporation 

University  of  New  Mexico  Rio  Rancho,  NM  87124 

Albuquerque,  NM  87131 

Abstract 


Temperature  control  is  a  critical  issue  in  precision  engineering,  especially  for  metrology  structures  and 
metrology  frames.  While  oil  shower  and  air  showers  have  been  shown  to  achieve  millikelvin  temperature 
stability,  some  applications  do  not  permit  oil  shower  or  air  shower  systems  (e.g.  clean  room 
environments).  We  demonstrate  that  by  adding  active  temperature  control  to  a  structure,  using 
thermoelectric  devices  for  cooling  or  heating,  we  can  maintain  a  structure  at  a  more  uniform  temperature 
despite  environmental  temperature  fluctuations.  While  it  is  not  possible  to  maintain  completely  uniform 
temperature,  it  is  possible  to  maintain  temperature  at  selected  points  in  the  structure,  where  the  sensors 
and  actuators  are  located.  Our  objective  is  to  demonstrate  millikelvin  temperature  stability  despite  non- 
uniform  external  thermal  disturbances.  Some  of  the  control  issues  that  arise  are  cross  coupling  between 
different  sensors  and  actuators,  and  effect  of  bias  and  drift  of  sensors  (especially  at  millikelvin 
resolution).  The  analysis  is  based  on  a  state  variable  finite  difference  thermal  model.  While  state-space 
based  control  design  may  better  handle  cross  coupling  in  the  dynamics  of  the  thermal  system,  simpler  PID 
type  controllers  are  easier  to  implement  and  test.  The  major  advantages  with  active  temperature  control 
are  that  by  separating  the  functions  of  temperature  control  from  structural  stability,  one  can  choose 
materials  for  metrology  structures  based  on  structural  criteria,  independent  of  thermal  properties.  We  will 
show  initial  experiments  using  a  4x4  array  of  thermoelectric  coolers  to  control  the  temperature  of  an 
aluminum  plate  despite  environmental  disturbances.  The  results  and  control  methods  can  be  extended  to 
large  structures,  such  as  metrology  frames  in  lithography  steppers.  The  advantages  of  this  approach  are 
not  only  better  temperature  control,  but  also  superior  dimensional  control  using  inexpensive  materials, 
and  also  more  rapid  thermal  equilibration  during  equipment  startup  and  shutdown. 


Introduction 


Thermally  induced  errors  in 
precision  mechanical  structures  are 
a  major  part  of  the  error  budget  in 
precision  mechanical  systems, 
especially  with  lithography 
applications.  With  the  increasing 
demand  for  improved  minimum 
critical  dimension  (CD)  in  semi¬ 
conductor  fabrication,  the  desire  to 
maintain  a  dimensional  stability  of 
10%  of  the  CD  (currently  0.18  pm) 
becomes  increasingly  challenging. 


Air  Convection 
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//// 


Radiation  from  surroundings  &  from 
environmental  tool  operation 


Conduction  from  connected  body 


Fig.  1  Environmental  Thermal  Disturbances 


In  spite  of  the  manufacturers’  effort,  there  are  always  some  residual  thermal  displacements  that  are  related 
to  coefficient  of  thermal  expansion  (CTE)  of  the  materials  of  the  structure,  which  is  subject  to  the 
fluctuations  of  the  surroundings.  Therefore,  it  is  desirable  to  maintain  structures  at  a  uniform  temperature, 
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regardless  of  environment.  As  an  example,  consider  a  100mm  square  structure  with  a  maximum  lOnm  of 
dimensional  strain  allowable  to  thermal  effects.  If  the  structure  is  made  of  super- invar  (with  a  CTE  of 
0.63x10"6/K),  it  is  necessary  to  maintain  its  temperature  at  160mK;  however  if  the  structure  is  made  of 
aluminum  (CTE  of  22.5xlO"6/K),  it  is  necessary  to  maintain  the  temperature  should  be  kept  within  5mK. 
However,  aluminum  has  better  specific  stiffness,  and  is  therefore  more  desirable  mechanically.  Active 
thermal  control  using  a  feedback  control  system  emerges  as  a  logical  and  practical  solution. 

Current  techniques  in  the  large  equipment  is  to  use  air  or  oil  shower  to  maintain  the  temperature;  oil 
showers  are  not  desirable  in  a  clean  room  environment.  Commercially  available  thermoelectric  coolers 
(TEC)  exploiting  the  Peltier  effect  and  acting  like  solid-state  heat  pump  with  no  moving  parts  are  a  very 
nearly  ideal  solution  for  small  temperature  ranges.  With  TECs,  we  can  control  structure  temperature 
slightly  above  and  below  ambient  temperature  just  by  controlling  current  to  the  TEC;  this  allows  isolation 
of  a  structure  from  possibly  turbulent  heat  exchangers.  We  will  describe  the  modeling  of  the  system  (4x4 
TECs),  and  a  proof-of-concept  experiment  and  simulations  for  multipoint  temperature  control. 

Heat  transfer  calculation 

Consider  the  calculation  shown  in  Figure  2.  L=0.1m;  v=60ft/min 

p=0.07361bm/ft3; 

Air  flow  @  ji=1.24xl0'5lbm/ft.s 

60ft/min,  25°C  Surface  temperature  T=20°C  Cp-0.24Btu/lbm  F  (at  77  F) 

m  r  k=0 .0151  Btu/hr .  ft.  °F 

Aluminum  plate 
(area  lOOmmx  100mm) 


Fig.  2  Heat  Transfer  Calculation  Setup 


The  heat  convection  coefficient  of  the  surrounding  air 

For  the  design  purpose,  take  average  value  for  Nusselt  number 


h  =  - 


N„  x  k 


N. 


uL 


=  0.664  xRe1/2xPr1/3 


Equation  (2)  is  applied  when  Reynolds  number  Re  <  2  x  105 ,  where  Re  =  Lvp /  u  =  1947 
We  can  also  calculate  Prandtl  number  Pr  as 


(1) 

(2) 

(3) 

(4) 


Pr  =  juCp/k  =  0.70951 

From  (2),  (3)  and  (4),  we  can  obtain  NuL  =  26.132  and  h  =  1.203  Btu/hr.°F. fit2  =  7.22  W/m2K.  With  a  total 
area  of  0.01m2,  we  have  Q  =  hxAxAT  =  0.361  W.  The  air  velocity  of  60  ft/min  is  typical  in  laminar  flow 
clean  rooms  (between  60  and  100  ft/min  at  the  face  of  the  HEPA  filters).  The  average  cooling  power  of  a 
Ferrotec-USA  module  TE63 02/065/0 18A  is  5W,  so  an  array  of  these  modules  provides  more  than 
sufficient  cooling  or  heating  power. 


Experimental  setup 


Figure  3  shows  a  schematic  concept  for  a 
temperature  control  experiment. 
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The  temperature  range  we  need  to  control  is 
not  large  (±5  K),  however  the  requirement 
for  temperature  precision  is  at  millikelvin 
level.  We  choose  thermistors  for  their  high 
sensitivity  and  long  term  stability. 
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Fig.  3  Experimental  block  diagram 


Fig.  4  Test  Structure  Assembly  View  Fig.  5  Photograph  of  Test  Structure 

The  test  structure  is  an  aluminum  plate,  shown  in  Figure  4  and  Figure  5. 

Finite  Difference  Model 


Ta  =  25  °C 


1 ,  1 ,2/^T  13  ll4 


k=  177  W/m.K 
p  =  2770  kg/m3 
Cp  =  875  J/kg.K 
L  =  31.25  mm 
1  =  6.154  mm 
Atec  =  135.52  mm2 
7i  =  0.02937  Volt/K 


Fig.  6  Finite  Difference  Model 


The  test  structure  is  virtually  cut  into  4x4  parts  as 
figure  6  shows  and  the  sixteen  elements  are 
controlled  by  individual  TECs  and  the  temperature 
is  measured  individually  by  thermistors.  However 
there  exists  thermal  coupling  between  these 
elements.  Coupling  between  diagonal  elements 
(e.g.  [1,1]  and  [2,2])  is  smaller  than  between 
adjacent  elements,  and  is  omitted  in  the  model.  The 
effects  of  bolting  holes  are  also  omitted  in  the 
model.  Since  the  system  involves  both  conduction 
and  surface  convection  effects,  it  is  necessary  to 

determine  the  Biot  number  that  provides  a  measure  of  the  temperature  drop  in  the  solid  relative  to  the 
temperature  difference  between  the  surface  and  the  fluid. 

Bi  =  hL/k  =  0.00334  <0.1  (5) 

Hence  the  assumption  of  a  uniform  temperature  distribution  for  each  element  is  reasonable. 

There  are  three  types  of  elements  for  this  model:  inner  ([2,2],  [2,3],  [3,2],  [3,3]);  corner  ([1,1],  [1,4],  [4,1], 
[4,4]);  side  ([1,2],  [1,3],  [2,1],  [3,1],  [4,2],  [4,3],  [2,4],  [3,4]). 

•  Inner  Nodes 

Take  [2,2]  as  an  example.  By  conservation  of  energy,  we  can  obtain  the  following  equation 


^2,2  “ 


pL2Cl 


_(C,2  +72,1  +72,3  +73,2  -472,2)-- 
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(2  Lz-Atec)(T2a-Ta)- 
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1  2,2 


pLllCp  ’  pLllCp 

•  Corner  Nodes 

Take  [2,2]  as  an  example.  By  conservation  of  energy,  we  can  obtain  the  following  equation 


Tu  = 


pL1Cl 
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•  Side  Nodes 

Take  [1,2]  as  an  example.  By  conservation  of  energy,  we  can  obtain  the  following  equation 
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pl}Cl 
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In  equation  (6),  (7)  and  (8),  the  k-term  is  related  with  the  heat  conduction  between  different  elements,  h- 
term  heat  convection  from  the  surroundings,  7i-term  heat  drawn  by  TEC.  Here  we  omit  the  resistance  and 
conductance  effects  of  TEC  to  simplify  the  problem  and  assume  that  heat  sinks  attached  to  TECs  work 
perfectly.  It  is  also  assumed  that  the  temperature-related  coefficients  k,  h,  Cp  and  n  are  constant  since  the 
temperature  range  is  within  ±5  K. 


Plugging  in  the  value  of  parameters,  we  obtain  the  state-space  equations  of  the  system  as  follows. 


T  =  (ru 
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Where  E4  is  a  4x4  identity  matrix,  0  is  a  4x4  zero  matrix,  E16  is  a  16x16  identity  matrix,  Ta  is  the 
temperature  of  the  surrounding  air.  This  system  is  linear  and  controllable. 


Summary 

The  results  of  the  simulations  and  experiments  will  be  presented  at  the  conference. 
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Abstract: 

In  most  precision  engineering  applications,  minimization  of  thermal  distortions  is  done 
by  the  use  of  either  active  control  using  temperature  controlled  air  or  liquid  showers, 
artificial  heaters  or  ultra  low  expansion  materials  such  as  Invar,  Zerodur.  This  paper 
explores  the  use  of  heat  pipes  at  low  heat  loads  (0.1  W  -  5  W)  in  precision  engineering 
applications  where  the  requirement  of  stable  and  near  uniform  temperature  is  stringent. 
Experiments  are  performed  on  a  commercial  heat  pipe  and  its  performance  characteristics 
such  as  effective  thermal  conductivity,  transient  response  and  temperature  gradient  at 
steady  state  are  determined. 

Introduction: 

The  process  of  miniaturization  of  existing  technologies  to  the  nano  and  micro  level  with 
applications  in  medical  sciences,  aerospace,  communications,  automobiles,  electronics 
have  put  a  tremendous  demand  for  higher  accuracy  on  the  manufacturing  processes  and 
their  concerning  metrology.  Of  all  the  sources  of  inaccuracy  in  a  precision  machine  tool, 
thermally  induced  distortions  are  the  most  prominent  source  of  error  and  fortunately  they 
are  not  random  errors.  These  distortions  are  worsened  by  the  increase  in  automation  in 
precision  machine  tools  and  instruments  since  the  gauging  system  is  now  thermally 
connected  to  the  heat  sources  of  the  machine  tool  and  also  its  dimensions  are  large 
compared  to  the  manual  gauging.  It  is  a  well-known  fact  that  mitigating  the  thermal 
errors  in  machine  tools  results  in  increase  in  accuracy  of  the  process  by  an  order  of 
magnitude  and  also  improves  the  repeatability  of  the  process.  These  effects  can  be 
reduced  in  3  ways:  (a)  Compensation  (b)  Temperature  controlled  coolant  shower  (c) 
Passive  cooling  using  heat  pipes,  heat  sinks. 

Compensation  requires  a  thermal  deformation  model  of  the  machine  tool  to  determine  the 
thermal  displacements  in  real  time  and  to  account  for  these  displacements  in  the  process. 
Unfortunately  most  of  the  models  that  have  been  developed  are  either  very  complex  and 
slow  or  very  simple  and  inaccurate  and  do  not  represent  the  full  range  of  the  operating 
conditions.  Also  it  is  very  difficult  to  compensate  for  the  bending  mode  of  deformation 
by  using  the  position  control  system  of  machine  tool. 

Temperature  controlled  coolant  shower  method  is  widely  used  in  machine  tools  since  it  is 
very  simple  to  implement  and  has  fast  response.  It  has  the  added  benefit  of  keeping  the 
work  area  clean  of  chips,  dust  and  other  extraneous  materials,  which  act  as  heat  source  or 
corrosion  agent.  Depending  on  the  amount  of  heat  removal  and  on  the  requirement  of  dry 
work  area,  the  coolant  can  be  air  or  fluid.  Liquid  shower  has  high  heat  capacity  and 
therefore  remove  heat  under  near  isothermal  conditions  thus  minimizing  thermal 
distortions.  On  the  other  hand,  air  shower  is  better  in  reducing  the  effect  of  external  heat 
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sources  on  the  machine  tool.  It  has  been  found  out  that  in  the  machine  tools,  the  process 
of  pumping  the  liquid  coolant  alone  generates  a  significant  amount  of  heat  and  acoustic 
vibration.  Bryan  et  al  [1]  reports  that  for  the  production  of  mirror  finishes  a  structural 
motion  of  less  than  2  p  inches  is  required  in  the  machine  tool. 

Passive  cooling  by  heat  pipes  and  heat  sinks  are  effective  at  high  operating  temperatures 
and  high  heat  flux.  This  paper  explores  the  use  of  heat  pipes  as  a  means  of  passive 
cooling  at  low  power  inputs  encountered  in  the  lithography  process  and  other  precision 
engineering  applications  to  maintain  stable,  very  low  temperature  gradients  across  the 
surface.  The  effective  thermal  conductivity  of  heat  pipes  and  the  transient  response  of  the 
heat  pipe  are  determined  for  different  heat  inputs  ranging  from  0.1  W  to  5  W  at  the 
operating  temperatures  of  0  C  and  17  C  respectively. 

Experimental  Apparatus: 

A  4  mm  diameter  circular  copper  water  heat 
pipe  of  length  125  mm  from  Noron  Products 
with  thermistors  along  its  length  is  insulated 
as  shown  as  in  the  Fig  1.  A  5W  flexible 
kapton  heater  (Watlow)  is  wrapped  around 
the  heat  pipe  at  one  end.  The  other  end  of 
the  heat  pipe  is  inserted  into  the  condenser. 

A  thermocouple  is  glued  17  mm  from  the 
end  on  the  condenser  section  of  the  heat 
pipe.  This  entire  set  up  is  then  placed  in  a 
vacuum  chamber  to  further  reduce  the  heat 
loss  through  convection  and  radiation  to  the  surrounding  environment.  The  lengths  of  the 
condenser  and  the  evaporator  sections  are  57  mm  and  43  mm  respectively.  The 
temperature  of  the  condenser  was  allowed  to  vary  +  0.5  deg  C  in  the  case  of  freezing 
water  (0  deg  C)  and  ±  0.5  deg  C  in  the  case  of  the  tap  water  (17  deg  C)  to  simulate  the 
real  life  conditions. 

Results  &  Discussion: 

The  effective  thermal  conductivity,  transient  response,  steady  state  temperature 
distribution  of  the  heat  pipe  at  0  deg  C  and  17  deg  C  respectively  are  plotted  in  figures  2- 
7.  The  effective  thermal  conductivity  of  the  heat  pipe  was  found  to  be  equal  to  that  of 
copper  at  0.4  W  and  0.1  W  of  power  input  at  0  deg  C  and  17  deg  C  respectively.  The 
heat  pipe  time  constant  was  found  to  be  ranging  from  60  to  70  secs  for  both  the  cases  thus 
indicating  that  the  transient  response  was  independent  of  the  operating  temperature  of  the 
heat  pipe  at  low  power  inputs.  The  temperature  gradient  along  the  adiabatic  section  of  the 
heat  pipe  at  steady  state  was  found  to  be  varying  linearly  from  -0.068  deg  C  /mm  to 
-0.28  deg  C/mm  at  0  deg  C  and  from  -0.03  deg  C/mm  to  -0.26  deg  C/mm  at  17  deg  C 
between  power  inputs  0.1  W  to  5  W  respectively.  This  shows  that  the  temperature 
gradient  is  not  very  sensitive  to  the  operating  temperature  of  the  heat  pipe. 

The  conductivity  of  OFHC  copper  is  approximately  380  W/m-K.  With  heat  flux  levels  as 
low  as  1  W,  the  effective  thermal  conductivity  of  a  heat  pipe  is  several  times  that  of  a 
solid  conductor. 


Fig  2.  Effective  conductivity  at  0  deg  C. 


Fig  3  Thermal  conductivity  at  17  deg  C 
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Fig  4  Temperature  distribution  at  0  deg  C 


Fig  5  Temperature  distribution  at  17  deg  C 


Transient  response  @  0  deg  C 
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Fig  6  Transient  response  at  0  deg  C 


Fig  7  Transient  response  at  17  deg  C 


Conclusion: 

The  effective  thermal  conductivity,  transient  response  and  temperature  gradient  of  the 
heat  pipe  at  low  power  inputs  were  evaluated.  From  the  results  we  conclude  that  heat 
pipes  respond  quite  well  in  transporting  randomly  varying  low  heat  loads  which  occur 
either  due  to  friction  between  components  in  motion  or  due  to  concentrated  heat  input  in 
case  of  the  lithography  process  at  stable  temperature  gradients.  In  the  future,  this  would 
open  up  the  possibility  of  manufacturing  precision  equipments  by  using  a  structure 
combining  a  structure  with  desirable  mechanical  properties  with  heat  pipes  and  active 
thermal  control  to  minimize  thermally  induced  strain. 
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Abstract 

Two-dimensional  precision  stages  for  semiconductor  lithography  require  sub-nm-level 
accuracy.  Current  ultraprecision  stage  metrology  based  on  heterodyne  laser  interferometers  with 
a  632.8-nm  HeNe  laser  and  interpolation  to  A/2048  provide  resolution  of  0.3  nm.  Unfortunately, 
the  refractive  index  of  air  varies  with  CO2  and  water  vapor  content,  as  well  as  with  air 
turbulence.  Without  expensive  atmospheric  compensation,  the  actual  accuracy  may  only  be  sub- 
pm  for  operation  in  air. 

In  our  approach  two  scale  based  metrology  system  (OPTRA  NanoGrid)  are  used.  The 
NanoGrid  greatly  reduces  possible  effects  from  air  turbulence  and  Abbe  offsets  from  using 
separate  X  and  Y  scales.  However,  the  metrology  is  based  on  an  physical  artifact,  and  the 
accuracy  of  the  metrology  is  limited  to  the  accuracy  of  the  grid  artifact  itself:  about  1pm. 
Physical  scales  may  provide  greater  precision,  but  not  always  greater  accuracy.  Real-time  self¬ 
calibration  is  implemented  to  improve  the  accuracy  of  stage  to  the  level  limited  by  repeatability 
of  stage. 

Fourier  transform  is  used  to  decouple  the  distortion  of  artifact  metrology  and  stage 
metrology  at  sampling  points  from  3  measurement  view.  Based  on  feedback  principle  high 
accuracy  translation  and  rotation  is  implemented  to  minimize  misalignment  error.  Square 
reversal  is  used  to  calculate  rotation  error  8y.  Iterative  algorithm  is  developed  to  isolate  the 
rotation  error  8y  in  self-calibration. 

Keywords:  Self-Calibration  , Nanolithography,  two-dimensional  stage,  scale  based  metrology 


1. Introduction 

The  key  issues  in  self-calibration  is  how  to  solve  measurement  equations.  In  other  words, 
isolate  the  scale  and  stage  distortion  from  measurement  equations  when  there  are  misalignment 
errors  during  translation  and  rotation.  In  a  previous  approach,  Ye,  Pease  et  al.  (1997)  developed  a 
discrete  Fourier  transform-based  algorithm  by  approximating  A(x,y)  and  S(x,y)  with  Fourier 
series,  which  is  numerically  robust  to  measurement  noise.  The  alignment  error  is  isolated  in  the 
statistical  meaning  by  using  least  square  method  under  several  assumption,  which  is  effective 
when  the  measurement  noise  has  a  normal  distribution  and  required  space  sample  interval  is  big 
enough.  Second,  translation  and  rotation  error  will  affect  the  sample  position.  This  will  be 
problematic  when  space  sample  interval  is  small  e.g  sub- pm.  Third,  because  the  x  and  y 
translation  and  rotation  values  of  the  stage  point  set  can  not  be  given  by  the  algorithm  or  known 
before  the  self-calibration,  it  is  difficult  to  find  the  origin  and  the  axis  orientation  which  meet  the 
assumptions.  Thus  the  stage  error  map  obtained  from  this  method  has  an  unknown  coordinate 
system,  which  might  not  be  directly  related  to  the  machine  coordinate  system. 
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2.Testbed  physical  realization  of  an  X-Y  stage  for  Nanolithography 

The  testbed  is  an  air-bearing  , capstan  drive  interferometric  lithography  stepper,  with  HP  laser 
interferometers  for  feedback.  This  stage  is  configured  as  a  stacked-slide  X-Y  stage.  The  moving 
stage  is  supported  by  air  bearings  on  the  granite  surface  plate.  Two  scale  based  metrology 
systems  are  mounted  on  the  same  plane  of  moving  stage  to  implement  real-time  self-calibration. 
The  configuration  in  the  plane  is  shown  in  Figure  1. 


Figure  1  Configuration  of  2-D  real-time  self-calibration 


Scale  based  metrology  system  is  a  2-D  ultra-high  resolution  diffraction  interferometer.  It  can 
measure  X  and  Y  plane  motion  simultaneously  with  0.3  nm  resolution  and  to  detect  small  angle 
motion  around  Z  axis. 

One  of  the  main  error  source  in  nanolithography  come  from  the  disturbance  of  guideway. 
The  translation  error  of  guideway  in  X  and  Y  direction  will  be  canceled  by  feedback  of  accurate 
stage  metrology.  The  disturbance  from  the  rotation  error  of  guideway  around  Z  axis  will  be 
minimized  based  on  Abbe  principle  in  X  direction.  For  1  nm  accuracy,  compare  with  traditional 
configuration,  the  tolerance  of  rotation  error  will  increase  from  1  prad  to  80  prad.  In  Y  direction 
the  rotation  error  of  guideway  around  Z  axis  will  be  measured  by  two  scale  based  metrology 
system  and  compensated  with  software.  So  the  accuracy  of  nanolithography  mainly  depends  on 
the  accuracy  of  metrology. 

3.  Mathematical  model  and  Self-Calibration  Algorithm 

3.1  Decouple  2-D  grating  distortion  when  there  is  no  alignment  error 

Two  scale  based  metrology  system  are  used  to  generate  measurement  view.  One  is  stage 
metrology  .The  other  is  artifact  metrology.  The  distortion  s(x,y)  of  stage  grating  and  distortion 
a(x,y)  of  artifact  grating  at  sampling  points  (xm,yn)  can  be  expressed  as 

=  +Sy(xm>y*)ey 

a(.Xm>y*)  =  ax(Xm>y«)ex+ay(Xm>y*)ey 

We  need  3  measurement  views  to  solve  s(xm,yn)  and  a(xm,yn).  In  View  0  or  home  view,  two 
metrology  system  is  aligned  with  the  air  bearing  of  the  stage  by  minimizing  the  angle  between 
the  moving  direction  of  air  bearing  and  X-axis  of  the  metrology.  s(xm,yn)  and  a(xm,yn)  is 
compared  at  sampling  points,  so 

°<X  >  yn )  =  v  (*«  >  y„ )  -  ax  (• xm  >  yn )  (^ ) 

°<X  >  y« )  =  s,  <X  >  y« )  -  ay  (xm > y„ )  (16) 

In  View  1,  the  sensor  head  is  translated  from  View  0  by  one  sample  site  interval  relative  to  the 
artifact  grating,  yield  the  measurement  equations 


V  Kxm  ,y„)  =  sx(xm,yn)-ax  (xm+1 ,  yn )  (2  a) 

vy  Kxm  ,yn)  =  sy(xm,yn)~  ay  (xm+l ,  yn )  (2b) 

Based  on  translation  property  of  2-D  discrete  Fourier  transform  to  solve  Ax(u,v)  from 
equation  (1)  and  (2) 


Ax(u,v)  = 
where 
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Equation  (3)  only  gives  solution  at  u^O.  Based  on  the  rotation  property  of  2-D  Fourier  transform 
Ax(0,v)  will  be  solved  from  equation  (5)  in  View2  where  artifact  grating  is  counterclockwise 
rotated  90°  relative  to  View  0. 


K  2  (*» » y« )  =  (*« ,  yn )  -  ROT90(ay  (xm ,  yn  ))  (5a) 

Vy  2(xm  ,yn)  =  sy(xm,yn)-  ROT90(ax  (xm ,  yn ))  (5b) 

where  ROT90  denotes  “90°  rotation.” 

3.2  Minimize  the  misalignment  error  based  on  feedback  and  iterative  algorithm 

The  sensor  head  is  mounted  on  6  degree  kinematics  stage.  The  direction  and  location  of 
sensor  head  relative  to  the  2-D  grating  are  determined  by  the  6  balls  on  the  kinematics  stage.  A 
picomotor  is  used  to  drive  the  relative  movement  of  sensor  head  relative  to  the  2-D  grating  with 
30  nm  resolution  .  The  sensor  head  is  aligned  with  the  2-D  grating  stage  by  minimizing  the  angle 
between  the  moving  direction  of  6  degree  kinematics  stage  and  X-axis  of  the  metrology.  The 
relative  translation  between  sensor  head  and  2-D  grating  will  be  measured  by  2-D  metrology 
system  itself.  The  feedback  is  used  to  guarantee  required  one  sample  site  interval  translation 
accuracy  determined  by  sampling  requirement. 

A  2-D  grating  is  mounted  on  the  bearing  and  rotation  can  be  driven  by  a  picomotor  with 
30"  resolution.  Square  reversal  is  used  to  measure  the  90°  rotation  error  Ay  which  will  be  reduced 
by  feedback  to  the  level  limited  by  the  picomotor  resolution. 

The  accuracy  to  implement  translation  and  rotation  depends  on  the  accuracy  of  2-D  grating. 
The  translation  error  can  be  minimized  by  implement  1  -D  self-calibration  in  X  axis  first. 

The  2-D  self-calibration  error  due  to  90°  rotation  error  Ay  can  be  compensated  if  Ay  is 
accurately  measured  when  Ay  is  tiny.  From  experiment  and  computer  simulation  the 
measurement  accuracy  of  Ay  is  insensitive  to  the  accuracy  of  metrology  which  means  error 
transfer  coefficient  is  less  than  1.  This  will  guarantee  that  an  iterative  algorithm  will  be 
convergent.  So  an  iterative  algorithm  is  developed  to  minimize  disturbance  from  measurement 
accuracy  of  rotation  error  5y. 


4.  Experiment  results  and  summary 

Real-time  calibration  was  implemented  on  the  Nanolithography  testbed.  Dynamic  data 
sampling  methodology  was  developed  to  minimizes  effects  of  quasi-static  disturbance  from 
measurement  environment  change,  such  as  temperature.  The  repeatability  of  scale  based 
metrology  is  about  8.5  nm  (see  Fig  2).  The  initial  calibration  result  is  shown  in  Fig  3.  The 
distortion  of  2-D  Grating  is  about  1  pm  .  Real  time  self-calibration  will  increase  the  accuracy  of 
scale  based  metrology  from  1  pm  to  the  level  limited  by  the  measurement  noise. 
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A  Fuzzy  Logic  Based  Adaptive  Feedforward  PI  Controller  for 
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Abstract 

A  technology  driver  in  high  precision  manufacturing  is  a  position  measurement  and 
control  system  which  allows  manufacturers  to  control  the  movement  of  their  equipment  at 
extremely  high  levels  of  precision.  Performances  of  controllers  play  important  roles  in  nano¬ 
meter  level  positioning.  A  robust,  disturbance  resistant  controller  is  necessary  for  nanometer 
positioning.  However,  the  time  varying  and  nonlinear  properties  of  plants  frequently  result  in  the 
performance  degradation  of  widely  used  PID  controllers. 

The  performance  of  feedback  system  can  be  improved  considerably  by  combining  PI 
feedback  control  with  feedforward  control  together.  Feedback  controllers  are  very  capable  of 
compensating  disturbances  because  feedback  controllers  are  basically  error  driven.  However,  they 
often  suffer  from  a  trade-off  between  high  performance  and  robust  stability.  With  good  dynamic 
property  knowledge  available,  a  feedforward  controller  may  be  able  to  prevent  control  errors, 
because  feedforward  controller  output  is  based  on  the  reference,  instead  of  the  error  signal. 
Feedforward  controllers  improve  the  control  performance  by  including  a  feedforward  term  to  the 
PI  output  so  that  the  controller  can  react  to  the  command  more  quickly  in  order  to  bring  the  plant 
to  the  desired  setpoint.  A  fuzzy  logic  controller  will  adjust  the  feedforward  and  PI  feedback  gains 
(Kffv  ,Kffa ,  Kffdj  KP  and  Ki )  to  adapt  the  change  of  dynamic  behavior  caused  by  the  time  varying 
and  nonlinear  properties  inherent  in  the  positioning  systems. 

The  developed  control  algorithm  is  simulated  in  MATLAB,  and  is  implemented  with  a 
TMS320M67  DSP  in  an  experimental  nanolithography  stage.  A  X-Y  grating  based  metrology  is 
used  to  measure  the  position  of  stage,  which  is  more  robust  to  environmental  change  and  more 
suitable  with  high  performance  servomechanisms.  Experiments  show  that  stage  will  follow  10  nm 
step  input  with  nm  level  steady  state  error,  which  is  mainly  caused  by  floor  and  acoustic 
vibrations. 

1.  Introduction 

A  robust  and  disturbance  resistant 
controller  is  necessary  for  nanometer 
positioning.  However,  the  time  varying 
and  nonlinear  properties  inherent  in  the 
system  dynamics  frequently  result  in 
degradation  of  PID  controller 
performance.  For  example,  air-bearing 
supported  capstan  drives  are  one  of 
several  systems  used  in  precision 
positioning  stages  (such  as  the  one 
presented  here).  They  exhibit  very  low 
friction  and  no  backlash.  However, 
capstan  friction  results  in  non  linearity. 
Open  loop  testing  indicates  that  the 
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frequency  responses  of  in  macro  (1  |im  maximum  amplitude  )  and  micro  (50  nm)  range  motion 
are  different. 

Fixed  gain  PID  controllers  are  not  robust  enough  and  further  improvement  is  needed.  Rao 
and  Ro  stated  that  state  feedback  controllers  had  better  robustness  and  disturbance  rejection 
capability  than  PID  controllers  do.  Ro  and  Hubbel  adopted  Model  reference  adaptive  control 
(MRAC)  theory  in  controlling  a  stage.  As  a  result,  separate  mathematical  models  and  MRACs 
had  to  be  developed  for  micro  and  macro  modes  according  to  the  scale  of  the  movement. 
Although  this  dual-model  MRAC  did  improve  the  consistency  between  micro  and  macro  modes 
as  opposed  to  a  single  model  MRAC  ,  the  problem  of  deciding  the  proper  threshold  for  switch 
from  micro  to  macro  mode  would  make  a  dual  model  MRAC  controller  unsuitable  for,  say,  a  500 
nm  movement. 

2  Design  of  Fuzzy  Logic  Based  Adaptive  Feedforward  PI  Controller 

A  block  diagram  of  an  analog  PI  controller  with  fuzzy  logic  based  adaptive  feedforward 
loop  is  shown  in  Fig  2.  The  PI  feedback  controller  is  designed  for  robustness  and  accuracy.  The 
integrator  increases  the  type  number  of  the  system,  allowing  it  to  track 


r 


Fig  3  The  structure  of  fuzzy  logic  based  adaptive  feedforward  PI  controller 


with  no  (or  limited)  error.  In  other  words,  the  integrator  will  eliminate  steady  state  error  to 
guarantee  nm  level  control  accuracy.  However,  implementing  an  I-controller  by  itself  will  cause 
instability,  and  it  must  be  combined  with  a  proportional  action  to  stabilize  the  system.  On  the 
other  hand,  the  PI  controller  does  not  help  to  speed  up  the  system  response.  Feedforward 
controllers  improve  the  control  performance  by  including  a  feedforward  term  to  the  PI  output  so 
that  the  controller  can  react  to  the  command  more  quickly,  in  order  to  bring  the  plant  to  the 
desired  setpoint.  Note  that  this  function  is  outside  the  feedback  control  loop,  and  thus  does  not 
affect  the  system’s  stability.  Feedforward  helps  to  minimize  following  error  during  motion,  by 
generating  most  of  the  DAC  output  signal  from  the  motion  profile  rather  than  the  position  error. 

The  performance  of  feedforward  and  feedback  controller  depends  on  the  dynamic 
behavior  of  plant.  Also,  feedforward  controllers  may  require  large  control  signals  which  are 
beyond  the  permissible  maximum  output  of  the  drive.  A  fuzzy  logic  controller  will  adjust  the 


feedforword  and  feedback  gain  (deciding  the  proper  threshold  for  switching  from  micro  to  macro 
mode)  according  to  the  reference  gain  to  deal  with  those  issues. 

3  Simulation  and  Experiment  Setup 

The  proposed  control  algorithm  is  simulated  in  MATLAB.  Simulation  results  show  that 
the  fuzzy  logic  adaptive  controller  will  adapt  to  change  of  dynamic  behavior  caused  by  the  time 
varying  and  nonlinear  properties  inherent  in  the  positioning  system  dynamics.  The  testbed  for 
optical  Nanolithography  which  is  used  to  investigate  the  result  of  adaptive  feedforward  controller 
(shown  in  Figure  1)  is  an  interferometric  lithography  stepper,  featuring  an  air-bearing  stage,  with 
capstan  drive,  with  HP  laser  interferometers  for  feedback.  This  stage  is  configured  as  a  stacked- 
slide  X-Y  stage.  The  moving  stage  is  supported  by  air  bearings  on  the  granite  surface  table. 

An  XY  grid-based  planar  encoder  system  is  used  to  measure  2-dimensional  ultra-precise 
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Fig3  Comparing  results  for  repeatability  testing  of  metrology  with  air  bearing  turn  on 
(upper)  and  air  bearing  turn  off  (bottom) 


planar  displacements  of  stage,  which  avoids  the  turbulence  effects  which  are  commonly 
encountered  with  laser  interferometers  or  the  Abbe  errors  associated  with  separate  linear  scales. 
The  2-D  grid  measurement  resolution  is  0.3nm  The  update  rate  for  this  metrology  is  375  kHz, 
which  is  suitable  for  high  performance  servo-mechanisms. 

The  control  algorithm  is  implemented  digitally  with  TMS320C67  digital  signal  processor 
which  is  mounted  on  an  Innovative  Integration  M67  card.  This  is  a  32-bit  standard  full-size  PCI 
card.  The  digital  position  outputs  of  the  X-Y  grating  based  metrology  are  accessed  through  the 
DSP  card  parallel  digital  I/O  connector,  which  is  faster  than  through  the  PC’s  bus.  The  sampling 
period  is  10ps.  The  16  bit  D/A  output  of  the  DSP  is  amplified  and  then  drives  the  DC  motor. 

4  Results  And  Future  Work 


The  measurement  noise  is  shown  in  Fig  3.  When  the  stage  is  not  floated  (resting  on 
granite  plate),  the  amplitude  of  measurement  noise  is  about  2.5  nm.  Comparatively,  the  amplitude 
of  measurement  noise  is  about  8.5  nm  when  stage  is  floated.  This  may  be  because  the  air-bearing 
is  more  sensitive  to  the  vibration  disturbance  from  floor  or  the  air  pressure  in  the  air  bearing 
changes  with  time.  The  controller  is  tested  with  different  size  of  step  input.  The  stage  will  follow 
10  ,20  ,50  ,500  and  5000  nm  step  input  with  the  steady  state  error  around  nano-meter  level  after 
filtering  out  the  vibration  disturbance  (Shown  in  Fig  4). 


Step  Response  from  lOnm  to  5000nm 
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Fig  4  Step  response  of  fuzzy  logic  based  adaptive  feedforward  PI  controller 


The  PI  feedback  controller  ensures  the  control  accuracy  of  stage  motion.  Adaptive 
feedforward  controller  improve  the  transient  control  performance  considerably  when  there  are  the 
changes  of  dynamic  behavior  caused  by  the  time  varying  and  nonlinear  properties  inherent  in  the 
positioning  systems. 

In  our  current  approach,  the  gain  of  feedforward  controller  is  determined  by  trial  and 
error  based  on  experiments.  One  possible  approach  in  the  future  is  learning  feed-forward  (LFF) 
controllers.  LFF  may  be  able  to  prevent  control  errors,  because  its  output  is  based  on  the 
reference,  instead  of  the  error  signal.  They  are  not  necessarily  based  on  a  physical  process  model 
and  are  potentially  able  to  learn  and  reproduce  an  ‘arbitrary’  continuous  function  to  any  desired 
degree  of  accuracy,  even  if  it  concerns  non-linear  and/or  time- variant  functions. 
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The  concept  behind  immersion  lithography  is  the  insertion  of  a  high  refractive  index  liquid  in  the 
space  between  the  final  projection  lens  of  an  exposure  system  and  the  device  wafer  to  improve  the 
overall  resolution  of  the  exposure  process.  Computational  fluid  dynamics  (CFD)  simulations  were 
performed  in  order  to  investigate  the  process  of  initially  filling  the  lens-wafer  gap  with  immersion 
fluid.  The  CFD  models  were  used  to  investigate  the  effects  of  dispense  velocity,  gap  height,  and  fluid 
dispense  angle  on  the  fill  process;  specifically  on  the  possibility  of  air  entrainment.  The  simulations 
revealed  that  there  is  an  optimal  region  in  the  parameter  space  of  gap  height  and  dispense  velocity 
for  which  the  gap  fills  completely.  Outside  of  this  region,  either  excessive  inertial  or  surface  tension 
forces  cause  an  undesirable,  incomplete  filling  process.  The  optimal  region  was  found  to  shift 
somewhat  based  on  the  fluid  dispense  angle.  Finally,  experiments  were  performed  to  verify  the  CFD 
models.  The  CFD  simulations  and  the  experimental  results  were  in  good  agreement,  both 
qualitatively  with  regard  to  the  shape  and  evolution  of  the  free  surface  and  quantitatively  with  regard 
to  the  velocity  of  the  contact  line.  ©  2004  American  Vacuum  Society.  [DOI:  10.1116/1.1824065] 


I.  INTRODUCTION 

Immersion  lithography  has  been  proposed  as  a  method  for 
extending  optical  lithography  resolution  to  45  nm  and 
below.1  The  premise  behind  immersion  lithography  is  to  im¬ 
prove  resolution  by  increasing  the  index  of  refraction  in  the 
space  between  the  final  projection  lens  of  an  exposure  sys¬ 
tem  and  the  device  wafer  by  inserting  a  high  index  liquid  in 
place  of  the  low  index  air  that  currently  fills  the  gap.  The 
liquid  in  the  gap  is  an  optical  element  and  therefore  must 
have  a  uniform  index  of  refraction.  To  this  end,  the  gap  must 
be  completely  filled  with  liquid  (water  for  193-nm  lithogra¬ 
phy  with  other  fluids  being  evaluated  for  157-nm 
lithography2)  and  air  bubbles  cannot  be  tolerated. 

In  order  for  immersion  lithography  to  be  practical,  the 
introduction  of  liquid  into  the  wafer-lens  gap  must  not  ad¬ 
versely  impact  the  overall  manufacturing  process.  Since  the 
immersion  fluid  acts  as  an  optical  component  during  the 
lithographic  process,  it  must  retain  a  high  and  uniform  opti¬ 
cal  quality.  It  is  critical  that  the  fluid  management  system 
reliably  and  rapidly  fill  the  entire  lens- wafer  gap,3  maintain 
the  fill  under  the  lens  throughout  the  entire  exposure  process, 
and  ensure  that  no  bubbles  are  entrained  during  filling  or 
scanning.  Optimization  of  the  fluid  dispensing  process  is  es¬ 
sential  to  the  success  of  immersion  lithography;  therefore, 
two-  and  three-dimensional  (2D  and  3D)  computational  fluid 
dynamics  (CFD)  models  have  been  developed  to  simulate 
the  filling  process.  Several  previous  studies  have  used  CFD 
simulations  to  investigate  potential  problems  that  may  arise 
when  the  immersion  lithography  process  is  operating  in  a 
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quasisteady  manner;  these  problems  include  fluid  heating4,5 
and  air  entrainment  due  to  free  surface  flows  over  wafer 
topology.6  The  simulations  presented  in  this  work  parametri¬ 
cally  study  the  filling  process  that  must  occur  prior  to  initia¬ 
tion  of  immersion  exposures.  The  parameters  investigated 
include  the  fluid  dispense  velocity  and  angle  as  well  as  the 
gap  height  between  the  lens  and  the  wafer. 

An  optimal  region  in  the  parameter  space  of  fluid  velocity 
and  gap  height  is  identified.  The  region  is  delineated  on  one 
side  by  a  critical  Reynolds  number,  above  which  the  dis¬ 
pense  jet  impinges  on  the  wafer  forming  a  thin  him  that  does 
not  completely  fill  the  gap.  On  the  other  side,  the  region  is 
delineated  by  a  critical  Weber  number,  below  which  a  me¬ 
niscus  is  formed  from  the  dispense  port  and  crawls  along  the 
lens.  The  subsequent  rupture  of  the  meniscus  upon  its  even¬ 
tual  contact  with  the  wafer  is  likely  to  entrain  unacceptable 
air  bubbles.  The  fluid  dispense  angle  has  an  effect  on  the 
location  of  this  optimal  region. 

In  order  to  verify  some  aspects  of  the  CFD  simulations, 
the  model  predictions  were  compared  with  an  experiment  in 
which  water  was  used  to  fill  a  gap  that  replicates  the  lens-to- 
wafer  gap  in  an  immersion  lithography  tool.  There  was  good 
agreement  between  the  CFD  and  experimental  results,  both 
qualitatively  in  terms  of  the  shape  of  the  free  surface  and  its 
progression  across  the  gap  and  quantitatively  in  terms  of  the 
velocity  of  the  filling  process. 

The  use  of  2D  CFD  models  for  the  parametric  studies  and 
3D  CFD  models  for  experimental  verification  was  required 
by  practical  constraints.  It  was  important  to  verify  that  the 
CFD  models  were  predictive  and  therefore  some  experimen¬ 
tal  comparison  was  required;  however,  the  experimental  ap¬ 
paratus  involved  3D  flow  and  therefore  a  3D  CFD  model 


3454  J.  Vac.  Sci.  Technol.  B  22(6),  Nov/Dec  2004  0734-21 1  X/2004/22(6)/3454/5/$1 9.00  ©2004  American  Vacuum  Society  3454 


3455 


Abdo  et  ai:  Optimizing  the  fluid  dispensing  process 


3455 


Dispense  velocity 


Fig.  L  (a)  Schematic  of  the  dispensing  process.  The 
behaviors  are  delineated  in  (b)  which  shows  the  fluid 
dispense  velocity  as  a  function  of  gap  height  with  three 
characteristic  regions  identified.  The  effects  of  dispense 
velocity  and  gap  height  on  the  fill  process  are  shown  in 
(c)  for  the  behavior  just  after  the  meniscus  breaks  for  a 
low  dispense  velocity  (note  trapped  air  bubble),  in  (d) 
with  the  thin,  high  velocity  fluid  film  associated  with  a 
high  dispense  velocity,  and  in  (e)  for  the  complete  and 
rapid  filling  process  that  occurs  when  an  acceptable  dis¬ 
pense  velocity  is  used. 


was  necessary  to  yield  quantitative  agreement.  Unfortu¬ 
nately,  the  use  of  a  3D  CFD  model  to  carry  out  the  paramet¬ 
ric  studies  would  require  a  prohibitive  amount  of  computa¬ 
tional  time.  Therefore  2D  CFD  models  were  used  for  the 
parametric  studies  of  the  underlying  physics.  The  correlation 
between  the  2D  and  3D  results  has  been  observed  experi¬ 
mentally;  for  example,  it  is  possible  to  experimentally  ob¬ 
serve  the  transition  between  impinging  jet  and  optimal  dis¬ 
pense  at  a  constant  gap  height  by  increasing  the  velocity 
using  the  experimental  test  facility  described  in  this  paper.  In 
other  testing,  the  transition  between  meniscus  crawl  and  op¬ 
timal  dispense  behavior  has  been  observed  when  the  dis¬ 
pense  velocity  is  reduced.  While  the  experimentally  observed 
transition  velocities  are  not  exactly  equal  to  those  shown  in 
the  paper,  they  are  nominally  consistent  with  the  critical  We¬ 
ber  number  and  Reynolds  number  identified  by  the  2D  CFD 
model. 

II.  CFD  MODELING  RESULTS 

Figure  1(a)  shows  a  schematic  of  the  filling  process  used 
in  the  two-dimensional  CFD  simulations.  Immersion  fluid 
(water  for  these  simulations)  was  dispensed  with  a  uniform 
velocity  from  a  port  adjacent  to  the  lens  surface.  The  lens 
and  dispense  port  are  positioned  at  the  same  distance  from  a 
wafer  which  is  stationary  for  the  simulations  and  experi¬ 
ments  described  in  this  article.  The  progress  of  the  filling 
process  is  monitored  after  the  initiation  of  the  fluid  dispense. 

The  CFD  simulation  results  showed  that  the  fluid  dispense 
process  is  sensitive  to  a  number  of  parameters  including  the 
fluid  dispense  velocity,  the  wafer-to-lens  gap  height,  and  the 
fluid  dispense  angle.  Three  different  gap  heights  (i.e.,  0.3 
mm,  0.6  mm,  and  0.9  mm)  were  examined  in  a  parametric 
study;  for  each  height,  the  fluid  dispense  velocity  was  varied 
from  0.0  to  3.0  m/s.  The  parametric  studies  showed  that  the 
dispense  process  behavior  falls  into  different  regimes  de¬ 


pending  on  these  process  variables,  as  illustrated  in  Fig.  1(b). 
The  simulation  was  conducted  with  the  three  gap  heights 
only;  the  connecting  line  shown  in  the  graph  is  qualitative,  to 
illustrate  the  trend  of  the  simulation  results. 

At  low  fluid  dispense  velocity  the  liquid  forms  a  hanging 
droplet  that  does  not  immediately  touch  the  wafer  surface. 
As  more  liquid  is  added  to  the  droplet  it  tends  to  crawl  along 
the  hydrophilic  lens  surface;  eventually  the  meniscus  touches 
the  wafer  and  at  this  point  it  breaks  very  rapidly,  as  shown  in 
Fig.  1(c).  This  behavior  is  undesirable  because  the  sudden 
rupture  of  the  meniscus  is  unpredictable  and  may  entrain 
macroscopic  air  bubbles  that  can  be  clearly  identified  in  Fig. 
1(c).  This  air  cannot  escape  from  the  meniscus  to  wafer  gap 
during  the  rupture  process  and  therefore  forms  small  bubbles 
under  the  influence  of  surface  tension  forces.  At  each  gap 
height,  a  series  of  simulations  at  progressively  lower  dis¬ 
pense  velocities  was  used  to  delineate  the  transition  from  a 
completely  filled  gap  to  this  meniscus  crawl  behavior,  as 
shown  in  Fig.  1(b).  The  specification  of  this  transition  is 
somewhat  qualitative  but  is  based  on  the  extent  to  which  the 
meniscus  has  progressed  in  the  direction  of  the  gap  at  the 
time  when  it  touches  the  wafer. 

At  very  low  fluid  dispense  velocities,  the  inertia  of  the 
dispensed  fluid  is  insufficient  to  overcome  surface  tension 
and  break  the  meniscus.  Thus  it  has  the  chance  to  crawl 
along  the  lens.  The  transition  into  this  behavior  is  therefore 
governed  by  a  balance  between  fluid  inertia  and  surface  ten¬ 
sion  which  constitutes  the  Weber  number  (We): 


a 

where  p  and  cr  are  the  density  and  surface  tension  of  the 
fluid,  D  is  the  length  scale  associated  with  the  dispense  port, 
and  v  is  the  dispense  velocity.  Note  that  the  Weber  number  as 
defined  in  this  way  is  independent  of  gap  height  which  is 
consistent  with  the  results  shown  in  Fig.  1(b),  in  that  the  gap 


JVST  B  -  Microelectronics  and  Nanometer  Structures 


3456 


Abdo  et  ai:  Optimizing  the  fluid  dispensing  process 


3456 


2.8 

2.4 

? 

I  2.0 

f  1.6 

1 

$  1.2 

i 

a  08 

0.4 


v\ — 

X  \ 

\  lm 

Jinginj 

1  let 

=  750 

<5% 

•  1  ■  . 

ptimun 

pensin 

x\ 

1  X 

> 

„**tm 

>9 

•- 

i 

k: 

Meniscus  crawis  ^ 
along  the  lens 

— i _ i _ . _ l _ . _ 1 _ . _ 

''We,-- 

=i 

L__ 

0.0' 

0.2  0.3  0.4  0.5  0.6  0.7  0.8  0.9  1.0 


Gap  Height  (mm) 

Fig.  2.  Delineation  of  the  different  physical  behaviors  of  the  fluid  dispense 
process  based  on  critical  Weber  and  Reynolds  numbers. 


height  has  only  a  very  small  effect  on  the  fluid  dispense 
velocity  at  the  onset  of  the  meniscus  crawl  behavior. 

At  very  high  fluid  dispense  velocities  the  fluid  inertia  is 
sufficient  to  create  a  thin,  high  velocity  fluid  film  on  the 
wafer  surface;  if  the  fluid  film  is  sufficiently  thin,  the  gap 
does  not  entirely  fill  with  liquid,  as  shown  in  Fig.  1(d).  This 
behavior  is  also  undesirable.  At  each  gap  height,  a  series  of 
simulations  at  progressively  higher  dispense  velocities  was 
used  to  delineate  the  transition  from  a  completely  filled  gap 
to  this  impinging  jet  behavior,  also  shown  in  Fig.  1(b).  The 
specification  of  this  transition  is  more  clear  than  the  menis¬ 
cus  crawl  transition  as  it  is  based  entirely  on  whether  an 
unfilled  region  persists  adjacent  to  the  dispense  port. 

The  thickness  and  velocity  of  the  fluid  layer  that  is  formed 
on  the  wafer  can  be  approximately  determined  by  balancing 
fluid  momentum  and  shear  in  the  thin  film  while  enforcing 
that  the  mass  carried  by  the  film  is  consistent  with  the  dis¬ 
pensed  mass  flow  rate.  If  the  fluid  momentum  is  large 
enough  then  the  thickness  of  the  liquid  film  is  less  than  the 
gap  height  and  impinging  jet  behavior  occurs.  The  transition 
into  this  behavior  is  therefore  governed  by  a  balance  between 
fluid  inertia  and  viscous  shear  which  constitutes  a  Reynolds 
number  (Re): 


(2) 


where  h  is  the  gap  height  and  /x  is  the  fluid  viscosity.  Notice 
that  this  Reynolds  number  is  proportional  to  the  gap  height, 
which  is  consistent  with  the  results  shown  in  Fig.  1(b)  in  that 
the  transition  velocity  decreases  as  the  gap  height  increases. 

These  two  extreme  cases  bracket  a  range  of  acceptable 
dispensing  velocities.  An  acceptable  dispense  velocity  is 
characterized  as  being  sufficiently  high  so  that  the  meniscus 
immediately  breaks  directly  under  the  dispense  port  but 
small  enough  so  the  entire  gap  is  filled,  resulting  in  the  rapid 
and  controlled  fill  process  shown  in  Fig.  1(e).  Figure  1(b) 
illustrates  that  the  range  of  acceptable  dispensing  velocities 
narrows  as  the  gap  height  increases. 

Figure  2  overlays  the  CFD  results  onto  lines  associated 


Lens 


Stationary  Wafer 

(a) 


(b)  Dispense  at  3.0  m/s  and  30° 


(c)  Dispense  at  1.8  m/s  and  60° 


(d) 


Fig.  3.  (a)  Schematic  illustrating  the  different  dispensing  angles  investi¬ 
gated,  (b)  the  CFD  simulation  for  a  30°  dispense  angle  with  a  3.0  m/s 
dispense  velocity,  (c)  the  CFD  simulation  for  a  60°  dispense  angle  with  a 
1.8  m/s  dispense  velocity,  and  (d)  the  effect  of  changing  the  dispensing 
angles  and  velocities  on  the  optimum  dispensing  zone:  the  0.3  mm  gap 
(shown  by  the  dotted  line)  was  used  to  illustrate  the  fluid  dispensing  behav¬ 
ior  for  three  dispensing  angles,  the  velocities  indicated  on  the  graph  shows 
the  new  locations  for  the  transition  zones  (i.e.,  0.8  and  0.9  are  the  meniscus 
crawl  critical  velocities  for  30°  and  60°  angles).  The  arrows  show  the  new 
potential  location  of  the  transition  lines  as  the  dispensing  angle  increases. 


with  a  constant  Weber  number  and  Reynolds  number.  Notice 
that  the  transition  from  optimal  dispense  to  impinging  behav¬ 
ior  is  approximately  consistent  with  a  transition  Reynolds 
number  (Ref)  of  750  and  the  transition  from  optimal  dispense 
to  meniscus  crawl  behavior  is  approximately  consistent  with 
a  transition  Weber  number  (We?)  of  1.  There  is  some  discrep¬ 
ancy  between  the  transitions  associated  with  these  critical 
values  of  Weber  and  Reynolds  number  and  the  transitions 
predicted  by  CFD,  particularly  at  larger  gap  heights.  These 
discrepancies  can  be  partly  explained  by  the  fact  that  the 
transition  between  dispense  behaviors  is  actually  a  gradual 
change  rather  than  an  abrupt  one  and  therefore  the  determi¬ 
nation  of  the  critical  dispense  velocities  that  delineate  these 
regions  relies  both  on  judgment  and  the  resolution  of  the 
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Fig.  4.  (a)  Schematic  illustrating  the  details  of  the  model  verification  experi¬ 
ments,  (b)  a  close-up  showing  the  fixture  and  glass  window  replicating  the 
lens,  (c)  a  schematic  of  the  lens  showing  the  recesses  and  the  fluid  dispense 
and  recover  ports,  and  (d)  the  CFD  half-symmetry  model  developed  to  rep¬ 
licate  the  experiment. 


velocities  used  in  the  CFD  simulation.  Also,  the  physics  of 
the  problem  changes  as  the  gap  increases;  for  example,  a 
larger  gap  allows  the  meniscus  that  forms  on  the  lens  to 
expand  laterally  by  a  greater  amount  prior  to  contacting  the 
wafer  and  as  a  result  the  length  scale  associated  with  the 
Weber  number  is  no  longer  directly  related  to  the  dispense 
port  as  given  by  Eq.  (1). 

Additional  CFD  simulations  showed  that  the  dispensing 
behavior  is  affected  by  the  fluid  dispense  angle.  Figure  3(a) 
illustrates  the  three  dispensing  angles  investigated  in  this 
work,  i.e.,  0°,  30°,  and  60°,  with  the  positive  angles  indicat¬ 
ing  that  the  fluid  is  directed  towards  the  lens.  For  a  moderate 
dispense  angle  (e.g.,  30°),  the  same  regions  of  behavior  exist 
but  the  transitions  are  shifted.  The  inertia  of  the  fluid  in  the 
direction  perpendicular  to  the  wafer  is  reduced  when  the 
fluid  is  dispensed  at  an  angle.  Both  the  meniscus  crawl  and 
impinging  jet  transitions  are  proportional  to  this  fluid  inertia 
and  therefore  both  lines  shift  upwards;  a  higher  dispense  ve¬ 
locity  is  required  to  yield  the  same  inertia.  This  effect  is 


Fig.  5.  Photographs  showing  the  progress  of  the  filling  water  front  for  (a) 
0.5  mm  gap,  (b)  1.5  mm  gap,  and  the  corresponding  CFD  simulation  results 
showing  similar  behavior  for  (c)  0.5  mm  gap,  (d)  1.5  mm  gap. 


shown  Fig.  3(d)  which  contains  the  original  points  delineat¬ 
ing  the  dispense  process  regimes  for  the  0°  fluid  dispense 
angle  case  as  well  as  additional  points  for  a  gap  height  of  0.3 
mm  and  fluid  dispense  angles  of  30°  and  60°.  Note  that  for  a 
30°  fluid  dispense  angle,  indicated  by  the  triangles  in  Fig. 
3(d),  both  the  meniscus  crawl  transition  and  impinging  jet 
transition,  shown  in  Fig.  3(b),  are  shifted  to  higher  velocities. 
For  a  60°  dispense  angle,  the  meniscus  crawl  transition  again 
moves  to  a  higher  velocity.  However,  the  impinging  jet  tran¬ 
sition  no  longer  occurs.  Instead  a  new  behavior  is  exhibited 
in  which  the  fluid  remains  attached  to  the  lens  and  tends  to 
pull  air  into  the  gap,  as  shown  in  Fig.  3(c). 

III.  EXPERIMENTAL  VERIFICATION 

In  order  to  verify  some  aspects  of  the  CFD  model,  the 
simulated  dispensing  process  was  compared  with  an  experi¬ 
ment  in  which  a  metered  amount  of  water  was  used  to  fill  a 
controlled  lens-to-wafer  gap.  Figure  4(a)  shows  a  schematic 
of  the  experiment.  The  progress  of  the  filling  process  was 
monitored  using  a  camera  that  was  mounted  over  a  glass 
window  (which  takes  the  place  of  the  lens  element).  Figure 
4(b)  shows  a  cut-away  view  of  the  mounting  fixture  and 
indicates  passages  for  the  dispense  and  recovery  of  water. 
Figure  4(c)  is  a  solid  model  of  the  bottom  of  the  holder 
which  shows  the  recesses  in  the  structure  used  to  manifold 
the  water  to  the  dispense  and  recovery  ports. 

A  three-dimensional  CFD  model  was  developed  with  the 
goal  of  replicating  the  actual  experimental  geometry  as 
closely  as  possible,  as  shown  in  Fig.  4(d).  The  details  of  the 
recessed  dispense  port,  the  recessed  recovery  port,  and  the 
gap  can  be  identified  in  the  CFD  model.  The  experiment  was 
carried  out  at  the  same  flow  rate  but  varying  gap  heights. 
Three  of  these  experiments  were  simulated,  including  0.5 
mm,  1.0  mm,  and  1.5  mm  gap  heights.  Figures  5(a)  and  5(b) 
show  pictures  of  the  fluid  front  that  forms  as  the  immersion 
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Abdo  et  ai:  Optimizing  the  fluid  dispensing  process 
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Fig.  6.  Distance  from  the  fluid  front  to  the  dispense  port  along  the  centerline 
of  the  glass  window  measured  from  experimental  results  and  predicted  by 
the  CFD  simulations  for  (a)  0.5  mm  gap,  (b)  1.0  mm  gap,  and  (c)  1.5  mm 
gap. 
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fluid  fills  a  0.5  mm  and  1.5  mm  gap,  respectively.  Notice  that 
the  fluid  front  formed  for  the  smaller  gap  is  much  flatter 
across  its  advancing  edge  than  for  the  larger  gap.  This  dif¬ 
ference  is  related  to  whether  the  dispense  flow  rate  is  suffi¬ 
cient  to  fill  the  entire  width  of  the  gap  at  the  rate  that  the 
leading  edge  of  the  meniscus  is  drawn  through  the  gap  by 
surface  tension.  The  CFD  simulations  for  the  filling  process 
for  the  same  gaps,  shown  in  Figs.  5(c)  and  5(d),  show  similar 
behavior.  Quantitative  comparisons  between  the  experiments 
and  the  CFD  simulations  are  shown  in  Figs.  6(a),  6(b),  and 
6(c),  where  the  distance  between  the  water  front  to  the  dis¬ 
pense  port  measured  along  the  center  of  the  circular  glass 
window  is  shown  as  a  function  of  time  relative  to  the  initia¬ 
tion  of  the  fill  process  for  the  0.5  mm,  1.0  mm,  and  1.5  mm 
gaps,  respectively.  The  results  from  the  CFD  simulations  and 
the  experiments  are  in  very  good  agreement. 

IV.  SUMMARY  AND  CONCLUSIONS 

Two-dimensional  CFD  models  were  used  to  investigate 
the  effects  of  fluid  dispense  velocity,  gap  height,  and  fluid 
dispense  angle  on  the  process  of  filling  the  lens-wafer  gap 
prior  to  immersion  lithography.  The  simulations  revealed  that 
an  optimal  region  exists  in  the  space  of  fluid  dispense  veloc¬ 
ity  and  gap.  The  position  of  this  region  shifts  as  the  fluid 
dispense  angle  is  changed. 

A  three-dimensional  CFD  model  was  developed  in  order 
to  replicate  an  experiment  in  which  a  controlled  fill  is  moni¬ 
tored  through  a  glass  window.  The  simulation  results  agreed 
well  with  the  experimental  observations  both  quantitatively 
in  terms  of  the  progression  of  the  fluid  front  with  time  and 
qualitatively  in  terms  of  the  shape  of  the  fluid  front. 
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ABSTRACT 

Phase-modulation  scatterometry  is  a  metrology  technique  for  determining  the  parameters  of  gratings  using  as  a  key  device  a 
phase  modulator.  For  measurement  purposes  the  phase  modulator  requires  a  complicated  calibration  procedure  that  is 
analyzed  here  in  detail.  The  main  source  of  error  to  be  dealt  with  are  the  fluctuations  of  the  phase  modulation  amplitude.  The 
measurables  are  the  direct  term  and  the  first  two  harmonics  of  the  output.  For  the  fitting  of  the  experimental  data  we  used  the 
ratio  of  the  harmonics  to  the  direct  term  because  it  improves  significantly  the  accuracy.  A  sensitivity  analysis  was  performed 
for  two  samples,  one  real  and  one  theoretical,  to  find  the  measurement  configuration  that  insures  optimum  determination 
precision  for  the  grating  parameters.  For  the  real  sample,  comparison  of  the  theoretical  predictions  for  sensitivity  with  the 
actual  values  showed  a  good  agreement.  For  both  samples  the  sensitivity  analysis  indicated  sub-nanometric  precision  for  the 
critical  dimension  (grating  line  width). 

Keywords:  scatterometry,  phase  modulation,  sensitivity 

1.  INTRODUCTION 

In  the  ongoing  effort  to  develop  scatterometry  as  a  sensitive  metrology  technique  ^  we  continued  trying  to  improve  it  by 
implementing  new  measurement  techniques.  In  this  article  the  implementation  of  a  phase-modulation  technique  (PMT)  in 
scatterometry  will  be  discussed.  The  phase  modulation  technique  (PMT)  was  first  applied  for  optical  measurement  by 
Jasperson  et  al.  and  since  then  was  continuously  developed  ^T0  The  technique  was  previously  applied  to  scatterometry 
10,  but  an  analysis  of  the  sensitivity  and  simplified  synthetic  formulae  for  the  measurables,  to  our  knowledge,  are  not 
available  in  the  literature  and  they  are  thus  an  original  contribution.  Also  the  error  analysis  of  the  calibration  from  paragraph 

(4)  is  pushed  to  a  deeper  level  than  that  of  Acher  et  al  ^  It  turns  out  that  not  all  of  the  calibration  configurations  proposed  by 
them  can  be  applied,  except  in  a  modified  form.  A  sensitivity  analysis  formally  identical  to  the  one  performed  for  the 

ellipsometric-scatterometer  in  Reference  H, 
namely  the  application  of  the  SAF  formalism  is 
done  for  the  phase-modulation  scatterometer. 


2.  EXPERIMENTAL  ARRANGEMENT 

2.1.  General  view 

The  experimental  arrangement  that  we  call  "phase- 
modulator  scatterometer"  is  still  an  ellipsometric- 
scatterometer.  Compared  to  the  scatterometer 

described  elsewhere  H,  there  is  only  one  new 
device  added  in  the  system,  the  phase-modulator 
shown  in  Fig  1 .  This  one  device,  however, 
radically  modifies  the  nature  of  the  experimental 
arrangement,  as  we  shall  see  in  the  following. 

The  phase-modulator  is  basically  a  silica  bar  glued 
to  a  piezoelectric  quartz  bar  transducer.  The 
transducer  is  subjected  to  an  oscillating  voltage 


Fig.  1  The  experimental  arrangement  of  the  phase- 
modulator  scatterometer. 


Correspondence:  Email:  logofatu@unm.edu;Telephone  505  272  7808;  Fax  505  272  7801 
Permanent  address:  National  Institute  of  Laser,  Plasma  and  Radiation  Physics  (INFLPR),  Bucharest-M'gurele,  PO-Box  MG-36,  76900, 
Romania 


with  a  frequency  of  50  kHz.  Because  of  its  piezoelectric  properties,  the  quartz  bar  then  applies  a  pressure  oscillating  with  the 
same  frequency  to  the  silica  bar.  The  geometry  of  the  homogenous  silica  bar  changes,  becomes  anisotropic,  and  the  bar 
behaves  like  a  uniaxial  crystal  with  dychroic  properties.  The  result  is  that  oscillation  modes  of  the  refractive  index  of  the 
silica  bar  are  excited,  and  oscillating  phase  retardation  is  introduced  in  the  light  that  passes  through  the  bar.  Specifically  the 
vector  electric  field  component  parallel  to  the  longitudinal  axis  of  the  bar  is  retarded.  Because  the  eccentricity  of  the 
polarization  ellipse  oscillates  at  the  analyzer,  the  photodetector  receives  an  oscillating  signal.  A  lock-in  amplifier  measures 
the  50  kHz  harmonics  of  the  signal.  These  harmonics  are  the  measurables  of  the  experimental  arrangement,  and  they  are 
functions  of  the  geometrical  parameters  of  the  sample. 

For  the  phase-modulator  scatterometer  we  need  a  photo-detector  of  large  bandwidth,  because  the  oscillation  frequency  of  the 
modulator  is  50  kHz.  For  the  ellipsometric- scatterometer  H,  where  the  chopper  frequency  in  use  is  about  200  Hz,  there 
was  no  need  for  a  large  bandwidth  photodetector.  It  is  useful  to  note  that  there  is  such  a  large  gap  between  the  frequency  of 
the  chopper  and  that  of  the  modulator  that  both  can  be  used  together  in  the  experimental  arrangement,  because  they  do  not 
practically  interfere. 

2.2.  Alignment 

An  important  experimental  issue  is  to  determine  the  location  on  the  modulator  surface  where  the  amplitude  of  the  standing 
stress  wave  is  maximum,  because  the  spatial  variation  is  minimum  there  and  we  have  uniform  modulation  amplitude.  We  can 
do  this  by  monitoring  the  amplitude  of  the  first  or  second  harmonics  of  the  signal  when  the  light  spot  moves  across  the 
modulator  surface.  The  first  harmonic  ^varies  approximately  as  Ji(A),  where  A  is  the  modulation  amplitude,  while  the 
second  harmonic  S2 «  as  J2(A )  (Bessel  functions).  The  harmonics  of  the  signal  are  defined  in  paragraph  3,  where  the  purport  of 
these  considerations  will  be  better  understood.  For  values  of  A  below  90°,  Ji  and  J2  are  monotonically  increasing  with  A  .  The 
location  where  A  is  maximum  is  obviously  the  place  where  the  harmonics  S®  and  S2(i)  have  a  maximum.  Also,  the  beam  spot 
area  has  to  be  small  to  insure  the  uniformity  of  the  amplitude  modulation. 

For  alignment  purposes  only,  we  will  use  as  sample  a  phase  retarder  instead  of  a  grating.  The  orientation  angle  of  the  retarder 
axis  is  \(/and  the  phase  retardation  is  A  The  coordinate  axes  of  the  system  are  the  axes  of  the  retarder.  In  this  case  the  I 
coefficients  from  Eqs.  (9)  have  the  more  simple  form 

( 1  -  cos  2i|/cos  2A)+  cos  2{P-M  )cos  2M  (cos  2 A  -cos  2i|/)+ 

+  sin  2  A  cos  Acos  2(P  -M  )sin  2\(/sin  2M 

I  s  =  sin  2  (P  -  M  )sin  2  A  sin  2\(/sin  A  ,  (1  .b ) 

/,  =  sin  2{P -M  )[(cos  2v|/-  cos  2A)sin  2M  +  sin  2  Acos  2M  sin  2v|/cos  A] ,  (l.c) 

withP,  A  and  M  the  orientation  angles  of  the  polarizer,  analyzer  and  modulator  respectively.  We  have  the  polarizer  aligned 
parallel  to  the  modulator  when  the  first  and  the  second  I  coefficients  (1  b,  c)  are  zero  no  matter  what  the  position  of  the 
analyzer  is.  This  happens  because  Is  and  Ic  are  proportional  to  sin2(P-M) .  We  also  know  that  the  retardation  axis  is  the 
longitudinal  axis  of  the  silica  bar.  We  then  set  the  polarizer  so  that  P-M=45°.  For  a  fixed  value  of  P-M,  we  vary  M  and  A 

until  we  obtain  the  extinction  of  Is  and  Ic  again.  This  happens  when  M=A=0°,  because  Is  is  proportional  to  sin2A  and  Ic  has  a 

term  proportional  to  sin2A  and  another  proportional  to  sin2M.  In  this  way  we  determine  the  zero  position  for  the  modulator 
and  the  analyzer. 


3.  MEASURABLES 

From  Fig.  1  we  see  that  the  transformation  suffered  by  the  beam  in  the  phase-modulator  scatterometer  is  described  by 

Eout  =  Af  M  P  Ein  ,  (2) 

where  the  Jones  matrices  for  analyzer,  sample  and  modulator  are 
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respectively,  with 

5(f)  =  A  sin  cm  ,  (4) 

which  is  the  variable  phase  retardation  introduced  by  the  modulator  in  which  A  is  the  amplitude  of  the  modulator.  The  matrix 

of  the  polarizer  is  simply  the  analyzer  matrix  with  A  substituted  by  P.  Eout  and  E!n  are  the  output  and  input  vector  electric 
fields,  respectively.  After  making  the  calculations  in  (2)  we  obtain 

r  =  Eout / EP  =  cos  ( P-M  \rP  cos  M  +  rs  sin  M )+  sin  (P  -  M )(-  rp  sin  M  +  rs  cos  M  )exp  (-  j8(t )) ,  (5) 

where  we  used  the  notations 

(6.a) 
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The  notations  (6)  were  chosen  for  convenience  reasons,  but  they  have  a  certain  physical  meaning  as  well.  Namely,  rp  is  the 
contribution  of  the  p  component  of  the  beam  incident  on  the  sample  to  the  output,  and  rs  is  the  similar  contribution  of  the  s 
component.  Squaring  the  absolute  value  of  r  in  Eq.  (5)  we  obtain  in  terms  of  power 


l(t)=I0+Is  sin  5 (f )+  /,  cos  5(f), 


where 


/()  =  r  2  +  r  2  +  cos  2 (P  -M) 


Is  =2  sin  2{P-M  )lm(rpip), 
I  =sin2  (P-M) 
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The  calculations  through  which  we  obtained  the  functions  Iq,  Is  and  Ic  are  very  laborious  and  even  more  laborious  is  to  obtain 
a  simple,  synthetic  expressions  of  them  as  in  Eqs.  (9).  However,  there  is  a  simple  way  to  verify  the  accuracy  of  Eqs.  (9).  With 
the  help  of  a  programming  environment  for  symbolic  calculation  as  powerful  as  Mathematica,  we  can  calculate  the  expanded 
form  of  the  harmonics,  which  is  in  the  range  of  pages  of  formulae.  Then  we  program  the  computer  to  simplify  the  difference 
between  the  expanded  forms  of  the  harmonics  and  their  synthetic  expression  (9).  As  expected,  the  result  of  the  simplification 
is  zero.  One  may  notice  that  I(t)  from  Eq.  (8)  is  not  actually  the  square  of  r,  but  its  double.  Since  the  power  is  half  of  the 
square  of  the  absolute  value,  it  follows  that  we  multiplied  (8)  with  4.  The  reason  for  this  choice  is  that  it  normalizes  the 
harmonics  (9),  their  maximum  absolute  value  becomes  1.  The  inspection  of  Eqs.  (9)  shows  what  he  have  advertised  in  the 
introduction  of  this  chapter,  the  fact  that  the  measurables  depend  on  the  signs  of  the  phases.  This  dependence  is  insured  by 

the  fact  that  the  formulae  (9)  contain  both  a  term  Re  (r  r*)  and  a  term  Im(rpr5*),  the  cosine  and  the  sine  of  the  phase  of  the 
complex  term  r  r* . 


Since  in  Eq.  (8)  we  have  to  deal  with  trigonometric  functions  with  the  retardation  8  from  (4)  as  argument,  it  is  useful  to  write 
the  expansion  of  sine  and  cosine  of  Sin  Bessel  functions 

sin  5(f)  =  2/i(A  )sin cot  +  2 (>4  )sin  3aJ  +... higher  harmonics  ,  (lO.a) 

cos  5(f)  =  J0  (A  )+  2J2  (A  )sin  2cot+ . . .  higher  harmonics  .  (lO.b) 


It  follows  that  the  harmonics  of  the  output  signal  are 
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The  S  coefficients  of  Eq.  (11)  are  what  can  be  measured  with  the  phase-modulator  scatterometer  in  the  ideal  case.  In  practice 
we  set  the  modulation  amplitude  A  so  that  we  have 


A  =  A  0  =137.8°,  (12.a) 

which  nullifies  the  Bessel  function  of  zeroth  order, 

•/0(M  =  °-  (12-b) 

For  this  value  of  the  modulation  amplitude,  the  measurables  S  are  proportional  to  the  I  quantities.  In  practice,  however,  we 
prefer  to  work  with  the  quantities 
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There  are  some  obvious  advantages  of  working  with  the  quantities  (13)  besides  the  fact  that  they  are  normalized.  The 
numerator  and  the  denominator  of  (13)  are  measured  simultaneously  and  therefore  their  ratio  is  free  from  systematic  errors 
caused  by  the  spatial  displacement  of  the  beam  spot  on  the  photodetector  area  and  by  fluctuations  in  time  of  the  laser  power 
and  other  variables. 


Note  also  that,  with  the  phase-modulation  scatterometer,  we  extract  more  information  than  in  the  case  of  the  ellipsometric  - 
scatterometer  because  we  measure  the  two  quantities  (13. a,  b)  simultaneously.  Therefore,  the  information  provided  by  the 
phase-modulation  scatterometer  is  richer  than  that  provided  by  the  ellipsometric- scatterometer  under  two  aspects: 
qualitatively,  because  the  measurements  depend  on  the  signs  of  the  phases,  and  quantitatively,  because  we  take  double  the 
number  of  measurements  in  a  0-scan. 


4.  CALIBRATION 

Unlike  the  ellipsometric- scatterometer,  the  calibration  of  the  phase-modulation  scatterometer  can  be  quite  a  challenge.  New 
problems  arise,  for  instance,  because  the  modulation  amplitude  A  is  not  constant  in  time.  This  is  a  significantly  important 
new  source  of  errors  that  we  must  minimize.  For  this  reason  the  calibration  of  the  modulation  amplitude  performed  as 
indicated  in  Eqs.  (12)  must  be  done  repeatedly  during  the  experiment.  There  is  a  better  way  to  maintain  constant  the 

modulation  amplitude,  by  monitoring  the  ratio  of  the  third  and  first  harmonic  9,  but  our  present  experimental  arrangement 
does  not  yet  support  this  technique. 

Another  problem  is  that  in  practice  we  must  deal  with  slightly  different  equations  than  those  corresponding  to  the  ideal  case 
shown  in  Eq.  (11).  Namely  new  non-diagonal  elements  of  the  matrix  on  the  right  hand  side  of  Eq.  (11)  become  non-zero,  and 
they  are  too  large  to  be  neglected  (around  few  percents).  The  reason  for  this  non-ideal  behavior  is  still  in  dispute.  According 
to  Jasperson  et  al.  the  reason  is  that  the  modulator  has  a  residual  constant  birefringence  ^ .  Acher  et  al.,  on  the  other  hand, 


hypothesize  that  higher  vibration  modes  (i.e.  2co,Jcoetc.  modes)  are  excited  by  the  transducer  in  the  silica  bar  resulting  in  a 
modulation  of  the  form  ^ 

S(t  )  =  S0  +  A :  sin  (cof)+/\2  sin  (2cof )+  A  3  sin(3cof)  +  ... ,  (14) 


where  the  amplitudes  A  j  A2  A3,  etc.  may  be  complex.  A  very  important  point  made  by  Acher  et  al.  is  that  no  matter  how 
many  higher  modes  from  (14)  perturb  the  modulation  and  hence  the  coefficients  of  the  transmission  matrix  in  (11),  the 
relation  between  the  S  and /coefficients  is  linear.  They  show  then  thatEq.  (11)  should  be  rewritten  as 
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The  terms  Tj  and  T2  are  called  transmission  coefficients  and  they  are  approximately  1.  The  coefficients  Cc>o ,  CSyo ,  Cc>(0  and 
Cs,2 caare  called  coupling  coefficients,  and  their  absolute  values  are  small  compared  to  1.  The  ensemble  of  the  coefficients 
introduced  in  Eq.  (15)  will  be  called  adjustment  coefficients.  Now  we  have  to  set  the  amplitude  A  to  a  different  value  than 
137.8°,  because  of  the  perturbation  introduced  by  Ccp.  After  setting/!  so  that  Jd(A  )+Cc>o  is  as  close  as  possible  to  0  we  have 
only  to  determine  Tj,T2,  Cs0 ,  Cc>G)  and  Cs>2(iy. 


Table  1  Calibration  configurations  for  the  phase  modulator. 


Configuration  label 

P-M  (°) 

M(°) 

phase  retarder 

A  (°) 

a) 

+45 

0(90) 

no 

+45 

b) 

+45 

+45 

yes 

+45 

c) 

+45 

0(90) 

yes 

+45 

All  these  operations  can  be  made  with  the  help  of  the  measurement  configurations  described  in  Table  1.  Instead  of  a  sample, 
a  quarter-wave  phase  retarder  or  nothing  are  alternately  inserted  in  the  experimental  arrangement.  The  I  coefficients  (9)  take 
the  form  (1).  The  angles  M  and  A  are  measured  with  respect  to  the  axes  of  the  retarder.  For  configuration  a)  the  absolute 
values  of  the  angles  do  not  matter,  but  only  their  difference,  and  we  do  not  need  to  set  reference  axes.  The  void  that  is  in 
place  of  the  sample  in  configuration  a)  can  be  modeled  as  a  retarder  with  arbitrary  orientation  \\f  and  with  the  retardation 
A^0°.  The  ratios  (13)  in  the  non-ideal  case  become 
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where  we  noted 

=Jo(A  )+  Cc  0. 


(17.b) 

(18) 


In  Eqs.  (17)  we  kept  only  the  first  order  of  the  expansion,  taking  into  account  that  the  transmission  coefficients  are  close  to  1 
and  the  coupling  coefficients  are  close  to  0. 


We  will  particularize  Eqs.  (9)  and  (17)  for  each  configuration.  Since  the  coupling  coefficients  are  very  small,  we  have  to  take 
into  account  the  first  order  expansion  of  the  uncertainties  of  the  angles  P,  A,  M,  \(/and  A  because  they  are  of  the  same  order 
as  the  coupling  coefficients,  and  they  can  infringe  in  the  validity  of  the  calibration. 

Configuration  a) 

Eqs.  (9)  become 


4=r 


I  c  =  sin  2{P  -  M  )sin  2(A  -  M  )=  ±1 . 


Then  the  quantities  (17)  become 


K  ^c,C0^c  9 
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(19.a) 

(19.b) 

(19.c) 

(20.a) 

(20.b) 


Changing  P  or  A  with  ±90°  changes  the  sign  of  Ic.  The  modulation  amplitude  is  adjusted  so  that  the  modulus  of  R2(0  remains 
unchanged  for  ±90°  rotation  of  the  polarizer  or  the  analyzer.  Fig.  2  illustrates  how  R2 «  varies  in  two  configurations  of  type  a) 
when  the  modulation  amplitude  is  varied  and  how  the  curves  intersect  at  A  0. 

One  may  notice  that  the  configuration  in  which  the  polarizer  and  the  analyzer  are  parallel  is  more  sensitive  to  variations  of 
the  modulation  amplitude.  It  follows  that  for  routine  recalibrations  of  A  ,  after  T2  has  been  determined,  it  is  better  to  use  this 
configuration  rather  than  the  one  in  which  the  polarizer  and  the  analyzer  are  perpendicular.  All  we  have  to  do  then  for  a 

routine  recalibration  is  to  adjust  A  so  that  R2(0 
corresponding  to  the  configuration  P-45,  A-45  reaches 
the  closest  value  to  R2(0  corresponding  to  the 
configuration  P-45,  A--45,  which  we  may  consider 
constant  because  its  insensitiveness  to  changes  of  A  . 

After  we  nullify  J0  we  can  easily  measure  T2  and  Cc>ft) 
using  Eqs.  (20).  Taking  into  account  the  fact  that  a 
rotation  of  P  or  A  with  ±90°  changes  the  sign  of  7C,  we 
obtain 
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Fig.  2  Illustration  of  setting  the  modulation  amplitude 
so  that  J0=0. 

Configuration  c) 

We  have  (9)  yielding 


Cr 


Here  the  superscripts  7,  2  refer  to  two  cases  of 
configuration  a),  one  with  the  polarizer  or  the  analyzer 
rotated  with  ±90°  in  rapport  with  the  other. 


'o=l. 

7  =  sin  2 (P  -  AT )sin  2A  =  ±1 , 

Ic  =  sin  2 (P  -  M) sin  2 A  cos  2 Mea 


;  0 . 


(22.a) 

(22.b) 

(22.c) 


Here  ^  is  the  uncertainty  of  the  retardation  A  Thus  (17)  becomes 


(23.b) 


^2(0  “  T2Ic  +  Cs,2a/s  • 

Changing  P  or  A  by  ±90°  changes  the  signs  of  both  Is  and  Ic.  We  want  to  change  the  sign  of  Is  in  order  to  separate  the  term 
with  from  the  term  with  C^in  Eq.  (23  a).  We  also  want  to  separate  the  term  with  T2  from  the  term  with  Cs>2(0  from  (23 .b). 
Hence  we  need  a  transformation  that  changes  the  sign  of  Is  but  not  that  of  Ic.  This  transformation  is  the  rotation  of  the  phase 
retarder  by  ±90°,  which  means  the  coordinate  axes  are  rotated  with  the  same  amount.  This  is  equivalent  to  simultaneously 
rotating  P,  A  and  M  by  +90°  .  Therefore  we  have 
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(24.a) 

CJ,0  =  k  +  ^)/2, 

(24.b) 

C^=sgnk)(<-/?22J/2, 

(24. c) 

where  the  superscripts  correspond  to  two  cases  of  configuration  c),  one  having  the  phase  retarder  rotated  with  ±90°  in  rapport 
with  the  other. 

Configuration  b) 

Although  Acher  et  al.  9  recommend  configuration  b)  for  calibration  purposes, 
provide  an  accurate  determination.  Indeed,  for  configuration  b)  (9)  gives 

we  rejected  it  because  it  cannot 

(25. a) 

/  =  sin  2 (p  -  M  )sin  2A , 

(25.b) 

Ic  =  sin  l(P  -  M  )sin  2 M  -  eA ) , 

(25  .c) 

where  €y  and  ^  are  the  uncertainties  in  the  orientation  angle  of  the  phase  retarder  and  the  analyzer  respectively.  Eqs.  (17) 
thus  become  Eqs.  (23),  the  same  as  for  configuration  c).  As  in  the  case  of  configuration  c),  we  need  a  transformation  that 
changes  the  sign  of  Is  but  not  that  of  Ic.  We  cannot  move  the  analyzer  or  the  phase  retarder,  because  the  uncertainties  €y  and 
would  change  uncontrollably,  prohibiting  the  use  of  Eq.  (25. c).  Rotating  the  polarizer  does  not  perform  the  transformation 
we  need.  Only  rotating  the  modulator  provides  the  desired  transformation.  In  our  experimental  system  the  modulator  cannot 
be  precisely  rotated.  Therefore  we  did  not  use  configuration  b).  The  transformation  used  by  Acher  et  al.  was  the  rotation  of 
the  analyzer  or  the  polarizer,  not  of  the  modulator,  which  we  showed  above  that  does  not  provide  an  accurate  calibration. 


Table  2  The  adjustment  coefficients  for  the  modulator. 


Adjustment  coefficients 

Ti 

t2 

Cs,o(%) 

Cc,«,(%) 

<W%) 

Value 

0.9962 

0.9934 

-0.03 

-1.79 

1.69 

Precision 

0.0008 

0.0007 

0.04 

0.25 

0.32 

After  performing  several  calibrations  of  our  modulator  we  obtained  the  average  values  for  the  adjustment  coefficients 
displayed  in  Table  2.  One  comment  that  can  be  made  about  this  table  is  that  it  is  consistent  with  the  hypothesis  of  Jasperson 

et  al.  ^  about  the  source  of  the  adjustment  coefficients,  rather  than  the  hypothesis  of  Acher  et  al.  9  Namely,  Table  2  shows 
evidence  favoring  the  hypothesis  that  the  source  of  errors  is  a  constant  residual  birefringence  of  the  silica  bar.  Indeed,  if  Eq. 
(14) reduces  to 

5(t )  =  80  +  A  sin  (cor),  (26) 

then  (15)  becomes 
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This  means  we  have 


CJi0  =  0,  (28. a) 

C,20)=-Cc,01=5(),  (28.b) 

which  is  supported  by  the  data  shown  in  Table  2.  We  can  even  calculate  §>  from  Table  2  and  obtain  that  it  is  equal  to  1.00°. 
However,  other  characteristics  of  our  system  fit  neither  the  analysis  of  Jasperson  nor  of  Acher.  For  instance  our  is  not  a 
residual  birefringence;  it  disappears  when  the  modulator  is  not  excited.  When  the  modulator  is  unexcited,  it  behaves  like 
simple,  neutral  glass.  Moreover,  is  not  even  constant,  it  varies  in  time.  This  observations  lead  us  to  the  conclusion  that  §>  is 
caused  by  imperfections  of  the  electronic  excitation  system,  in  particular  of  the  piezoelectric  quartz. 


Fig.  3.  Empirical  error  function  for  the  phase- 
modulator  scatterometer:  a)  b:  =  106  +  3.2  x  104 
l0,  b)  ls:  a  =  106  +  2.0  x  104  ls  c)  lc:  a  =  106  +  1 .4  x 
104  lc.  On  the  right  hand  of  each  figure  are  shown 
the  coefficients  for  the  best  fit  of  the  errors.  The 
measurements  were  performed  using  the  "XL6" 
sample  at  (|)=0oand  A^632.8  nm. 
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Fig.  4.  Example  of  fitting  data  from  the  phase- 
modulator  scatterometer  for  sample  "XL6"  at 
(|>=90oand  A^632.8  nm.  The  results  of  the  fitting 
were  lw=1493.1±1 .2  A,  hg=5568.4±3.3  A, 
h|=1 604. 3±1 .9  A. 

It  must  be  noted  that  the  calibration  procedures  are  very 
sensitive  to  the  quality  of  the  polarizers.  Polymeric 
polarizers  transform  the  incident  beam  into  an 
elliptically  polarized  beam  with  large  eccentricity  rather 
than  into  a  perfect  linearly  polarized  beam.  For  this 
reason,  to  perform  the  calibration  one  needs  better 
polarizers,  such  as  Glan-Thompson  prism  polarizers, 
with  a  very  high  extinction  ratio. 


M(°) 


5.  SYMMETRY  PROPERTIES  AND  SENSITIVITY  CONSIDERATIONS 

The  application  of  the  SAF  formalism  can  be  a  very  time-consuming  procedure.  In  order  to  avoid  burdening  the  computer 
with  unnecessary  sensitivity  calculations,  we  will  try  to  reduce  the  domain  of  the  degrees  of  freedom  of  the  experimental 

arrangement  ^  Unlike  the  ellipsometric-scatterometer,  for  the  phase-modulator  scatterometer  we  can  make  some  a 
priori  sensitivity  considerations,  independent  of  the  sample,  which  will  further  reduce  the  domain  of  experimental  parameters 
that  we  have  to  scan  in  the  search  of  the  optimum  sensitivity. 

At  first  sight  it  seems  we  have  four  degrees  of  freedom  for  the  phase-modulator  scatterometer:  P,  A,  M  and  (f*  The 
measurables  of  the  system,  the  ratios  R /?2co  from  (13),  are  approximately  equal  to  the  ratios  Is/I0  and  Ic/I0-  A  simple 
inspection  of  the  I  coefficients  from  Eqs.  (9)  shows  that  these  ratios  are  maximum  when  P-M=A45°.  For  improving  the 
signal  to  noise  ratio  factor  we  need  to  set  the  position  of  the  polarizer  relative  to  the  modulator  to  either  +45°  or  -45°  (i.e.  P- 
M=A45°). 

Changing  M  by  ±90°  changes  the  sign  of  Is  and  Ic  but  not  their  amplitude,  while  Io  remains  unchanged.  Thus  the  quantities 
R K,  and  /?2co  changes  only  their  the  sign,  which  is  inconsequential  for  the  sensitivity.  Therefore  we  can  restrict  the  sensitivity 
scan  to  a  domain  of  Mef 0°. . .  90°) . 


6.  THE  EMPIRICAL  ERROR  FUNCTION 

The  SAF  formalism  requires  an  extrapolation  of  the  measurement  errors  obtained  is  some  particular  measurement 
configurations  to  an  empirical  error  function  valid  for  all  the  measurement  configurations.  The  empirical  error  function  for 

the  phase-modulator  scatterometer  was  evaluated  in  the  same  way  it  was  evaluated  for  the  ellipsometric-scatterometer  ^  in 
Fig.  3  it  is  illustrated  how  we  fit  the  averaged  errors  for  a  set  of  one  hundred  measurements  with  the  empirical  function.  The 
function  that  best  approximates  the  measurement  errors  is 

cr2=l(T6  +  5.0xl0^/2.  (29) 
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Fig.  5.  Sensitivity  map  from  the  SAF  scan  for  the 
phase-modulator  scatterometer,  sample  XL6,  at 
X=632.8  nm.  Flere  the  color  code  is  as  follow: 
deeper  the  hue,  smaller  a. 
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Fig.  6.  Sensitivity  map  from  the  SAF  scan  for  the 
phase-modulator  scatterometer,  sample  P200,  at 
X=632.8  nm.  The  color  code  is  as  in  Fig.  5. 


7.  SAF  PREDICTIONS  AND  COMPARISON  WITH  EXPERIMENT 


The  SAF  formalism  H  was  applied  to  the  samples  "XL6"  and  "P200"  presented  in  Table  3,  for  the  measurables  from  (13).  In 
Fig.  4  an  example  is  shown  of  fitting  data  generated  by  the  phase-modulation  scatterometer  to  theoretical  curves  calculated 
with  RCWA.  In  Fig.  5  the  sensitivity  map  is  illustrated  for  sample  "XL6",  using  the  configurations  with  P^=45°,  among 
which  there  is  the  optimum  configuration.  For  the  time  being  we  are  restricted  by  the  fact  that  we  cannot  precisely  change  the 
orientation  angle  of  the  modulator  M.  Additionally,  because  of  the  property  shown  in  Fig.  2,  we  have  to  maintain  the  analyzer 
orientation  parallel  to  that  of  the  polarizer,  P=A=45°.  For  this  reason  we  cannot  yet  provide  experimental  data  for  some 
measurement  configurations  to  compare  with  the  theoretical  predictions  (for  instance  configuration  5  in  Table  3).  In  Fig.  6 
the  sensitivity  map  for  sample  "P200"  is  shown  for  configurations  type  P=A=45°.  The  code  color  for  both  Figs.  5  and  6  is 
the  same,  namely  the  deeper  the  hue  of  gray,  the  smaller  a. 

Table  3.  Geometrical  and  optical  parameters  of  the  samples  investigated.  The  subscripts  Mg" 
and  "s"  refer  to  the  grating  and  the  substrate,  respectively.  The  layers  from  the  grating  to  the 
substrate  have  numerical  subscripts  starting  with  1.  Sample  "XL6"  is  real,  and  sample 
"P200"  is  theoretical. 


Sample 

A  (A) 

lw  (A) 

hg  (A) 

hi  (A) 

% 

ni 

ns 

XL6 

3628.8 

1470 

5700 

1600 

1.64 

1.66 

3.882- 0.019  i 

P200 

2000 

700 

4000 

- 

1.64 

- 

3.882- 0.019  i 

Table  4.  Numerical  results  of  the  sensitivity  analysis  of  sample  "XL6"  for  phase-modulator 
type  measurables. 


Label 

Measurement  configuration 

lw 

<j(iw)  (A) 

a(lw)  (A) 

to 

M(°) 

p(°) 

A(°) 

(experiment) 

(theory) 

(experiment) 

1 

0 

90 

45 

45 

1467.2 

4.1 

2.1 

2 

45 

90 

45 

45 

1419.0 

40.2 

29 

3 

90 

90 

45 

45 

1507.4 

35 

2.9 

4 

21 

90 

45 

45 

1464.8 

5.0 

4.4 

5 

6 

37 

45 

-45 

- 

3.0 

- 

Table  5.  Numerical  results  of  the  sensitivity  analysis  of  sample  "P200" 
for  phase-modulator  type  measurables. 


Label 

Measurement  configuration 

o(lw)  (A)  (theory) 

to 

M(°) 

p(°> 

A(°) 

1 

3 

36 

45 

-45 

4.8 

2 

87 

56 

45 

-45 

4.7 

3 

90 

90 

45 

45 

18.8 

4 

24 

90 

45 

-45 

14.1 

In  Table  4  the  numerical  results  of  the  sensitivity  analysis  performed  on  the  sample  "XL6"  are  summarized.  The  first  four 
configurations  were  used  for  computing  the  empirical  error  function  discussed  in  paragraph  6.  We  tested  experimentally  these 
four  configurations.  Configuration  5,  deemed  by  SAF  to  be  optimum  could  not  be  tested.  We  notice  a  qualitative  agreement 
between  theory  and  experiment,  except  for  configuration  3.  The  reason  for  this  discrepancy  is  because  the  measurement 
precision  for  this  configuration  was  inexplicably  better  than  the  measurement  precision  for  the  other  configurations.  The 


cause  of  this  (fortunate)  anomaly  is  yet  to  be  determined.  In  Table  5  the  numerical  results  of  the  sensitivity  analysis  for  the 
theoretical  sample  "P200"  are  shown. 

The  inspection  of  Tables  4  and  5  does  not  reveal  a  marked  improvement  of  the  sensitivity  compared  to  the  ellipsometric- 
scatterometer.  It  is  not  even  clear  if  there  is  any  improvement,  although  the  phase-modulation  scatterometer  provides  more 
information,  both  quantitatively  and  qualitatively,  than  the  ellipsometric-scatterometer.  The  reason  was  ascribed  to  the 
additional  sources  of  errors  introduced  by  a  more  complicated  experimental  arrangement.  Specifically  the  instability  of  the 
phase  modulation  is  an  important  perturbation  factor.  This  assertion  is  proved  by  the  fact  that  the  empirical  error  function 

(29)  is  considerably  higher  than  for  the  ellipsometric  scatterometer  6. 

There  is,  however,  an  important  aspect  in  Table  4  that  makes  our  effort  of  developing  a  phase-modulator  scatterometer 
worthwhile.  Namely,  we  notice  what  we  believe  is  an  increased  accuracy.  Actually  there  is  no  reason  to  doubt  the  absolute 
accuracy  of  these  results  since  the  dispersion  of  the  experimentally  determined  values  for  the  linewidth  are  almost  within  the 
theoretical  uncertainties.  If  we  ignore  the  insensitive  configurations  2  and  3,  we  have  a  both  precise  and  accurate 
determination  of  the  linewidth,  which  is  clearly  bound  to  lie  within  the  interval  1460-1470  A.  The  perceived  accuracy  is  the 
most  important  improvement  shown  by  the  phase-modulation  scatterometry  technique,  compared  to  previous  scatterometry 
techniques.  This  is  attributed  to  the  fact  that  the  measurables  are  real-time  ratios.  The  numerator  and  the  denominator  are 
simultaneously  measured,  in  exactly  the  same  area  of  the  photodetector,  unlike  the  ellipsometric-scatterometer,  thus  allowing 
many  systematic  errors  to  be  eliminated.  Contrary  to  the  ellipsometric-scatterometer,  the  normalization  measurement  Iq  is 
taken  in  the  same  conditions  as  Is  and  Ic.  The  increased  accuracy  that  is  associated  with  the  phase-modulation  scatterometer 
leads  us  to  the  conclusion  that  in  the  systematic  errors  manifest  in  Table  4,  the  external  causes  outweigh  the  internal  ones. 


8.  CONCLUSION 

Scatterometry  has  been  a  valuable  metrology  tool  to  characterize  structures  with  CD’s  as  small  as  100  nm  and  smaller.  To 
compensate  for  the  loss  of  sensitivity  due  to  the  miniaturization  trend  in  microelectronics,  we  have  designed  a  methodology 
for  finding  the  optimum  measurement  configuration.  The  technique  exploits  experimental  parameters  that  have  optimum 

sensitivity  to  changes  of  the  spatial  parameters  of  the  grating.  This  method,  which  we  introduced  previously  is  termed 
"sensitivity  analysis  for  fitting"  (SAF),  and  it  is  a  mathematical  algorithm  that  can  be  applied  for  optimization  purposes 
regardless  of  the  specific  type  of  measurements  that  are  performed. 

In  the  work  discussed  here,  we  applied  SAF  to  a  new  experimental  arrangement  that  has  an  element  added  to  the 
classical  experimental  arrangement  -  a  phase  modulator.  This  element  acts  to  provide  more  information  regarding  grating 
properties.  Additionally,  the  modulator  allows  three  signals  (harmonics  of  the  phase  modulation  signal)  to  be  measured 
simultaneously  with  lock-in  amplifiers,  which  means  that  the  speed  of  acquiring  information  is  increased.  By  normalizing  the 
two  superior  harmonics  to  the  fundamental  harmonic  (DC  term),  we  eliminate  systematic  errors,  such  as  the  spatial  variations 
of  the  sensitivity  of  the  photodetector  and  laser  fluctuations.  However,  phase  modulation  scatterometry  has  some 
disadvantages.  It  is  more  sensitive  to  residual  birefringence  of  the  optical  elements  than  classical  scatterometry.  Because  of 
residual  birefringence  and  higher  modulation  orders  induced  by  the  modulator  piezoelectric  transducer,  correction  terms  need 
to  be  applied  to  the  measurements.  These  correction  terms  have  to  be  calculated  prior  to  any  data  acquisition,  and  there  is  not 
an  easy  and  accurate  way  to  do  this  initial  calibration  automatically.  Additionally,  the  precision  of  the  phase  modulation 
impacts  the  precision  of  sample  parameter  measurements. 

Carefully  weighing  the  pros  and  cons,  we  will  illustrate  that  by  joining  the  phase  modulator  with  a  mathematical  optimization 
algorithm  (SAF)  and  classical  scatterometry,  we  obtain  an  improved  method  that  yields  superior  results.  The  technique  is 
able  to  satisfy  tomorrow's  metrology  needs  of  the  microelectronics  industry. 


REFERENCES 

1.  J.  R.  McNeil,  S.  S.  H.  Naqvi,  S.  M.  Gaspar,  K.  C.  Hickman,  S.  R.  Wilson,  in  Encyclopedia  of  Materials 
Characterization  C.  A.  E.  C.  Richard  Brundle,  Shaun  Wilson,  Ed.  (Manning  Publishing  Co.,  1992). 

2.  S.  S.  H.  Naqvi,  J.  R.  McNeil,  R.  H.  Krukar,  Z.  R.  Hatab,  "Grating  parameter  estimation  using  scatterometry," 
Miniature  and  Micro-Optics  and  Micromechanics,  Proc.  SPIE  1992 ,  pp.  170,  1993. 

3.  M.  R.  Murnane,  C.  J.  Raymond,  S.  S.  H.  Naqvi,  J.  R.  McNeil,  "Subwavelength  photoresist  grating  metrology  using 
scatterometry,"  Proc.  SPIE  2532 ,  pp.  251-261,  1995. 


4.  B.  K.  Minhas,  S.  A.  Coulombe,  S.  S.  H.  Naqvi,  J.  R.  McNeil,  "Ellipsometric-Scatterometry  for  the  metrology  of  sub- 
0.1  (im  linewidth  structures",  Appl.  Opt.  37(22),  pp.  5112-5115,  1998. 

5.  S.  A.  Coulombe,  P.  C.  Logofatu,  B.  K.  Minhas,  S.  S.  H.  Naqvi,  J.  R.  McNeil,  "Ellipsometric-Scatterometry  for  sub- 
0.1  mm  CD  measurements,"  Microlithography ,  P roc.  SPIE  3332,  pp.  282-293,  1998. 

6.  P.  C.  Logofatu,  J.  R.  McNeil,  "Sensitivity  analysis  of  fitting  for  scatterometry,"  Microlithography  XIII,  Proc.  SPIE 
3677,  pp.  177-183,  1999. 

7.  S.  N.  Jasperson,  S.  E.  Schnatterly,  "An  improved  method  for  high  reflectivity  ellipsometry  based  on  a  new 
polarization  modulation  technique",  Rev.  Sci.  Instrum  40(6),  pp.  761-767,  1969. 

8.  T.  C.  Oakberg,  "Modulated  interference  effects:  use  of  photoelastic  modulators  with  lasers",  Optical  Engineering 
34(6),  pp.  1545-1550,  1995. 

9.  O.  Acher,  E.  Bigan,  B.  Drevillon,  "Improvements  of  phase  modulated  ellipsometry",  Rev.  Sci  Instrum.  60(1),  pp.  65- 
77,  1989. 

10.  D.  W.  Mills,  R.  L.  Allen,  "Spectral  ellipsometry  on  patterned  wafers,"  1995. 

11.  P.  C.  Logofatu,  dissertation,  University  of  New  Mexico  (2000). 


Measurement  precision  of  optical  scatterometry 

Petre  C.  Logofatu*  and  John  R.  McNeil 

Center  of  High  Technology  Materials,  University  of  New  Mexico,  Albuquerque  NM  87106 


ABSTRACT 

In  the  work  reported  here,  we  discuss  the  measurement  precision  of  two  scatterometry  techniques,  the  variable  angle  and  the 
variable  wavelength  techniques.  The  issue  of  interest  is  the  measurement  precision  of  the  sample  parameters.  This  is 
determined  by  both  the  sensitivity  of  the  diffraction  measurable  to  changes  in  sample  parameters  and  the  precision  with 
which  the  measurable  can  be  determined.  This  approach  includes  taking  into  account  the  correlation  effect  between  the 
contribution  to  the  measurable  of  the  various  grating  parameters  to  be  determined,  such  as  linewidth  and  height.  The 
comparison  of  the  theoretical  predictions  of  precision  for  angle-resolved  and  wavelength-resolved  scatterometer 
measurements  shows  no  conclusive  hierarchy.  Practical  considerations,  however,  indicate  that  angular-resolved  scatterometry 
is  a  more  advantageous  technique.  For  both  methods,  decreasing  the  wavelength  of  the  light  source  improves  the 
determination  precision  of  the  sample  parameters. 

Keywords:  scatterometry,  ellipsometry,  error  analysis,  spectroscopy,  sensitivity,  measurement  precision 


1.  INTRODUCTION 

Scatterometry  is  a  method  of  inferring  the  shape  of  microstructures  from  how  these  structures  affect  light  in  reflection^. 
The  technique  represents  a  metrology  tool  for  determining  the  geometrical  parameters  of  microelectronics  test  structures  from 
measurements  of  the  light  diffracted  by  these  structures.  For  clarity,  suppose  the  measurable  is  a  quantity  q  (generally  the 

reflectance)  which  is  a  function  of  the  sample  geometrical  parameters  that  we  want  to  determine  pgeo  (e.g.  linewidth,  grating 
height,  underlying  layer  heights,  etc.),  the  sample  optical  parameters  that  we  know  a  priori  popt  (refractive  indices  of  the 

materials  and  grating  pitch),  and  the  parameters  of  the  experimental  arrangement  that  we  control  pexp  (azimuth  angle  of  the 
sample,  incidence  angle  and  wavelength  of  the  beam,  and  input  and  output  polarization);  or 

q  =  q(p^,p°P‘,pexp).  (1) 

We  can  generate  theoretical  or  experimental  curves  of  q  versus  pexp  by  varying  the  experimental  parameter  pexp  .  The  choice 
of  this  variable  experimental  parameter  defines  a  scatterometry  technique.  If  the  choice  is  the  incidence  angle,  we  have  angle- 
resolved  scatterometry^"^.  If  the  choice  is  the  wavelength,  we  have  spectroscopic  scatterometry^. 

It  is  generally  recognized  that  the  most  important  aspect  of  a  metrology  technique  is  the  measurement  precision  of  the 
quantities  to  be  determined.  In  this  paper  the  two  scatterometry  techniques  mentioned  above  will  be  analyzed  and  compared 
primarily  with  this  in  mind. 


2.  MEASUREMENT  PRECISION  OF  THE  SAMPLE  PARAMETERS 

A  rigorous  definition  of  the  measurement  precision  for  scatterometry  is  expressed  by  taking  into  account  the  determination 
procedure  of  the  sample  parameters.  Namely,  they  are  obtained  by  fitting  theoretically  generated  curves  to  experimental 
curves.  The  most  probable  values  for  the  sample  parameters  are  those  parameters  corresponding  to  the  curve  which  has  the 
minimum  weighted  mean  square  difference  with  the  experimental  curve,  or 
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where  ot  is  the  measurement  precision  of  qt.  This  fitting  method,  also  called  least  squares  fit,  yields  the  following 
determination  precision  of  the  parameters  pf0  ^  ^ 

cj{pr)=fi 7  ’  (3) 

where  a  is  the  curvature  matrix^ 


ak,i  ~ 


1  d2X2  =  f  1  Qu  dqt 

2  dpfodPr  h  cj-  dpf°  dpfe° ' 


(4) 


Here  /,  k  and  /  range  from  1  to  M,  the  number  of  parameters  to  be  determined.  M  is  the  dimension  of  the  vector  p  f°  and  of 
the  matrices  a  and  a'.  We  observe  that  for  the  rigorous  estimation  of  (3),  one  needs  both  the  measurement  precision  of  the 
measurables  and  the  sensitivity  of  the  measurables  qt  with  respect  to  changes  of  the  sample  parameters  pgeo ,  both  of  which 
are  expressed  in  (4).  We  also  note  for  later  references  that  the  curvature  matrix  is  symmetric,  i.e.  a12=(X2i- 


2.1.  Measurement  precision  of  the  measurables 

The  measurement  precision  of  the  measurables  is  a  purely  experimental  issue,  independent  of  the  sensitivity  of  the 
measurables  with  respect  to  the  sample  parameters.  There  are  various  factors  that  influence  the  measurement  precision.  For 
the  particular  case  of  the  angle-resolved  scatterometer  in  our  laboratory,  we  isolated  as  possible  sources  of  errors  the  power 
fluctuations  of  the  laser,  mechanical  vibrations  due  to  the  movements  of  parts  of  the  system,  and  finite  resolution  of  the  lock- 
in  amplifier.  Typically  the  measurable  is  the  reflectance  of  the  system,  R.  The  measurement  precision  of  R ,  assuming 

influences  from  all  the  perturbation  factors  enunciated  above,  has  the  form^’  ^ 

v2(R)  =  vl+cRR2+ce[ j|j  .  (5) 

Here  ores  is  the  contribution  of  the  finite  resolution,  cR  is  the  coefficient  of  the  laser  fluctuations  contribution,  which  is  linear 
in  R,  and  ce  is  a  coefficient  that  describes  the  contribution  of  the  mechanical  vibrations  of  the  system  on  the  incidence  plane. 
The  latter  term  in  Eq.  (5)  integrate  the  uncertainties  of  the  incidence  angle  6  in  the  uncertainty  of  the  measurable  R.  The  terms 

in  the  sum  (5)  are  squared  because  we  have  no  reason  to  believe  that  the  various  contributions  are  correlated^.  Using  (5)  as  a 
fitting  function  for  the  measurement  error  of  R ,  one  obtains  the  coefficients  from  the  right  hand  side  of  (5).  We  will  refer  to 
Eq.  (5)  as  ’’the  empirical  error  function”. 


2.2.  Sensitivity 

The  sensitivity  of  the  measurable  with  respect  to  the  parameters  of  the  sample  is  an  intrinsic  characteristic  of  the  sample, 
irrespective  of  the  experimental  conditions.  As  expressed  above  in  (4),  the  situation  in  which  k=l  accounts  for  the  sensitivity 
of  the  measurable  to  changes  in  a  single  sample  parameter.  The  situation  in  which  k^l  accounts  for  the  correlation  effects  of 
changes  in  multiple  sample  parameters.  All  situations  should  be  considered  in  the  determination  of  (3). 

Inspecting  Eqs.  (3)  and  (4)  we  notice  that  it  is  difficult  to  analytically  define  the  sensitivity  unless  all  are  equal  and  can  be 
factored.  For  an  arbitrary  empirical  error  function,  sensitivity  and  measurement  error  cannot  be  uncoupled.  This  situation  has 
been  accommodated  in  the  results  presented  below. 

For  illustration  purposes,  let  us  assume  that  the  empirical  error  function  (5)  is  a  constant,  i.e. 


cf.  =  Gq  =  const 


(6) 


Fig.  1  Partial  derivative  of  the  measurable  Rpp  with  respect  to  the 
linewidth  Iw  for  the  sample  "XL6"  versus  the  incidence  angle  9  and 
azimuth  angle  <j>. 


Then  we  can  rewrite  Eqs.  (3, 4)  as 
<j{pjeo)=(y‘l^yJ  =  (Tq/ sens  ,  (7) 
where  y  is 


From  Eq.  (7)  it  follows  quite 
intuitively  that  sens ,  the  inverse 

of  ,  can  be  chosen  as  the 

quantitative  definition  of  the  intrinsic 
sensitivity  of  the  measurables  with 
respect  to  the  sample  parameter  pj. 
Eq.  (7)  clearly  illustrates  our 
statement  that  a  good  determination 
precision  of  the  sample  parameters  is 
achieved  when  we  have  good 
measurement  precision  of  the 
measurables  (small  <jq)  and  high 
sensitivity  (large  sens). 

More  generally  we  have  two  aspects 
that  need  to  be  taken  into  account  for 
a  correct  evaluation  of  the  sensitivity. 
First,  the  absolute  magnitude  of  the 

partial  derivatives  dqt  / dpf°  must  be 


large,  and  second,  their  correlation 

that  must  be  low.  Fig.  1  shows  the  partial  derivative  of  the  ellipsometric  measurable  Rpp  (the  reflectance  for  p  input 
polarization  and  p  output  polarization)  for  the  sample  labeled  "XL6",  with  parameters  defined  in  Table  1.  Intuitively  one  is 
drawn  to  the  conclusion  that  the  measurements  should  be  made  at  pairs  of  angles  G  and  0  where  the  partial  derivative  is  large 
(i.e.  the  sensitivity  is  high).  In  many  cases  this  is  true,  but  not  necessarily.  The  reason  is  that  generally  more  than  one  sample 
parameter  is  fit.  The  variation  of  the  measurement  curve  caused  by  the  variation  of  a  certain  parameter  p 1  might  be  similar  to 
that  for  the  variation  of  another  parameter  p2.  If  their  correlation  is  strong,  then  two  curves  corresponding  to  two  different  sets 
of  parameters  (p h  p2)  may  be  indistinguishable.  This  can  happen  even  if  we  have  large  partial  derivatives,  as  in  Fig.  1,  for 
large  6  and  small  0.  In  this  latter  case,  if  we  change  the  sign  of  the  variation  of  one  of  the  parameters,  the  curves  might  be 
well  separated  (see  Fig.  2.  for  illustration).  However,  this  does  not  change  the  situation  because  the  worst  case  defines  the 
real  sensitivity. 


Table  1.  Geometrical  and  optical  parameters  of  the  samples  investigated.  The  subscripts  "g" 
and  "s"  refer  to  the  grating  and  the  substrate,  respectively.  The  layers  from  the  grating  to  the 
substrate  have  numerical  subscripts  starting  with  1.  Sample  "XL6"  is  real,  and  sample 
"P200"  is  theoretical. 


Sample 

A  (A) 

lw  (A) 

hg(A) 

hi  (A) 

ng 

ni 

ns 

XL6 

3628.8 

1470 

5700 

1600 

1.64 

1.66 

3.882- 0.019  i 

P200 

2000 

700 

4000 

- 

1.64 

- 

3.882- 0.019  i 

The  above  considerations  can  be  better  understood  if  we  look  at  the  two-dimensional  case,  in  which  we  need  to  fit  two 
parameters.  In  this  situation  Eq.  (3)  becomes 


Correlation  effect 


(9.a) 


G 


*11 


a\\a22 


(9-b) 


We  have  small  determination  errors  if  the 
diagonal  terms  of  the  curvature  matrix  a  are 
large,  and  the  non-diagonal  one  is  small.  The 
diagonal  terms  are  the  sum  of  the  squares  of 
the  partial  derivative  amplitudes,  while  the 
non-diagonal  term  is  their  correlation,  which 
can  be  seen  by  particularizing  Eq.  (4)  as 


N  1 

i=l  o', 


dq, 


\2 


dpf° 


(lO.a) 


0^12  —  0^21 


f  1  dq,  dqt 
he jf  dpfeo  dpf°  ‘ 


(lO.b) 


Fig.  2.  Example  of  the  correlation  effect.  Curve  (b)  is  much 
stronger  correlated  with  (a)  than  (c),  although  the 
corresponding  sample  parameters  values  are  equally 
separated.  The  values  for  the  parameters  displayed  on  the 
graph  are  in  nanometers.  Case  (a)  "XL6". 


The  above  discussion  is  important  because, 
when  studying  the  sensitivity,  one  may  be 
tempted  to  overlook  the  correlation  term 
(lO.b)  and  concentrate  only  on  the  magnitude 

terms  (10. a)^. 


XL6  4>=90°  P=0°  A=0° 


Fig.  3  Determination  of  the  empirical  error  function;  no  intensity 
stabilizer  utilized  in  the  experimental  arrangement. 


3.  COMPARISON  OF 
SPECTROSCOPIC  AND  ANGULAR 
SCATTEROMETRY 

Now  that  we  have  a  method  for  determining 
the  measurement  precision  of  a  metrology 
technique,  we  can  apply  it  to  compare  the 
performances  of  spectroscopic  and  angular 
scatterometry.  A  difficulty  is  posed  by  the 
fact  that  we  do  not  have  an  empirical  error 
function  for  the  spectroscopic  scatterometer; 
we  will  approximate  this  subsequently.  We 
determined  the  empirical  error  function  for 
the  angle-resolved  scatterometer  by 
repeatedly  measuring  the  reflectance,  R ,  over 
a  wide  range  of  values.  The  statistical 
interpretation  of  the  batch  of  data  in  the  light 
of  Eq.  (5)  lead  us  to  the  empirical  error 
function 

cr2(i)=1.0xl0“8+1.3xl0_5i?2.  (9) 

We  did  not  notice  significant  correlation  of 
the  experimental  errors  with  dR/dO  .  Even 
for  the  other  two  terms,  the  correlation  was 
very  weak  (see  Fig.  3).  Eq.  (9)  is  just  the 


Table  2.  Experimental  results  illustrating  the 
improvement  in  the  linewidth  measurement 
precision  provided  by  the  use  of  the  laser 
intensity  stabilizer  in  the  case  of  the  sample 
"XL6".  The  wavelength  was  633  nm. 


Configuration 

cj(A) 

p(°> 

A  (°) 

<t>o 

Without 

stabilizer 

With 

stabilizer 

0 

0 

0 

4.9 

1.9 

0 

0 

45 

- 

3.2 

0 

0 

90 

6.3 

4.0 

90 

90 

0 

19.5 

0.6 

90 

90 

45 

- 

1.9 

90 

90 

90 

7.6 

1.4 

Table  2  illustrates  results  from  applying  the 
sensitivity  analysis  techniques  discussed  above  to 
sample  ”XL6”,  with  147  nm  linewidth.  We  performed 
repeated  (50)  measurements  of  the  sample  in  each  of 
6  measurement  configurations  to  illustrate  the 
influence  of  the  two  empirical  error  functions  on 
measurement  precision.  The  measurement  precision 
improves  significantly  (see  Table  2)  with  the 
improved  intensity  stabilization  and  the  smaller 
empirical  error  function  (10),  up  to  2-3  times 
improvement  depending  upon  the  measurement 
configuration.  The  wavelength  used  was  633  nm.  It 
can  be  seen  that  in  all  but  one  situation,  sub-nm 
precision  was  observed  for  both  empirical  error 
functions  (9)  and  (10). 


closest  approximation  (best  fit)  obtained.  For  this  functional 
form,  the  scatterometer  apparatus  did  not  contain  a  means  to 
stabilize  the  laser  intensity  or  to  normalize  the  measurements  to 
input  intensity.  Using  a  laser  intensity  stabilizer  (THORLABS 
’’Noise  Eater”),  the  empirical  error  function  becomes 

a2(i2)=l.Oxir8+5.OxlO“7i?2+l.OxlO“4(ai?/50)2.  (10) 

Again,  Eq.  (10)  is  just  the  closest  approximation  (best  fit);  see 
Fig.  4.  The  additional  term  in  (10)  is  the  result  of  much  lower 
intensity  variations  that  cause  additional  effects  to  be  influential. 


P  =  90  A  =  90  (|)  =  45 


Fig.  4  Determination  of  the  empirical  error  function,  with  a 
stabilizer  is  inserted  in  the  experimental  arrangement. 


Obtaining  empirical  error  functions  for  a 

spectroscopic  scatterometer  is  less  straightforward.  The  material  optical  properties  must  be  determined  over  the  wavelength 
range  of  interest.  This  may  not  be  an  issue  if  the  technique  is  used  in  concert  with  a  spectroscopic  ellipsometer.  However,  one 
must  be  confident  that  process  changes  that  might  occur  do  not  significantly  influence  material  optical  properties,  especially 

at  UV  and  blue  wavelengths.  The  wavelength  range  reported  in  recent  work  involving  this  technique^  is  230  nm  to  450  nm. 
Additionally,  if  a  detection  arrangement  is  used  that  consists  of  an  array,  all  elements  of  the  array  would  optimally  have  the 
same  empirical  error  function.  Another  distinction  of  the  spectroscopic  scatterometer  is  that  it  utilized  a  low  intensity, 
incoherent  light  sources. 


To  compare  measurement  precision  of  the  two  scatterometer  techniques  discussed  here,  we  need  an  empirical  error  function 
for  the  spectroscopic  scatterometer.  Not  having  one,  we  approximated  it  to  be  the  same  as  that  for  the  angular  scatterometer. 
In  light  of  the  remarks  from  the  previous  paragraph  about  the  measurement  precision  of  the  spectroscopic  scatterometer 
technique,  this  approximation  is  likely  quite  optimistic.  In  particular,  for  the  simulated  results  shown  in  Table  3,  we  used  the 
empirical  error  function  (9).  Table  3  indicates  results  for  each  of  the  four  measurement  configurations  indicated.  However,  as 
opposed  to  Table  2,  which  contains  experimental  results,  Table  3  has  only  calculated  simulations.  The  improvement  provided 
by  shorter  wavelength  is  apparent. 


The  conclusion  of  interest  that  inspection  of  Table  3  provides  is  that  there  is  no  conclusive  improvement  of  measurement 

precision  of  one  technique  over  the  other.  In  their  recent  paper,  Allgair  et  al^  state  that  spectroscopic  scatterometry  is  more 
sensitive  than  variable-angle  scatterometry.  However,  this  conclusion  was  drawn  from  comparing  the  sensitivity  of  variable- 
angle  measurements  utilizing  633  nm  light  to  variable -wave length  measurements  utilizing  light  over  the  range  of  230  nm  to 
450  nm.  Spectroscopic  scatterometry  would  be  indeed  more  sensitive  than  variable-angle  scatterometry  if  the  variable-angle 
technique  utilized  wavelengths  significantly  longer  than  those  utilized  in  variable-wavelength,  such  as  these.  Variable-angle 
scatterometry  shows  similar  sensitivity  and  precision  as  spectroscopic  scatterometry  if  light  sources  of  roughly  comparable 
wavelengths  are  utilized.  Additionally,  for  the  correct  assessment  of  sensitivity  we  need  to  take  into  account  of  correlation 
effects  as  well,  which  the  authors  neglected.  Finally,  as  we  shown,  superior  sensitivity  is  not  sufficient  for  superior  results. 


Table  3.  Simulation  results  for  variable  wavelength  scatterometry  (0=45°  and  wavelength  range  A=230...450 
nm)  and  variable-angle  scatterometry  (angular  range  6  =5°...7 0°).  In  both  cases  the  empirical  error  function  Eq. 
(9)  was  assumed. 


Configuration 

ct(A) 

V  ariable-wavelength 
scatterometer 

A=230...450  nm 

Variable-angle  scatterometer 

A,=325  nm 

A,=442  nm 

A,=633  nm 

Measurable 

Azimuth  <|)(0) 

XL6 

P200 

XL6 

P200 

XL6 

P200 

XL6 

P200 

RPP 

0 

0.85 

0.43 

6.60 

1.51 

2.65 

8.30 

1.8 

5.0 

Rss 

0 

0.35 

0.38 

0.50 

0.28 

0.95 

5.93 

0.6 

1.4 

RPP 

90 

0.42 

3.35 

0.42 

3.38 

4.30 

6.55 

15.0 

1.5 

Rss 

90 

0.06 

1.37 

0.42 

1.56 

0.72 

1.67 

8.4 

8.4 

4.  CONCLUSIONS 

The  results  presented  above  illustrate  the  importance  of  understanding  the  two  contributions  to  measurement  precision  of 
sample  parameters:  sensitivity  of  the  measurable  to  changes  in  sample  parameters,  and  the  precision  that  the  measurable 
quantity  can  be  determined  (i.e.  the  empirical  error  function).  A  correct  approach  of  the  sensitivity  analysis  must  also  include 
the  correlation  effects  that  one  may  be  tempted  to  overlook.  The  empirical  error  function  can  be  easily  determined  for  the 
angle-resolved  scatterometer  configuration.  The  use  of  an  intensity  stabilizer  can  significantly  improve  the  empirical  error 
function  and,  consequently,  the  measurement  precision  of  the  sample  parameters.  Experimental  results  illustrate  how  angle- 
resolved  scatterometry  can  provide  sub-nm  measurement  precision  of  a  nominal  147  nm  linewidth  sample.  Using  this  same 
empirical  error  function  together  with  calculations  of  sensitivity  of  R  to  changes  in  a  70  nm  linewidth  sample,  simulations 
indicate  sub-nm  measurement  precision  is  provided  by  the  variable-angle  scatterometer  using  visible  light.  Parallel 
simulations  for  the  spectroscopic  scatterometry  and  the  angle-resolved  scatterometry  show  comparable  precision  results. 
Practical  considerations  indicate  the  angle-resolved  technique  is  advantageous  to  the  spectroscopic  technique.  As  a  general 
trend  for  both  scatterometry  techniques,  the  use  of  a  shorter  wavelength  improves  the  sensitivity  and  the  measurement 
precision  of  the  sample  parameters. 
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Analysis  of  phase  measurement  error  for  null  generalized 
ellipsometry  using  the  phase  compensator 
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Abstract:  An  error  analysis  of  null  generalized  ellipsometry  for  phase  measurement  using  a 
Babinet  compensator  is  presented.  It  turns  out  that  there  are  errors  only  in  a  second  order 
approximation,  which  means  this  technique  provides  high  sensitivity  for  phase  measurements.  The 
sensitivity  can  be  improved  by  equalizing  the  amplitudes  of  the  x  and  y  components  of  the  field  at  the 
Babinet  compensator  but  this  complicates  considerably  the  experimental  procedure. 

1.  Introduction 

Phase  measurement  techniques,  although  known  for  more  than  a  century,  are 
increasingly  used  today  for  their  sensitivity  and  accuracy  [1-4].  On  the  other  hand  null 
measurements  are  reputed  for  being  very  sensitive  [5].  Therefore  null  measurements  of  the 
phase  using  a  Babinet  compensator  (null  ellipsometry)  are  likely  to  provide  high  sensitivity.  In 
our  paper  ellipsometry  is  used  for  the  determination  of  the  phase  change  in  reflection  for  an 
anisotropic  surface.  This  type  of  surfaces  cannot  be  described  just  by  two  complex  reflection 
coefficients  rp  and  rs.  Polarization  conversion  of  light  requires  four  complex  reflection 
coefficients  for  a  complete  description  of  the  optical  properties  of  the  surface,  rpp,  rsp,  rps  and 
rss,  a  formalism  called  generalized  ellipsometry  [6-8].  We  derive  an  expression  for  the  error  of 
phase  change  measurement  in  these  conditions  and  it  turns  out  that  there  is  an  error  only  on  a 
second  order  approximation,  which  is  an  eloquent  argument  in  favor  of  the  sensitivity  of  this 
measurement  technique. 

The  experimental  arrangement  for  phase  measurement  is  illustrated  in  Fig.  1.  The  beam 
coming  from  left  is  linearly  polarized  by  the  polarizer  P  and  becomes  elliptically  polarized 
after  reflection  on  the  grating.  The  compensator  C  compensates  for  the  phase  shift  between  x 
and  y  component  and  the  beam  is  again  linearly  polarized.  Another  polarizer  A  (analyzer)  is 
set  perpendicular  on  the  incident  beam,  which  vanishes  on  transmission.  The  phase 
measurement  is  a  null  measurement  and  we  nullify  the  light  intensity  by  an  appropriate 
combination  of  phase  shift  introduced  by  the  compensator  and  polarization  angle  of  the 
analyzer. 

The  ultimate  goal  is  the  determination  of  the  optical  properties  of  the  material,  i.e.  the 
reflection  coefficients.  The  measurable  must  be  a  function  of  these  coefficients, 

2.  Error  analysis 


l 


For  calculations  we  will  use  the  Jones  matrix  approach  [9].  The  input  components  of 
the  beam  are 


E f  =  Ein  cosip p, 
E;  =  Ein  sim pP. 


Upon  reflection  the  field  becomes 
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The  phase  shift  due  to  reflection  on  the  sample  is 
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The  compensator  will  introduce  a  phase  shift  Ac  which  in  the  end  is  supposed  to  be  equal  to  A 
but  for  now  we  assume  is  arbitrary.  Then  we  have 
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After  the  analyzer  the  beam  is 
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The  output  intensity  measured  by  the  detector  D  is 
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where 


and 


Iref  is  the  intensity  that  can  be  measured  by  a  detector  right  after  the  reflection  on  the  sample 
and  also  happens  to  be  the  maximum  output  at  the  detector  D  in  Fig.  1  (when  the  analyzer  A  is 


If  =  Iref  cos2  a, 
If  =Iref  sin2  a, 


tan(a)= 


rsp  cosi pP  +  rss  simp  j 


rPP  cosi pp  +  rps  sin  ipP 


(8) 


(9) 
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parallel  to  the  linearly  polarized  output  beam  and  if  there  are  no  pure  transmission  losses  in  the 
compensator  and  the  analyzer).  Obviously,  from  Eq.  (3),  Iref  is  of  the  form 

!ref  =  j^p  cos ipP  +  rss  simpP\2  + 1 rpp  cos ipP  +  rps  simpP\2  )in ,  (10) 

with 

/"'  =  I  Ein  2 .  (11) 

2 

a  is  the  polarization  angle  of  the  beam  when  the  phase  shift  is  compensated  and  also  the  angle 
at  which  there  is  one  of  the  semiaxes  of  the  polarization  ellipse.  Using  Eqs.  (8)  and  (9)  we  can 
rewrite  (7)  as 

jout  _  /re/(p0S2^  cos2  a +  sin2xp Asin2  a +  2sinip A  cos ipA  sinacosacos(Ac  -  A)J.(12) 

To  obtain  an  expression  for  the  phase  measurement  error  we  derive  the  output  Iout  on  the  phase 
A: 

pi  TOUt  1 

- =  — sin(2xpA  )sin(2a)sin(A  -  Ac  )l'ef .  (13) 

OA  2 

At  null  conditions  one  has  =a±Jt/2  and  A~Ac.  Then  it  follows 

8Iout  =  ^-|sin2(2«)/re/  |<5A2 ,  (14) 

or 


where  <5/(o)  is  the  measurement  error  for  output  intensity  when  this  intensity  is  zero. 
Generally  this  is  the  minimum  value  of  the  error. 

3.  Comments 

It  is  important  to  note  that  the  error  appears  only  in  the  second  order  approximation. 
There  is  no  measurement  error  for  the  phase  in  a  first  order  approximation.  This  is  a  direct 
consequence  of  the  fact  that  a  null  measurement  type  technique  is  used.  It  must  be  also 
mentioned  that  these  considerations  apply  to  precision  not  to  accuracy.  They  are  valid  only  if 
random,  unbiased  errors  are  assumed. 

This  relation  makes  sense,  and  we  can  derive  it  just  from  intuitive  considerations. 
Since  the  phase  measurement  is  a  null  measurement,  the  larger  is  the  input  to  nullify,  the  more 
precise  is  the  measurement  of  the  phase.  And  we  can  see  that  for  large  values  of  Iref,  6A  is 
indeed  smaller.  Also,  the  precision  of  the  phase  compensation  is  better  when  the  components 
Erf  and  Erf  have  comparable  amplitudes  and  is  maximum  when  they  are  equal.  The 

precision  of  the  phase  compensation  is  obviously  related  to  how  large  is  the  difference 
between  the  maximum  and  the  minimum  output  at  the  detector,  because  a  large  difference 
allows  you  a  better  discrimination  of  the  minimum.  Or  this  difference  is  a  maximum  when 

\e?  -  I E«  (16) 
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and  is  minimum  when  one  of  the  components  is  zero.  Indeed  the  smaller  error  is  for  a=45°, 
when  Eq.  (16)  is  fulfdled  and  becomes  infinite  for  a=0°  or  a=90°,  i.e.  Erf  or  Erf  is  zero. 

From  Eq.  (12)  if  we  allow  the  compensating  phase  shift  Ac  to  vary  we  obtain 

7max  _  7mi„  _  | sin(2xp  A  >in(2a  )ref  .  (17) 

This  quantity  is  obviously  maximum  when  a=45°  and  is  minimum  when  a=0°  or  a=90°. 

Therefore,  an  important  enhancement  of  the  precision  of  the  phase  measurement  can  be 
done  if  we  rotate  the  axes  of  the  compensator  with  an  amount  0  necessary  to  equilibrate  the  x 
and  y  components.  In  this  case  Eq.  (15)  becomes 


However,  A  becomes,  instead  Eq.  (4), 

(Eref,\  (  Efcosd  +  Efsmd 

A  -  ai'S  Eref'  ~aTg  -Ef  smd  +  Ef  cos  6 
\  y  /  \  x  y 

where  Erxef'  and  Eref'  are  the  components  of  the  reflected  beam  in  the  rotated  frame. 

Although  the  rotation  of  the  compensator  axes  can  dramatically  improve  the  precision,  in 
practice  is  difficult  to  implement  and  it  may  not  be  worth  the  trouble.  To  find  the  angle  0,  you 
need  to  scan  the  output  at  the  detector  for  various  angles  i|m  with  the  compensator  inactive 
until  you  find  two  positions  say  ipi  and  i|io  separated  by  90°  where  the  output  is  equal.  Then 
you  have  to  rotate  the  compensator  so  that  the  x  and  y  axes  matches  the  two  positions.  Then  0 
must  be  noted  for  further  use  in  calculations.  This  procedure  must  be  repeated  for  each 
measurement  and  for  each  measurement  we  have  a  different  value  for  0. 

One  must  be  aware  that  Eq.  (15)  is  valid  only  in  the  approximation  that  there  are  no 
pure  transmission  losses  in  the  compensator  and  the  analyzer.  Generally  we  do  have  this  kind 
of  losses  and  Eq.  (15)  must  be  modified  accordingly 


where  Tc  is  the  transmission  of  the  compensator  and  TA  is  the  maximum  transmission  of  the 
analyzer,  when  the  beam  is  linearly  polarized  and  the  analyzer  is  parallel  to  the  beam 
polarization. 

4.  Conclusion 


We  have  shown  using  quantitative  arguments  that  null  ellipsometry  using  a  Babinet 
compensator  is  a  highly  sensitive  phase  measurement  technique.  The  first  order  approximation 
of  the  error  is  zero  and  he  have  to  go  to  the  second  order  approximation.  By  rotating  the 
compensator  to  a  position  where  the  x  and  y  components  are  of  equal  amplitude  the  sensitivity 
can  be  considerably  increased  but  this  complicates  the  experimental  procedure  and  the 
computations  and  introduces  supplementary  sources  of  errors. 
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Figure  captions 

Fig.l  Experimental  arrangement  for  phase  measurement  using  a  Babinet  compensator. 
P  is  the  polarizer,  A  the  analyzer,  C  the  phase  compensator,  D  the  detector,  K  the  grating 
vector,  cpthe  azimuth  angle,  0  the  incidence  angle,  rpp  the  polarizer  angle,  ip  a  the  analyzer 
angle  and  Em,  Eref,  Ec  and  Eout  are  the  light  field  at  different  positions  in  the  system. 
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Abstract 

Based  on  the  metals  property  of  modifying  the  phase  of  an 
incident  beam,  a  simple  method  for  identifying  the  fast  axis  of  a 
wave  plate  independent  of  a  standard  retarder  is  demonstrated. 
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Phase-modulation  scatterometry 


Petre  C.  Logofatu 


Phase-modulation  scatterometry  is  a  metrology  technique  for  determining,  by  means  of  a  phase  modu¬ 
lator  as  a  key  device,  the  parameters  of  gratings.  The  main  source  of  error  to  be  dealt  with  are  the 
fluctuations  of  the  phase-modulation  amplitude.  The  grating  zeroth-order  reflectance  modulated  by  the 
phase  modulator  is  converted  into  a  signal  by  the  photodetector.  The  measurables  are  the  direct  term 
and  the  first  two  harmonics  of  the  signal.  For  experimental  data  fitting,  we  used  the  ratio  of  the 
harmonics  over  the  direct  term  because  it  significantly  improves  the  accuracy.  A  sensitivity  analysis 
was  performed  for  two  samples,  one  real  and  one  theoretical,  to  find  the  measurement  configuration  that 
insures  optimum  determination  precision  for  the  grating  parameters.  For  the  real  sample,  comparison 
of  the  theoretical  predictions  for  sensitivity  with  the  actual  values  showed  a  good  agreement.  For  both 
samples  the  sensitivity  analysis  indicated  subnanometric  precision  for  the  critical  dimension  (grating 
linewidth).  ©  2002  Optical  Society  of  America 
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1.  Introduction 

In  the  ongoing  effort  to  develop  scatterometry  as  a 
sensitive  metrology  technique,  researchers  continue 
to  improve  it  by  implementing  new  measurement 
techniques.1-6  In  this  paper  the  implementation  of 
the  phase-modulation  technique  (PMT)  in  scatterom¬ 
etry  is  discussed.  PMT  was  first  applied  for  optical 
measurement  by  Jasperson  and  Schnatterly7  and  has 
been  continuously  developed  since  then.8’9  The 
technique  was  previously  applied  to  scatterometry,9 
but  an  analysis  of  the  sensitivity  and  simplified  syn¬ 
thetic  formulas  for  the  measurables,  to  our  knowl¬ 
edge,  are  not  available  in  the  literature  and  are  thus 
an  original  contribution.  Generally,  we  stressed  the 
new  and  original  elements  adduced  in  this  paper,  but, 
when  required  by  the  intelligibility  of  the  text,  we 
briefly  reviewed  other  authors'  research.  The  main 
contribution  presented  here  is  the  sensitivity  analy¬ 
sis,  formally  identical  to  the  one  performed  for  the 
ellipsometric  scatterometer  discussed  in  Ref.  6, 
namely,  the  application  of  the  sensitivity  analysis  for 
fitting  (SAF)  formalism  for  the  phase-modulation 
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scatterometer.  The  same  samples  that  were  ana¬ 
lyzed  in  Ref.  6,  labeled  XL6  and  P200,  are  analyzed 
here  as  well,  and  comparison  between  the  sensitivity 
of  the  ellipsometric  scatterometer  and  the  phase- 
modulation  scatterometer  is  made.  As  one  can  see 
from  Ref.  6,  sample  parameters  have  two  categories. 
First,  one  has  the  optical  parameters  that  are  known 
a  priori  from  other  types  of  measurement  than  scat¬ 
terometry,  the  pitch  A  and  the  refraction  indices  of 
the  materials  of  which  the  sample  is  made.  Second, 
one  has  the  geometric  parameters  that  must  be  esti¬ 
mated  from  scatterometry  measurements:  the  line- 
width  Iw  (or  the  critical  parameter),  the  grating 
height  hg ,  and  heights  of  the  underlying  layers  ht. 
Although  all  the  parameters  in  the  second  category 
are  calculated  from  scatterometry  measurements, 
the  sensitivity  analysis  is  performed  only  for  the  pa¬ 
rameter  of  interest  in  the  microelectronics  industry, 
the  critical  parameter  Iw.  The  diffraction  calcula¬ 
tions  in  this  paper  were  made  with  the  rigorous 
coupled-wave  analysis  (RCWA).10 11 

2.  Experimental  Arrangement 

The  experimental  arrangement  that  we  call  phase- 
modulator  scatterometer  is  still  an  ellipsometric  scat¬ 
terometer.  Compared  with  the  scatterometer 
described  in  Ref.  6,  there  is  only  one  new  device  added 
to  the  system,  the  phase  modulator  (see  Fig.  1).  This 
one  device,  however,  radically  modifies  the  nature  of 
the  experimental  arrangement,  as  we  shall  see  in  the 
following. 

The  phase  modulator  we  used  was  a  PEM-90  phase 
modulator,  manufactured  by  Hinds  Instruments 
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(Hillsboro,  Oregon),12  with  an  oscillation  frequency  of 
50  kHz.  The  modulator  introduces  a  variable  phase 
retardation  along  one  of  the  polarization  axes,  caus¬ 
ing  the  eccentricity  of  the  polarization  ellipse  to  os¬ 
cillate.  The  analyzer  converts  the  light  with 
oscillating  polarization  into  light  with  oscillating  am¬ 
plitude,  which  is  collected  by  the  photodetector.  A 
lock-in  amplifier  measures  the  harmonics  of  the  sig¬ 
nal.  These  harmonics  are  the  measurables  of  the 
experimental  arrangement,  and  they  are  functions  of 
the  geometric  parameters  of  the  sample. 

The  experimental  parameters  of  the  phase- 
modulation  scatterometer  are,  as  seen  in  Fig.  1,  the 
polarizer  polarization  angle  P,  the  orientation  of  the 
modulator  axis  of  variable  retardation  M,  the  inci¬ 
dence  angle  of  the  light  onto  the  sample  0,  the  azi¬ 
muth  angle  of  the  sample  4>,  and  the  analyzer 
polarization  angle  A.  Parameters  P,  M,  c|),  and  A  are 
constant  during  the  measurement  process,  and  they 
define  a  measurement  configuration.  The  domain  of 
these  parameters  is  scanned  to  find  the  configuration 
with  optimum  sensitivity.  The  angle  0  is  variable 
during  the  experiment  and  is  typically  varied  from  5° 
to  70°  in  steps  of  1°.  A  He-Ne  laser  (the  wavelength 
X  =  633  nm)  was  the  light  source  used  in  this  exper¬ 
iment. 

3.  Measurables 

From  Fig.  1  one  sees  that  the  transformation  in¬ 
curred  by  the  beam  in  the  phase-modulator  scat¬ 
terometer  is  described  by 

Eout  =  ArMPEin,  (1) 

where  the  Jones  matrices  for  analyzer,  sample,  and 
modulator 


A  = 


cos2  A  cos  A  sin  A 
sin  A  cos  A  sin2  A 


M  = 


X 


cos  M  —sin  M 
sin  M  cos  M 

cos  M  sin  M 
-sin  M  cos  M 


exp[— j'8(^)]  0 

0  1 


respectively,  were  used,  with 

8(£)  =  si  sin(o ot), 


(2a) 


(2b) 


(2c) 


(3) 


which  is  the  variable  phase  retardation  introduced  by 
the  modulator  and  in  which  si  is  the  phase- 
modulation  amplitude.  The  matrix  of  the  polarizer 
is  simply  the  analyzer  matrix  with  A  substituted  with 
P.  Values  Eout  and  Em  are  the  output  and  the  input 
vector  electric  fields,  respectively.  The  coefficients 
ruv  of  the  reflection  matrix  r  describe  the  matrix  ac- 


Fig.  1.  Experimental  arrangement  of  the  phase-modulator  scat¬ 
terometer. 


cording  to  generalized  ellipsometry.13  After  making 
the  calculations  in  Eq.  (1)  we  obtain 

r  =  Eout/Ep  = 

=  cos (P  M)(rP  cos  M  +  rs  sin  M) 

+  sin(P  -  M)(—rP  sin  M 
+  rs  cos  Af)exp[-i/'8(£)],  (4) 

where  we  used  the  notations 


-pi  out 


=  rpp  cos  A  +  rsp 
=  rps  cos  A  +  rss 


=  E° 


cos  A 
sin  A 


sin  A, 
sin  A, 


(5a) 

(5b) 

(6a) 


Ep 


(E{p  cos  P  +  E ™  sin  P) 


cos 

sin 


P 

P 


=  EP 


cos 

sin 


P 

P 


(6b) 


The  notations  in  Eqs.  (5a)  and  (5b)  were  chosen  for 
convenience  reasons,  but  they  have  a  certain  physical 
meaning  as  well:  namely,  rp  is  the  contribution  of 
the  p  component  of  the  beam  incident  on  the  sample 
to  the  output,  and  rs  is  the  similar  contribution  of  the 
s  component. 

Squaring  the  absolute  value  of  r  in  Eq.  (4)  we  ob¬ 
tain  in  terms  of  power 


I(t)  =  I0  +  Is  sin  b(t)  +  Ic  cos  8(£),  (7) 


where 


lo  =  \rp\2  +  |rs|2  +  cos  2 (P  -  M)[cos  2M(\rp\ 2  -  |rs|2) 
+  2  sin  2 M  Re(rprs*)],  (8a) 

Is  =  2  sin  2(P  -  M)Im(rprs*),  (8b) 

Ic  =  sin  2 (P  -  M)[— sin  2M(\rp\2  -  |rs|2) 

+  2  cos  2 M  Re(rprs*)].  (8c) 
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The  calculations  through  which  we  obtained  the  func¬ 
tions  I0,  Is,  and  Ic  are  very  laborious,  and  it  is  even 
more  laborious  to  obtain  a  simple,  synthetic  expres¬ 
sion  of  them  as  in  Eqs.  (8a)-(8c).  However,  there  is 
a  simple  way  to  verify  their  accuracy.  With  the  help 
of  a  programming  environment  for  symbolic  calcula¬ 
tion  as  powerful  as  Mathematical4  we  can  calculate 
the  expanded  form  of  the  harmonics,  which  is  in  the 
range  of  pages  of  formulas.  Then  we  program  the 
computer  to  simplify  the  difference  between  the  ex¬ 
panded  forms  of  the  harmonics  and  their  synthetic 
expression.  As  expected,  the  result  of  the  simplifi¬ 
cation  is  zero.  One  may  notice  that  I(t)  from  Eq.  (7) 
is  not  actually  the  square  of  r  but  is  its  double.  The 
reason  for  this  choice  is  that  it  normalizes  the  har¬ 
monics  in  Eqs.  (8a)-(8c);  their  maximum  absolute 
value  becomes  1. 

After  expanding  the  trigonometric  functions  in  Eq. 
(7)  in  Bessel  functions  we  obtain  the  harmonics’  ex¬ 
pression 


'So' 

"1 

0 

JoW  ' 

'/o' 

su 

= 

0 

0 

Is 

s2w 

0 

0 

2  J2(si) 

Ic 

The  S  coefficients  of  Eq.  (9)  are  what  can  be  measured 
with  the  phase-modulator  scatterometer  in  the  ideal 
case.  For  convenience  the  value  of  si  is  chosen  to  be 
si0  =  137.8°,  because  then  we  have  J0(si0)  =  0>  thus 
simplifying  the  equations.  In  practice  we  prefer  to 
work  with  the  quantities 


~  O) 

2J1(sZ0)So’ 

(10a) 

s2w 

(10b) 

2J  2(si0)S  0 

There  are  some  obvious  advantages  of  working  with 
the  quantities  in  Eqs.  (10a)  and  (10b),  besides  the  fact 
that  they  are  normalized.  Their  numerator  and  de¬ 
nominator  are  measured  simultaneously;  therefore 
their  ratio  is  free  from  systematic  errors  caused  by 
the  spatial  displacement  of  the  beam  spot  onto  the 
photodetector  area  and  by  the  fluctuations  in  time  of 
the  laser  power  and  other  variables. 

Note  also  that,  with  the  phase-modulator  scat¬ 
terometer,  we  extract  more  information  than  in  the 
case  of  ellipsometric  scatterometer,6  because  we  mea¬ 
sure  the  two  quantities  in  Eqs.  (10a)  and  (10b)  simul¬ 
taneously.  Therefore,  the  information  provided  by 
the  phase-modulator  scatterometer  is  richer  than 
that  provided  by  the  ellipsometric  scatterometer  be¬ 
cause  we  take  double  the  number  of  measurements  in 
a  0  scan. 


(a) 


(b) 


6(°) 

(c) 


4.  Calibration 

In  practice,  it  turns  out  that  Eq.  (9)  describes  only  an 
ideal  behavior  of  the  phase  modulator.  Corrections 
for  the  nonideal  behavior  were  introduced  in  the  cal¬ 
culations  according  to  Acher  et  al8 


Fig.  2.  Empirical  error  function  for  the  phase-modulator  scat¬ 
terometer.  (a )/0:  a2  =  10^6  +  3.2  X  10 “4/0,  (b )/s:  a2  =  10~6  + 
2.0  X  10"4  Is,  (c)  Ic\  a2  =  10  6  +  1.4  X  10~4  Ic.  The  measure¬ 
ments  were  performed  with  the  XL6  sample  at  4>  =  0°  and  X 
632.8  nm. 
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Table  1.  Numerical  Results  of  the  Sensitivity  Analysis  of  Sample  XL6a 


Label 

Measurement  Configuration 

Iw  (A) 

(Experiment) 

<r(lw)  (A) 

<K°) 

M(°) 

p(°) 

AO 

(Theory) 

(Experiment) 

1 

0 

90 

45 

45 

1467.2 

4.1 

2.1 

2 

45 

90 

45 

45 

1419.0 

40.2 

29 

3 

90 

90 

45 

45 

1507.4 

35 

2.9 

4 

21 

90 

45 

45 

1464.8 

5.0 

4.4 

“For  phase-modulator  type  measurables 


5.  Symmetry  Properties  and  Sensitivity  Considerations 

The  application  of  the  SAF  formalism  can  be  a  very 
time-consuming  procedure.  To  avoid  burdening  the 
computer  with  unnecessary  calculations,  we  reduced 
the  domain  of  the  degrees  of  freedom  of  the  experi¬ 
mental  arrangement.56  Unlike  the  ellipsometric 
scatterometer,  for  the  phase-modulator  scatterom- 
eter  we  can  make  some  a  priori  sensitivity  consider¬ 
ations,  independent  of  the  sample,  which  further 
reduce  the  domain  of  experimental  parameters  that 
we  have  to  scan  in  search  of  the  optimum  sensitivity. 

At  first  sight  it  seems  we  have  four  degrees  of  free¬ 
dom  for  the  phase-modulator  scatterometer:  P,  A, 
M,  and  c|).  The  measurables  of  the  system,  the  ratios 
and  R2oi  from  Eqs.  (10a)  and  (10b),  are  approxi¬ 
mately  equal  to  the  ratios  Is/I0  and  IJI0.  A  simple 
inspection  of  the  I  coefficients  from  Eqs.  (8a)-(8c) 
shows  that  these  ratios  are  maximum  when  P  —  M  = 
±45°.  To  improve  the  signal-to-noise  ratio,  we  need 
to  set  the  position  of  the  polarizer  relative  to  the 
modulator  to  either  +45°  or  -45°  (i.e.,  P  —  M  = 
±45°). 

Although  changing  M  by  ±90°  changes  the  sign  of 
Is  and  Ic  but  not  their  amplitude,  70  remains  un¬ 
changed.  Thus  the  quantities  RM  and  R2oi  change 
only  their  the  sign,  which  is  inconsequential  for  the 
sensitivity.  Therefore  we  can  restrict  the  sensitivity 
scan  to  a  domain  of  M  E  (0°  .  .  .  90°). 

6.  Empirical  Error  Function 

Since  the  number  of  measurement  configurations  to 
be  scanned  in  the  search  for  the  optimum  sensitivity 
is  huge,  it  is  necessary  to  extrapolate  measurement 
errors  obtained  from  some  particular  measurement 
configurations  to  an  empirical  error  function  valid  for 
all  the  measurement  configurations.  The  empirical 
error  function  for  the  phase-modulator  scatterometer 
was  evaluated  in  the  same  way  as  it  was  for  the 
ellipsometric  scatterometer.6  Figure  2  illustrates 
how  we  fit  the  averaged  errors  for  a  set  of  100  mea¬ 
surements  with  the  empirical  function.  The  func¬ 
tion  that  best  approximates  the  measurement  errors 
is 

ct2  =  iq-6  +  5.0  X  10“4/2.  (11) 

7.  Sensitivity  Analysis  for  Fitting  Predictions  and 
Comparison  with  Experiment 

The  SAF  formalism  was  applied  to  the  samples  XL6 
and  P200,  described  in  Table  1  from  Ref.  6,  for  the 


measurables  from  Eqs.  (10a)  and  (10b).  Figure  3 
shows  an  example  of  fitting  data  generated  by  the 
phase-modulator  scatterometer  to  theoretical  curves 


(a) 


(b) 

Fig.  3.  Example  of  fitting  data  from  the  phase-modulator  scat¬ 
terometer  for  sample  XL6  at  cj>  =  90°  and  X  =  632.8  nm.  The 
results  of  the  fitting  were  Iw  =  1493.1  ±  1.2  A,  hg  =  5568.4  ±  3.3 
A,  and  hx  =  1604.3  ±  1.9  A. 
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c(lw)  (A) 
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Fig.  4.  Sensitivity  map  from  the  SAF  scan  for  the  phase- 
modulator  scatterometer,  sample  XL6,  at  X  =  632.8  nm.  Here  the 
color  code  is  as  follows:  white  means  large  a  (bad  sensitivity), 
black  means  small  <r  (good  sensitivity). 


calculated  with  RCWA.  Figure  4  illustrates  the  sen¬ 
sitivity  map  for  sample  XL6,  with  the  measurement 
configurations  P  =  A  =  45°,  among  which  there  is  the 
optimum  configuration,  and  Fig.  5  shows  the  sensi¬ 
tivity  map  for  sample  P200  for  configurations  type 
P  =  -A  =  45°.  The  code  color  for  both  Figs.  4  and  5 
is  the  same:  namely,  the  darker  the  hue  of  gray,  the 
smaller  a. 

In  Table  1  the  numerical  results  of  the  sensitivity 
analysis  performed  on  the  sample  XL6  for  the  line- 
width  Iw  are  summarized.  We  tested  experimen¬ 
tally  the  sensitivity  predictions  for  the  measurement 
configurations  and  noticed  a  qualitative  agreement 
between  theory  and  experiment,  except  for  configu¬ 
ration  3.  The  reason  for  this  discrepancy  is  because 
the  measurement  precision  for  this  configuration  was 
inexplicably  better  than  the  measurement  precision 
for  the  other  configurations.  The  cause  of  this  (for¬ 
tunate)  anomaly  is  yet  to  be  determined.  In  Table  2 
the  numerical  results  of  the  sensitivity  analysis  for 
the  theoretical  sample  P200  are  shown. 

The  inspection  of  Tables  1  and  2  does  not  reveal  a 
significant  improvement  of  the  sensitivity  compared 


Table  2.  Numerical  Results  of  the  Sensitivity  Analysis  of  Sample  P200a 


Measurement  Configuration 
-  <j(lw)  (A) 


Label 

4>(°) 

M{°) 

p(°) 

A(°) 

(Theory) 

1 

3 

36 

45 

-45 

4.8 

2 

87 

56 

45 

-45 

4.7 

3 

90 

90 

45 

45 

18.8 

4 

24 

90 

45 

-45 

14.1 

“For  phase-modulator  type  measurables 
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Fig.  5.  Sensitivity  map  from  the  SAF  scan  for  the  phase- 
modulator  scatterometer,  sample  P200,  at  X  =  632.8  nm.  The 
color  code  is  the  same  as  in  Fig.  4. 


with  that  of  the  ellipsometric  scatterometer.  This 
lack  of  significant  improvement  was  ascribed  to  the 
additional  sources  of  errors  introduced  by  a  more  com¬ 
plicated  experimental  arrangement.  Specifically,  the 
instability  of  the  phase  modulation  is  an  important 
perturbation  factor.  This  assertion  is  proved  by  the 
empirical  error  function  (11)  being  considerably  higher 
than  that  for  the  ellipsometric  scatterometer.6 

There  is,  however,  an  important  aspect  in  Table  1 
that  makes  our  effort  of  developing  the  phase- 
modulator  scatterometer  worthwhile.  Namely,  we 
notice  what  we  believe  is  an  increased  accuracy.  Ac¬ 
tually,  there  is  no  reason  to  doubt  the  accuracy  of 
these  results  since  the  dispersion  of  the  experimen¬ 
tally  determined  values  for  the  linewidth  are  almost 
within  the  theoretical  uncertainties.  If  we  ignore 
the  insensitive  configurations  2  and  3,  we  have  both 
a  precise  and  an  accurate  determination  of  the  line- 
width,  which  is  clearly  bound  to  lie  within  the  inter¬ 
val  of  1460-1470  A.  The  perceived  accuracy  is  the 
most  important  improvement  shown  by  the  phase- 
modulation  scatterometry  technique,  compared  with 
previous  scatterometry  techniques.  This  accuracy 
is  attributed  to  the  measurables  being  real-time 
ratios.  The  numerator  and  the  denominator  are 
simultaneously  measured,  in  exactly  the  same  area 
of  the  photodetector,  unlike  in  the  ellipsometric 
scatterometer,  thus  eliminating  many  systematic 
measurement  errors.  Contrary  to  the  ellipsometric 
scatterometer,  the  normalization  measurement  S0  is 
made  in  the  same  conditions  as  those  for  and  S2 w. 

8.  Conclusion 

Scatterometry  has  been  a  valuable  metrology  tool  for 
characterizing  structures  with  CDs  as  small  as  100 
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nm  and  smaller.  To  compensate  for  the  loss  of  sen¬ 
sitivity  due  to  the  miniaturization  trend  in  microelec¬ 
tronics,  we  have  designed  a  methodology  for  finding 
the  optimum  measurement  configuration.  The 
method  determines  the  experimental  parameters 
that  provide  optimum  sensitivity  of  the  measurables 
to  changes  of  the  geometric  parameters  of  the  sample. 
This  method,  which  we  introduced  previously,4-6  is 
termed  sensitivity  analysis  for  fitting  (SAF),  and  it  is 
a  mathematical  algorithm  that  can  be  applied  for 
optimization  purposes  regardless  of  the  specific  type 
of  measurements  that  are  performed. 

Here  SAF  is  applied  to  a  new  experimental  ar¬ 
rangement  that  has  a  new  element  added  to  the 
experimental  arrangement  of  the  ellipsometric 
scatterometer — a  phase  modulator.  This  element 
acts  to  provide  more  information  regarding  grating 
properties.  Additionally,  the  modulator  allows 
three  signals  (harmonics  of  the  phase-modulation  sig¬ 
nal)  to  be  measured  simultaneously  with  lock-in  am¬ 
plifiers,  which  means  that  the  speed  of  acquiring 
information  is  increased.  By  normalizing  the  two 
superior  harmonics  to  the  zeroth  harmonic  (dc  term), 
we  eliminate  systematic  errors,  such  as  the  spatial 
variations  of  the  sensitivity  of  the  photodetector  and 
laser  fluctuations.  However,  phase-modulation 
scatterometry  has  disadvantages.  It  is  more  sensi¬ 
tive  to  residual  birefringence  of  the  optical  elements 
of  the  experiment  than  the  ellipsometric  scatterom¬ 
eter,  and  therefore  requires  better  optics.  Because 
of  the  residual  birefringence  and  the  higher  modula¬ 
tion  orders  induced  by  the  modulator  piezoelectric 
transducer,  correction  terms  must  be  applied  to  the 
measurements.  These  correction  terms  have  to  be 
calculated  prior  to  any  data  acquisition,  and  an  easy 
and  accurate  method  of  performing  this  initial  cali¬ 
bration  automatically  does  not  exist. 

Carefully  weighing  the  advantages  and  disadvan¬ 
tages,  we  illustrated  that  by  joining  the  phase  mod¬ 
ulator  with  a  mathematical  optimization  algorithm 
(SAF)  and  ellipsometric  scatterometry  we  obtained 
an  improved  method  that  yields  superior  results. 
The  technique  can  satisfy  tomorrow’s  metrology 
needs  of  the  microelectronics  industry. 
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Sensitivity  analysis  of  grating  parameter  estimation 


Petre  C.  Logofatu 


An  optimization  method  for  the  sensitivity  of  diffraction  efficiency  measurements  is  presented.  I  define 
the  sensitivity  as  the  estimation  precision  of  the  grating  parameters.  The  optimization  method  called 
sensitivity  analysis  for  fitting  scans  all  the  possible  measurement  configurations  and  selects  the  config¬ 
uration  that  yields  the  best  sensitivity.  The  scan  is  made  over  the  domain  of  the  experimental  param¬ 
eters  of  the  arrangement,  such  as  the  azimuth  angle  of  the  grating  and  the  orientation  angles  of  the 
analyzer  and  the  polarizer.  These  parameters  can  be  freely  varied,  and  among  the  multitude  of  possible 
combinations  there  is  one  configuration  that  provides  optimum  sensitivity.  Comparison  with  experi¬ 
mental  results  reveals  a  qualitative  agreement  between  theory  and  practice.  ©  2002  Optical  Society  of 
America 
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1.  Introduction 

The  development  of  the  microelectronics  industry  has 
created  the  need  for  reliable  metrology  techniques  for 
the  characterization  of  chip  microstructures.  Criti¬ 
cal  elements  of  the  microstructures,  such  as  the 
critical  dimension,  the  reproducibility,  and  the  uni¬ 
formity  of  the  microstructures,  are  areas  of  concern. 
Contact  metrologies  (atomic  force  microscopy1)  and 
noncontact  metrologies  (scanning  electron  microsco¬ 
py2’3  and  scatterometry4"7)  were  developed  to  fulfill 
this  need.  The  optical  metrology  technique,  improp¬ 
erly  called  scatterometry,  consists  in  the  estimation 
of  grating  parameters  from  diffraction  efficiency  mea¬ 
surements.  It  is  a  diffraction-based  technique  and 
has  nothing  to  do  with  diffuse  scattering,  as  one  may 
be  mislead  to  think.  The  experimental  arrangement 
illustrated  in  Fig.  1  is  called  a  scatterometer.  Scat- 
terometers  for  both  commercial  and  scientific  re¬ 
search  purposes  were  developed  and  are  in  use.  As 
one  can  see  in  Fig.  2,  gratings  generally  diffract  mul¬ 
tiple  orders.  However,  scatterometry  generally  lim¬ 
its  itself  to  measuring  the  0th  reflected  order 
(specular  reflection)  because  most  of  the  reflected 
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power  is  contained  in  that  order.  An  additional  rea¬ 
son  to  limit  the  measurements  to  the  0th  order  is 
that,  because  of  the  shrinking  geometry  of  microelec¬ 
tronics  samples,  oftentimes  the  0th  order  is  the  only 
order  available  for  measurement.  In  this  paper  I 
deal  with  the  measurement  of  the  Oth-order  reflection 
only.  Scatterometry  has  the  advantages  of  being 
noncontact,  nondestructive,  and  it  does  not  require 
special  environmental  conditions  such  as  a  vacuum. 
Conventional  scatterometry6  8  9  is  limited  to  only  spe¬ 
cific  values  of  the  experimental  parameters  illus¬ 
trated  in  Fig.  1:  the  azimuth  of  the  sample  4>,  the 
polarizer  angle  P,  and  the  analyzer  angle  A.  For 
this  reason,  important  information  that  can  be  ex¬ 
tracted  from  the  sample  remains  ignored.  Another 
consequence  is  that  there  is  little  room  for  optimi¬ 
zation  of  the  sensitivity  of  the  experimental  arrange¬ 
ment.  Ellipsometric  scatterometry10 11  is  a  general¬ 
ization  of  conventional  scatterometry  in  which  full 
use  of  the  experimental  degrees  of  freedom  of  the 
experimental  arrangement  is  made.  By  full  use  I 
mean  that  an  optimization  algorithm  called  sensitiv¬ 
ity  analysis  for  fitting  (SAF)12  was  developed  to  en¬ 
sure  that  the  scatterometer  is  in  the  measurement 
configuration  that  provides  the  best  sensitivity.  In 
this  paper  I  undertake  a  rigorous  and  formal  inves¬ 
tigation  of  the  ellipsometric  scatterometer  and  illus¬ 
trate  the  application  of  SAF  for  real  and  theoretical 
samples.  The  SAF  algorithm  was  presented  in  its 
initial  form  in  Ref.  12.  In  Section  5  I  present  a  mod¬ 
ified  approach  in  which  the  search  for  sensitivity  and 
stability  are  done  together  rather  than  separately. 
This  modified  approach  is  more  time-consuming  but 
its  results  are  much  more  stable. 
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Sections  2  and  3  are  purely  descriptive  but  contain 
absolutely  necessary  definitions  of  the  building  ele¬ 
ments  of  the  optimization  method.  Section  4  is  the 
application  of  the  Jones  formalism13  and  the  gener¬ 
alized  ellipsometry1415  to  the  specifics  of  the  scat- 
terometer.  The  main  contribution  of  this  paper, 
which  is  the  sensitivity  analysis  formalism  called 
SAF,  is  outlined  in  an  abstract  manner  in  Section  5 
and  in  a  concrete  manner  in  Section  8  where  it  is 
applied  to  the  specifics  of  scatterometry.  In  Section 
5  I  show  that  the  sensitivity  analysis  formalism 
stands  by  itself  and  can  be  applied  to  various  optimi¬ 
zation  problems.  Section  6  contains  theoretical  con¬ 
siderations  that  considerably  reduce  the  calculation 
time  required  by  the  optimization  procedure  in  the 
specific  case  of  scatterometry.  In  Section  7  I  present 
an  empirical  function  for  the  measurement  errors,  a 
necessary  ingredient  for  scanning  the  domain  of  mea¬ 
surement  configurations. 

2.  Sample  Geometry 

Scatterometry  can  be  defined  as  a  method  of  inferring 
the  shape  of  microstructures  from  how  these  struc¬ 
tures  affect  light  in  reflection.  More  completely  and 
specifically,  scatterometry  is  a  metrology  technique 
used  to  determine  the  geometric  parameters  of  mi¬ 
croelectronics  test  structures  from  efficiency  mea¬ 
surements  of  the  light  diffracted  by  these  structures. 
In  the  sample,  the  critical  features  of  the  chip  are 
reproduced  periodically.  A  schematic  of  a  test  sam¬ 
ple  can  be  seen  in  Fig.  2.  On  a  silicon  substrate  a 
stack  of  thin  films  is  deposited  and  on  top  of  them  is 
placed  a  rectangular  etched  grating.  The  geometric 
parameters  of  the  test  sample  in  Fig.  2  are  as  follows. 
The  pitch  is  the  period  of  the  grating  and  is  denoted 
by  A.  The  pitch  of  the  gratings  studied  was  around 
a  few  hundred  nanometers.  The  grating  vector  K  in 
Fig.  2  points  in  the  periodicity  direction,  and  its  mod¬ 
ulus  is  2tt/A,  analogous  to  a  wave  vector  correspond¬ 
ing  to  a  wavelength  A.  The  width  of  the  grating 
ridge  is  called  the  linewidth  and  is  denoted  as  Iw. 
The  height  of  the  grating  is  noted  as  hg,  and  the  rest 


Fig.  2.  Example  of  a  test  structure  for  scatterometry  studies.  A 
unidimensional  grating  is  placed  on  top  of  a  stack  of  thin  films 
ending  with  a  silicon  substrate.  The  materials  in  the  figure  are 
common  for  microelectronics  samples.  ARC,  •••. 


of  the  film  heights  have  similar  notations.  These 
geometric  parameters  of  the  sample,  together  with 
the  refractive  indices  of  the  materials  out  of  which  the 
layers  are  made,  completely  characterize  the  sample. 
The  rectangular  profile  is  not  the  only  profile  that  a 
grating  can  have.  However,  in  this  paper  I  use  the 
rectangular  profiles  for  reasons  of  simplicity  and  be¬ 
cause  they  are  the  most  common  profiles.  Some  of 
the  parameters  of  the  sample  can  be  found  precisely 
by  means  other  than  scatterometry.  For  example, 
the  refractive  index  is  usually  determined  from  ellip- 
sometric  measurements  of  unetched  samples  with 
smooth  surfaces  made  from  the  same  materials. 
The  pitch  is  the  only  geometric  parameter  that  is 
usually  measured  independently  with  good  precision 
by  methods  other  than  scatterometry. 

3.  Experimental  Arrangement 

In  Fig.  1  the  experimental  arrangement  for  the  ellip- 
sometric  scatterometer  is  shown.11 12  The  diffrac¬ 
tion  efficiency  of  the  Oth  reflected  order  (the  specular 
reflectance)  is  measured.  The  polarizer  and  the  an¬ 
alyzer,  with  polarization  angles  P  and  A  that  can 
take  any  values,  allow  us  to  control  the  polarization  of 
the  input  and  the  output.  We  also  have  the  ability  to 
vary  continuously  the  azimuth  c|).  The  measure¬ 
ments  are  done  against  the  incidence  angle  0.  The 
degrees  of  freedom  of  the  system,  or  the  experimental 
parameters,  are  therefore  the  angles  P,  A,  and  c|). 

Typically  we  take  a  set  of  measurements  corre¬ 
sponding  to  incidence  angles  0  ranging  from  5°  to  70° 
with  1°  steps.  While  the  incidence  angle  is  varied, 
the  other  experimental  parameters  are  kept  con¬ 
stant.  The  constant  experimental  parameters  de¬ 
fine  a  measurement  configuration.  To  express  this 
definition  in  a  formal  manner,  we  denote  the  mea¬ 
surable  quantities  of  the  scatterometer  by 

Q.i  *7i(Pgeo?  Popt?  Pscan?  Pexp)  ?  ^  1  .  .  .  N ,  (1) 

where  pgeo  are  the  geometric  parameters  of  the  sam¬ 
ple  that  we  wish  to  determine  (for  example,  Iw ,  hg), 
popt  are  the  optical  parameters  and  other  parameters 
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of  the  sample  that  are  estimated  through  alternative 
means  (refractive  indices,  A),pscan  is  the  experimen¬ 
tal  parameter  of  the  arrangement  that  we  vary  in  the 
measurement  (usually  the  incidence  angle  0),  pexp  are 
the  experimental  parameters  of  the  system  that  we 
control  and  keep  constant  during  the  measurements 
(P,  A,  and  4>),  and  N  is  the  number  of  measurements. 
The  Bold  type  indicates  an  array  of  quantities.  We 
have,  for  example,  pgeo  =  (pu  p2, . . . ,  pM).  The 
measurement  configuration  is  defined  by  pexp.  If  is 
Pexp  that  are  the  degrees  of  freedom  of  the  scatterom- 
eter.  It  is  pexp  that  we  explore  to  find  the  optimum 
measurement  configuration.  Each  pexp  corresponds 
to  a  different  measurement  configuration. 

4.  Measurables 

The  Jones  matrix  calculus13  was  used  to  obtain  the 
expression  of  the  measurables.  The  vector  electric 
field  output  is 


Eout  =  ArPEin, 


(2) 


or 


Jjl  out 

out 


X 


cos2  A  cos  A  sin  A 
sin  A  cos  A  sin2  A 

cos2  P  cos  P  sin  P 
sin  P  cos  P  sin2  P 


EP 

E e 


(3) 


Here  A  is  the  Jones  matrix  of  the  analyzer,  P  is  the 
Jones  matrix  of  the  polarizer,  and  r  is  the  reflection 
matrix  of  the  sample  as  defined  in  generalized  ellip- 
sometry.15’16  Making  the  calculations  in  Eq.  (3),  we 
obtain 


In  Eq.  (5)  the  dependence  of  the  measurable  with 
respect  to  the  polarization  angles  P  and  A  is  explicit. 
The  dependence  with  respect  to  the  third  degree  of 
freedom,  the  azimuth,  is  implicit  because  the  reflec¬ 
tion  coefficients  are  functions  of  the  azimuth. 


5.  Sensitivity  Analysis  for  Fitting 

For  sensitivity  analysis  purposes  we  define  sensitiv¬ 
ity  as  the  precision  of  the  fitting  of  the  standard  error 
for  the  parameter  estimation  error,17 

°‘(Pge0m)  =  yjAmm  AXjV,  (8) 

where  pgeom  (m  =  1,  M)  are  the  geometric  parameters 
to  be  determined,  A  is  the  curvature  matrix, 


A 


k,i  ~ 


2 


i  =  1 


1  dQi  dQi 
of  dPk  dP  l  ’ 


(9) 


Uj  are  the  measurement  errors  of  the  measurables  qt, 
i  =  1  .  .  .  N  (which,  in  our  case,  are  the  reflectance  at 
various  incidence  angles  0^),  and  A \n  1S  a  coefficient 
that  determines  the  probability  P(Axat)  that  the  real 
value  of  the  parameter  pgeom  falls  within  the  esti¬ 
mated  precision  o-(pgeom), 


P(Axv)  = 


r(M/2,  0,  AXiV2/2) 
T(M/2) 


(10) 


with  M  the  number  of  parameters  to  be  estimated 
and  T  the  Euler  gamma  function.18  With  the  help  of 
Eq.  (8)  we  can  proceed  to  the  sensitivity  scanning. 
Considering  Eq.  (1),  we  can  define  the  SAF  scan  as 
follows.  The  optimum  measurement  configuration 
Pexp°ptimum  1S  that  configuration  for  which  we  have 


cos  A\ 
sin  A  )  ’ 


(4) 


o' m°ptimum(pexPoptimum)  =  Min  Max[am(pgeo,  pexp)] 

Pexp  I  Pgeo 


where 


(ID 


r  =  rpp  cos  A  cos  P  +  rps  cos  A  sin  P 

+  rsp  sin  A  cos  P  +  rss  sin  A  sin  P,  (5) 

Ep  =  E;n  cos  P  +  Psin  sin  P.  (6) 

The  expression  in  Eq.  (6)  is  the  amplitude  of  the  beam 
field  after  passing  through  the  polarizer.  In  the  fol¬ 
lowing  I  refer  to  Ep  as  the  input  field,  because  then 
the  ratio  output  over  input  takes  a  more  convenient 
form.  The  relation  in  Eq.  (5)  is  fundamental  for  the 
study  of  the  ellipsometric  scatterometer,  containing 
in  a  simple,  synthetic  formula  all  the  information 
that  we  need.  In  practice  we  measure  powers,  so  we 
use  the  square  absolute  value  of  Eq.  (5).  The  mea¬ 
surements  taken  by  the  ellipsometric  scatterometer 
are  normalized  to  the  output  power  for  when  the 
sample  is  removed  from  the  system.  In  this  case  we 
have  r  =  1.  We  can  conclude  that,  for  the  ellipso¬ 
metric  scatterometer,  the  ratio  of  output  over  input  is 
r  from  Eq.  (5)  and  the  measurable  is  the  reflectance 

R  =  |r|2.  (7) 


where  am  is  the  estimation  precision  [Eq.  (8)]  for  the 
geometric  parameter  pgeom.  (The  dependence  of  crm 
on  the  optical  parameters  was  omitted  because  they 
are  known  and  have  fixed  values.)  This  formula  re¬ 
quires  some  explanation.  In  the  optimization  proce¬ 
dure  we  first  scanned  the  values  of  the  fitting 
precision  am  in  the  domain  of  the  geometric  param¬ 
eters  pgeo  where  we  suspected  the  real  geometric  pa¬ 
rameters  are  to  be  found,  looking  for  the  maximum. 
Because  we  did  not  know  the  exact  values  of  these 
parameters,  and  we  wanted  good  fitting  precision  for 
all  possible  values,  we  assumed  the  worst  situation  in 
this  domain  for  any  measurement  configuration  pexp. 
This  first  part  ensures  that  the  measurement  config¬ 
uration  is  stable  for  changes  of  the  geometric  param¬ 
eters.  Second,  we  scanned  the  values  of  the  fitting 
precision  in  the  domain  of  pexp,  or  the  domain  of 
the  experimental  degrees  of  freedom  searching  for 
the  minimum.  This  part  ensures  the  optimization  of 
the  sensitivity.  The  experimental  parameters 
PexP°ptimum  corresponding  to  this  minimum  define  the 
optimum  measurement  configuration  that  provides 
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us  with  good  sensitivity  even  for  the  most  unfavor¬ 
able  case  of  pgeo  values. 

We  wanted  to  verify  that  the  predictions  of  SAF  are 
accurate.  For  this  purpose  we  analyzed  the  distri¬ 
bution  of  the  results  for  the  fitted  parameters  to  see  if 
it  agrees  with  the  distribution  assumed  in  Eq.  (10). 
Then  we  determined  if  the  precision  given  by  theory 
agrees  with  the  precision  that  the  experimental  error 
distribution  indicates. 

6.  Symmetry  Properties  that  Reduce  the  Domain  of 
Experimental  Parameters 

We  have  determined  that  the  ellipsometric  scat- 
terometer  has  three  degrees  of  freedom:  the  polar¬ 
izer  orientation  angle,  the  analyzer  orientation  angle, 
and  the  azimuth.  The  optimization  procedure  is 
time-consuming,  and  it  is  a  priority  to  minimize  the 
calculation  time.  We  sought  to  reduce  the  three- 
dimensional  domain  of  the  degrees  of  freedom  that 
has  to  be  scanned  by  eliminating  the  redundant  mea¬ 
surement  configurations.  A  rectangular  grating  has 
rotation  symmetry  C2z  (Ref.  19)  because  it  remains 
unchanged  after  180°  rotation  in  azimuth.  The 
grating  also  has  a  reflection  symmetry  relative  to  the 
plane  xz,  or  cry.19  In  other  words,  switching  4>  by  — c|) 
changes  nothing.  (Actually,  because  of  some  sign 
conventions,  there  are  some  sign  changes  in  the  re¬ 
flection  matrix;  however,  careful  management  of 
these  signs  should  avoid  any  confusion20.)  We  can 
then  restrict  the  azimuth  to  the  first  quadrant  be¬ 
cause  every  quadrant  simply  repeats  the  information 
contained  in  the  others.  This  is  true  regardless  of 
the  actual  measurables  of  the  experimental  arrange¬ 
ment  because  it  is  an  intrinsic  property  of  the  grating 
and  independent  of  the  rest  of  the  experimental  sys¬ 
tem.  Turning  to  the  degrees  of  freedom  of  the  ellip¬ 
sometric  scatterometer  (P,  A)  that  are  intrinsic  to  the 
arrangement,  one  can  further  reduce  the  domain  of 
nonredundant  measurement  configurations.  Tak¬ 
ing  into  account  the  identity  of  the  cross-reflection 
coefficients,19’21  we  can  rewrite  Eq.  (5)  as 

r  =  rpp  cos  A  cos  P  +  rps  sin(P  -  A) 

+  rss  sin  A  sin  P.  (12) 

Because  the  measurable  is  the  square  modulus  of  Eq. 
(12),  a  sign  change  of  it  will  have  no  effect  on  the 
reflectance.  Note  that  a  change  of  ±180°  of  P  or  A 
transforms  r  into  ±r,  hence  leaving  reflectance  un¬ 
changed.  Actually,  this  is  the  simple  consequence  of 
the  fact  that  the  Jones  matrix  of  a  polarizer  is  left 
unchanged  by  a  180°  rotation. 

Another  symmetry  is  provided  by  the  transforma¬ 
tion 

P  — >  -A,  A  — »  -P.  (13) 

The  transformation  in  relation  (13)  leaves  r  un¬ 
changed,  and  it  is  equivalent  to  a  mirror  reflection  in 
the  (P,  A)  plane  relative  to  the  line  P  +  A  =  0  in  this 
plane.  The  redundancy  of  the  measurement  configu¬ 
rations  is  illustrated  in  Fig.  3.  A  map  of  the  scan  of 


G(lw) 


Fig.  3.  Map  of  a  sensitivity  (a)  scan  of  the  whole  range  of  polar¬ 
ization  angles  illustrating  the  redundancy  of  the  measurement 
configurations.  The  redundancy  occurs  no  matter  what  particu¬ 
lar  sample  and  azimuth  correspond  to  the  map. 

the  entire  domain  of  experimental  parameters  shows 
multiple  periodicity  and  symmetries.  The  first  two 
transformations  show  that  the  reflectance  has  a  peri¬ 
odicity  of  180°  with  respect  to  P  and  A.  The  transfor¬ 
mation  in  relation  (13)  shows  thatP  is  symmetric  for  a 
mirror  reflection  in  rapport  with  the  line  P  +  A  =  0. 
Each  of  these  three  transformations  reduce  the  initial 
domain  of  (P,  A)  to  half.  Overall,  the  three  transfor¬ 
mations  reduce  the  domain  of  meaningful  degrees  of 
freedom  to  one  eighth  of  the  initial  area.  By  conven¬ 
tion  we  chose  among  multiple  available  choices  this 
eighth  portion  of  the  initial  domain  to  be  the  area 

P  +  A  E  (0°,  180°),  P  -  A  E  (0°,  180°),  (14) 

corresponding  to  the  diamond-shaped  area  in  Fig.  3. 
For  convenience,  we  work  with  the  normal  coordi¬ 
nates  Pn  =  P  +  A  and  An  =  P  -  A. 

7.  Empirical  Error  Function 

The  sensitivity  analysis  has  to  be  performed  for  all 
the  possible  measurement  configurations  pexp.  To 
do  this  we  have  to  know  a  priori  the  measurement 
errors  from  Eq.  (9),  corresponding  to  all  the  mea¬ 
surement  configurations.  Because  this  is  not  possi¬ 
ble,  we  have  to  extrapolate,  to  calculate  the 
measurement  errors  in  certain  configurations,  to  in¬ 
fer  an  empirical  formula  from  them  that  we  then 
extrapolate  to  all  the  other  measurement  configura¬ 
tions. 

To  determine  the  empirical  measurement  error 
function  of  the  scatterometer,12’20  I  took  statistical 
batches  of  measurements  from  various  samples  and 
in  various  measurement  configurations.  In  the  par¬ 
ticular  case  shown  in  Fig.  4,  the  batch  consists  in  50 
curves;  the  measurement  configuration  is  4>  =  90°, 
P  =  0°,  and  A  =  0°.  I  varied  the  initial  conditions,  by 
realigning  the  system  and  allowed  significant  time 
intervals  between  every  ten  sets  of  measurements. 
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Fig.  4.  Measurement  precision  (dots)  fitted  by  the  empirical  error 
function  for  the  ellipsometric  scatterometer  (solid  curve).  In  the 
figure  are  shown  the  coefficients  for  the  empirical  error  function, 
which  here  are  the  coefficients  of  the  best  fit.  The  dots  are  the 
average  values  for  a  batch  of  50  measurement  sets  in  the  config¬ 
uration  4>  =  90°,  P  =  0°,  and  A  =  0°  at  X  =  632.8  nm. 

I  assumed  various  perturbation  factors,  such  as 
power  fluctuations  of  the  laser,  mechanical  vibrations 
that  cause  fluctuations  of  the  incidence  angle,  finite 
reading  resolution,  and  quantified  these  factors  as 

CT2  =  dres2  +  apow2i?2  +  avib2(aR/ae)2.  (15) 

I  calculated  the  coefficients  ares,  crpow,  and  avib  by 
fitting  the  actual  measurement  errors  with  the  for¬ 
mula  of  Eq.  (15).  I  did  not  find  conclusive  correla¬ 
tion  of  the  errors  with  the  partial  derivative  of  the 
reflectance  with  respect  to  the  incidence  angle,  so  I 
eliminated  the  hypothesis  of  the  mechanical  vibra¬ 
tions  as  a  meaningful  perturbation  factor.  In  addi¬ 
tion,  I  was  not  able  to  find  a  good  correlation  for  even 
the  remaining  perturbation  factors.  This  might 
have  been  caused  by  insufficient  size  of  the  statistical 
batch,  incorrect  randomization  of  the  experimental 
parameters  of  the  system,  or  simply  by  other  pertur¬ 
bation  factors  that  were  overlooked.  However,  an 
overall  correlation  of  the  errors  with  the  finite- 
resolution  reading  and  the  power  fluctuation  factors 
can  be  detected,  as  one  can  see  in  Fig.  4.  I  generally 
assumed  the  worse.  I  took  exceptions  when  the  fit¬ 
ted  factor  was  obviously  erroneous.  Finally  I  ar¬ 
rived  at  the  empirical  error  function  that  I  used  in  the 
SAF  calculations  for  samples  measured  with  the  el¬ 
lipsometric  scatterometer  expressed  as 

a2  =  1.0  X  10“8  +  1.3  X  10~5R2.  (16) 

8.  Sensitivity  Analysis  for  Fitting  Predictions  and 
Comparison  with  Experimental  Results 

The  SAF  formalism  was  applied  to  the  two  samples 
described  in  Table  1.  I  worked  with  He-Ne  lasers 
only  at  the  wavelength  X  =  632.8  nm.  For  sample 
XL6,  a  real  sample,  I  was  able  to  compare  the  exper¬ 
imental  results  for  the  actual  error  with  those  for  the 


Table  1.  Geometric  and  Optical  Parameters  of  the  Samples  Referred 
to  in  Section  5 


Sample 

A  (A) 

iw  (A) 

hg  (A)  h ,  (A) 

Tig  n1  ns 

XL6 

3628.8 

1470 

5700  1600 

1.64  1.66  3.882  -  0.019/ 

P200 

2000 

700 

4000  — 

1.64  —  3.882  -  0.019/ 

predicted  error,  as  explained  at  the  end  of  Section  5. 
For  the  sample  P200,  a  theoretical  sample,  I  give  only 
the  theoretical  predictions.  The  results  for  XL6  are 
summarized  in  Table  2.  The  values  of  the  estima¬ 
tion  precision  a  of  the  linewidth,  theoretical  and  ex¬ 
perimental,  were  calculated  for  a  confidence  level  of 
95%,  which  corresponds  to  a  Ay  of  2.79548.  By  con¬ 
vention,  the  estimation  precision  was  always  calcu¬ 
lated  for  a  probability  of  95%. 

Figure  5  illustrates  the  fitting  of  a  particular  set  of 
data  taken  from  sample  XL6.  Figure  6  shows  an 
example  of  a  SAF  scan  for  the  optimum  sensitivity  for 
the  sample  XL6,  and  Fig.  7  illustrates  another  exam¬ 
ple  for  the  sample  P200.  The  maps  in  Figs.  6  and  7 
are  sections  with 

4>  =  const  of  Max[am(pgeo,  pexp)] 

Pgeo 

Table  2.  Numerical  Results  for  the  Sensitivity  Analysis  of  the  Sample 
XL6  for  Ellipsometric  Scatterometer  Measurements  at  A.  =  632.8  nm 


Measurement 

Configuration 

-  a (Iw)  (A)  <j(lw)  (A) 


Label 

<M°)  P(°)  M°) 

Iw  (A) 

(Theory) 

(Experiment) 

1  (conventional) 

0 

0 

0 

1519.8 

1.8 

4.9 

2  (conventional) 

0 

90 

90 

1451.6 

0.6 

6.3 

3  (conventional) 

90 

0 

0 

1536.5 

15.0 

19.5 

4  (conventional) 

90 

90 

90 

1390.0 

8.4 

7.6 

5  (optimum) 

24 

-63 

63 

1494.4 

0.5 

2.6 

Fig.  5.  Example  of  the  fitting  of  a  curve  measured  by  the  ellipso¬ 
metric  scatterometer  in  the  configuration  4>  =  24°,  P  =  -63°,  and 
A  =  63°  at  X  =  632.8  nm.  In  the  plot  the  geometric  parameters  of 
the  sample  (parameters  of  the  best  fit)  are  shown. 
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O(lw)  (A) 


Fig.  6.  Illustration  of  the  sensitivity  (a)  scan  for  the  sample  XL6 
at  X  =  632.8  nm.  In  the  figure  is  a  section  of  the  investigated 
domain  for  4>  =  24°.  The  optimum  measurement  configuration  is 
4>  =  24°,  P  =  63°,  and  A  =  117°. 


from  Eq.  (11)  in  the  domain  of  degrees  of  freedom  of 
the  ellipsometric  scatterometer.  These  sections 
were  chosen  because  they  contain  the  optimum  con¬ 
figuration  pexpoptimum  defined  also  in  Eq.  (11).  In 
Figs.  6  and  7,  the  dark  hues  mean  low  values  of  a , 
whereas  light  hues  are  large  values.  We  are  inter¬ 
ested  therefore  in  the  darker  regions  of  the  map. 

We  can  see  that  Fig.  7  is  symmetric  relative  to  the 
diagonal  line.  This  happens  because  the  azimuth  is 
0°  and  we  have  additional  symmetry  properties  be¬ 
cause  rn<;  is  zero.  For  an  azimuth  different  from  0°  or 
90°,  the  degeneracy  is  removed,  and  it  was  not  con¬ 
sidered  necessary  to  make  a  special  case  out  of  these 
two  situations. 


Fig.  7.  Illustration  of  the  sensitivity  (a)  scan  for  the  sample  P200 
at  X  =  632.8  nm.  In  the  figure  is  a  section  of  the  investigated 
domain  for  cf)  =  0°.  The  optimum  measurement  configuration  is 
4>  =  0°,  P  =  -78°,  and  A  =  102°.  The  lower  corners  of  the  plot 
correspond  to  the  conventional  measurement  configurations  1  and 
2  from  Table  3. 


Inspection  of  the  last  two  columns  of  Table  2  shows 
that  there  is  not  a  rigorous  agreement  between  ex¬ 
periment  and  theory.  However,  we  do  have  a  qual¬ 
itative  agreement.  Importantly,  the  measurement 
configuration  that  was  predicted  by  the  SAF  formal¬ 
ism  to  be  optimum  is  indeed  the  optimum  configura¬ 
tion. 

Configurations  1-4  are  called  conventional  be¬ 
cause  they  are  in  use  in  conventional  scatterometry 
discussed  in  Section  1.  For  these  configurations  the 
conical  diffraction  simplifies  to  diffraction  in  a  plane, 
and  the  calculations  of  the  diffracted  fields  are  sim¬ 
pler.  Because  the  calculation  time  is  an  issue  for 
scatterometry,  these  configurations  are  used  more 
often  than  the  rest. 

The  fact  that  we  are  able  to  have  subnanometer 
precision  can  be  considered  a  success.  However, 
some  concerns  are  raised  by  the  large  discrepancy 
among  the  results  for  the  linewidth  obtained  through 
different  measurement  configurations.  The  discrep¬ 
ancy  is  larger  than  the  precision,  theoretical  or  ex¬ 
perimental,  displayed  in  Table  2,  which  leads  us  to 
the  conclusion  that  we  are  strongly  affected  by  sys¬ 
tematic  errors.  Systematic  errors  could  be  due  to 
either  internal  or  external  causes.  Internal  causes 
might  be  from  the  simplified  modeling  of  the  grating 
(a  rectangular  profile  was  assumed,  but  the  real  pro¬ 
file  may  be  more  complicated)  and  nonuniformity  of 
the  grating.  External  causes  might  be  deficient 
alignment,  inaccurate  optical  parameters,  imperfec¬ 
tion  of  the  optical  elements  used  in  the  system,  and 
the  fact  that  the  ellipsometric  scatterometer  makes 
the  normalization  measurement  in  different  condi¬ 
tions  than  the  regular  measurements.  Also,  there 
could  be  factors  that  I  was  unable  to  detect.  How¬ 
ever,  the  situation  is  reasonable  for  two  reasons. 
First,  to  a  limit,  the  precision  and  the  accuracy  can  be 
uncoupled.  Second,  the  industry  is  more  interested 
in  precision  that  in  accuracy.  The  manufacturers 
want  to  know  how  well  their  structures  are  repro¬ 
duced  rather  than  the  exact,  absolute  dimensions  of 
these  structures.  Taking  into  account  these  two 
considerations,  we  can  ignore  up  to  a  point  the  dis¬ 
crepancies  shown  in  the  column  of  the  linewidth  and 
focus  on  the  precision  columns.  These  columns  in¬ 
dicate  a  satisfactory  situation.  Of  course,  a  solution 
to  the  inaccuracy  problem  in  Table  2  would  be  wel¬ 
comed. 

Predictions  for  a  sample  with  small  pitch  and  line- 
width  were  calculated  and  are  shown  in  Table  3  be¬ 
cause  it  is  of  interest  to  see  if  scatterometry  and  the 
SAF  method  are  still  applicable  when  the  microelec¬ 
tronics  features  are  further  decreased.  The  predic¬ 
tion  for  the  optimum  sensitivity  is  not  as  good  as  for 
the  sample  XL6,  but  the  precision  is  still  acceptable. 
The  conventional  configurations  1  and  2  from  Table  3 
correspond  to  the  two  lower  corners  of  Fig.  7.  Note 
that  the  optimum  configuration  is  not  much  better 
than  a  conventional  configuration.  In  this  case  the 
optimization  procedure  did  not  bring  any  improve¬ 
ment.  In  the  case  of  two-parameter  estimation,  95% 
probability  corresponds  to  a  Ay  of  2.44775. 
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Table  3.  Numerical  Results  for  the  Sensitivity  Analysis  of  the  Sample 
P200  for  Ellipsometric  Scatterometer  Measurements  at  A.  =  632.8  nm 


Label 

Measurement 

Configuration 

a{lw)  (A) 
(Theory) 

<K°) 

p(°) 

A(°) 

1  (conventional) 

0 

0 

0 

5.0 

2  (conventional) 

0 

90 

90 

1.4 

3  (conventional) 

90 

0 

0 

1.5 

4  (conventional) 

90 

90 

90 

8.4 

5  (optimum) 

0 

-78 

102 

1.4 

9.  Conclusion 

In  this  paper  scatterometry  was  investigated  from 
the  point  of  view  of  sensitivity  to  provide  methods  to 
assess  optimum  measurement  configurations.  The 
primary  contribution  of  this  paper  is  the  SAF,  a  for¬ 
malism  that  analyzes  the  sensitivity  of  the  fitting  for 
scatterometry  data  in  a  straightforward  and  rigorous 
manner.  The  formalism  is  not  restricted  to  scat¬ 
terometry.  It  is  presented  in  the  most  general  form 
but  was  designed  for  scatterometry.  The  first  im¬ 
portant  feature  of  SAF  is  the  statistical  analysis  that 
provides  an  analytic  expression  to  determine  preci¬ 
sion  in  terms  of  the  partial  derivatives  of  the  mea¬ 
surable  with  respect  to  the  parameters  to  be 
determined.  This  expression  also  involves  the  mea¬ 
surement  precision,  which  has  to  be  known  a  priori. 
Extrapolation  of  the  error  patterns  found  in  certain 
measurement  configurations  into  an  empirical  error 
function  for  all  the  possible  measurement  configura¬ 
tions  is  required.  The  most  important  and  original 
aspect  of  SAF  is  the  scanning  of  the  sensitivity  of  the 
measurement  configurations  over  the  domain  of  the 
degrees  of  freedom  of  the  experimental  arrangement 
in  search  for  the  optimum  configuration.  Once  this 
optimum  configuration  is  found,  a  comparison  with 
experimental  results  is  undertaken.  A  qualitative 
agreement  between  the  theoretical  predictions  for 
sensitivity  of  SAF  and  actual  experimental  results 
was  found.  The  optimum  configuration  is  sample 
dependent. 

The  SAF  method  was  applied  to  the  ellipsometric 
scatterometer,  an  improved  version  of  the  conven¬ 
tional  scatterometer.  The  experimental  arrange¬ 
ment  contains  polarizers  with  variable  polarization 
angles  that,  together  with  the  azimuth  of  the  sample, 
constitute  the  degrees  of  freedom  of  the  arrangement. 
The  incidence  angle  is  the  varied  parameter  in  the 
measurement  curves  generated  by  the  scatterometer. 
In  the  examples  studied,  SAF  revealed  that  scat¬ 
terometry  can  provide  the  resolution  required  by  the 
microelectronics  industry,  with  the  632.8-nm  He-Ne 
laser,  provided  that  the  optimum  measurement  con¬ 
figuration  is  utilized.  Despite  the  good  precision 
and  reproducibility  demonstrated  by  individual  mea¬ 
surement  configurations,  some  discrepancy  among 
the  results  provided  by  them  was  noted  and  attrib¬ 
uted  to  possible  sources  of  systematic  errors,  intrinsic 
or  extrinsic  to  the  sample.  Because  the  microelec¬ 


tronics  industry  is  primarily  interested  in  reproduc¬ 
ibility  rather  than  accuracy,  the  discrepancy  was 
deemed  tolerable  to  the  extent  it  does  not  interfere 
severely  with  the  estimation  precision  of  the  geomet¬ 
ric  parameters  of  the  sample.  Symmetry  consider¬ 
ations  allowed  us  to  focus  only  on  the  nonredundant 
measurement  configuration  and  considerably  re¬ 
duced  the  amount  of  work  required  by  the  optimiza¬ 
tion  procedure. 
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ABSTRACT 

In  this  article  are  demonstrated  the  nanoscale  capabilities  of  scatterometry,  an  optical  metrology  technique  for  the  de¬ 
termination  of  the  grating  parameters.  Notably,  for  both  ellipsometric-scatterometry  and  phase-modulation  scatterometry 
we  are  now  currently  using  shorter  wavelength  laser  sources,  namely  325  nm  in  UV.  We  added  to  the  scatterometer  the 
ability  to  perform  ^-scanning  measurements.  This  capability  adds  new  degrees  of  freedom  to  the  sensitivity  optimization 
procedure. 

Because  the  sensitivity  analysis  formalism  leads  us  to  the  conclusion  that  sensitivity  is  provided  mainly  by  anomalies,  a 
rigorous  analysis  of  a  certain  type  of  anomaly,  the  most  likely  to  occur  for  our  type  of  samples,  was  made  using  Modal 
Analysis.  Our  analysis  allows  for  the  prediction  of  the  anomaly  position.  This  procedure  allows  us  to  find  the  optimum 
measurement  configuration  much  faster  than  the  SAF  formalism  used  in  the  past. 

Keywords:  scatterometry,  sensitivity,  grating  anomalies,  ellipsometry,  phase  modulation 


Fig.  1.  Example  of  test  structure  suited  for  scatterometry  studies.  A 
unidimensional  grating  is  placed  on  top  of  a  stack  of  thin  films  ending 
with  a  silicon  substrate.  The  materials  in  the  figure  are  common  for 
microelectronics  samples  The  grating  parameters  in  the  figure  are  the 
linewidth  Iw,  the  grating  height  hg,  the  pitch  A,  and  the  ARC  height. 
Several  diffraction  orders  are  emitted  from  the  sample  after  reflection 
due  to  the  periodic  nature  of  the  grating. 


1.  INTRODUCTION 

The  development  of  nanolithography 
requires  a  nanoscale  metrology  as 
well  for  the  characterization  of  the 
microscopic  structures.  Specifically, 
we  use  scatterometry,  an  optical  me¬ 
trology  technique,  for  characteriza¬ 
tion  of  surface-relief  gratings  used  as 
test  structures.  There  is  a  long  his¬ 
tory  of  research  dedicated  to  scat¬ 
terometry  at  the  University  of  New 
Mexico  1"14  In  Fig.  1  a  typical  test 
structure  is  presented  and  the  sample 
parameters  are  defined.  In  Fig.  2  we 
have  the  schematics  of  an  ellip- 
sometric  scatterometer,  the  experi¬ 
mental  arrangement  that  we  used  in 
our  measurements.  There  are  two 
version  in  use  of  this  scatterometer, 
one  with  variable  incidence  angle  (0) 
and  another  with  variable  azimuth 
(6).  Another  two  versions  of  the 
scatterometer  are  obtained  if  we  add 
in  the  experimental  arrangement  a 
phase-modulator  (represented  with 
faded  hues  in  Fig.  2).  Then  we  have 
a  phase-modulation  scatterometer 
with  incidence  angle  scanning  and  a 
phase-modulation  scatterometer  with 
azimuth  scanning.  Scatterometry 


Correspondence:  Email:  logofatu@chtm.unm.edu;  Telephone  505  272  7808;  Fax  505  272  7801 
Permanent  address:  National  Institute  of  Laser,  Plasma  and  Radiation  Physics  (INFLPR),  Bucharest-Magurele,  PO-Box 
MG-36,  76900,  Romania 


determines  from  reflectivity  measurements  the  parameters  of  the  gratings.  Among  these  geometrical  parameters  the 
most  important  is  the  critical  dimension  (CD),  or  the  linewidth  (lw)  of  the  grating  ridges.  The  determination  is  done  by 
fitting  reflectance  type  measurables  to  theoretical  curves  calculated  with  the  Rigorous  Coupled-Wave  Analysis  (RCWA) 
or  Modal  Analysis. 


2.  AZIMUTH-SCANNING  SCATTEROMETER 

In  the  past  scatterometry  measurements  were  done  in  our  lab  using  only  incidence  angle  (0)  scanning  ^  The  sam¬ 
ple  holder  (see  Fig.  2)  was  rotating  about  its  vertical  axis  and  the  detector’s  arm  was  moving  with  twice  the  speed  (2  0) 
about  the  same  vertical  axis  of  the  sample  holder  so  that  the  spot  of  the  reflected  specular  order  stays  on  the  detector. 

Recently  we  added  to  the  scat- 
terometer  the  ability  to  rotate  about 
the  normal  axis  to  its  surface  by 
attaching  the  sample  holder  to  a 
rotation  stage.  This  addition  allows 
for  making  azimuth  ((]))  scanning 
measurements.  There  are  some 
obvious  advantages  for  this  tech¬ 
nique.  One  advantage  is  that  the 
scatterometer  is  mechanically  more 
stable.  Only  the  sample  holder  is 
rotating  as  opposed  to  0-scanning 
where  both  the  sample  holder  and 
the  detector  were  moving.  Me¬ 
chanical  vibrations  are  an  impor¬ 
tant  source  of  random  measurement 
errors  and  the  drift  of  the  spot’s 
position  on  the  detector  is  an  im¬ 
portant  source  of  systematic  errors. 
Both  these  types  of  errors  are  re¬ 
duced  by  using  ^-scanning.  An¬ 
other  advantage  is  the  fact  that  now 
we  have  more  choices  of  measure¬ 
ment  configurations  and  therefore 
the  optimization  procedure  of  the 
measurement  configuration  using 

the  Sensitivity  Analysis  for  Fitting  (SAF)  formalism^’  6  js  \[^Q\y  to  yield  superior  results  compared  to  the  past.  We 
have  a  whole  new  class  of  ^-scanning  measurement  configurations  and  one  of  them  may  be  more  sensitive  than  the  0- 
scanning  configurations  that  in  the  past  were  the  only  class  of  measurement  configurations  that  could  be  investigated. 
Besides  our  desire  to  reduce  random  and  systematic  measurement  errors  there  is  another  reason  why  we  added  this  new 
capability  to  the  scatterometer.  In  converting  the  scatterometer  to  measurements  at  UV  wavelengths  one  of  the  difficul¬ 
ties  we  encountered  was  the  large  size  of  the  detectors  able  to  measure  UV  light.  Only  photomultipliers  tubes  (PMT) 
fulfill  all  the  requirements  for  UV  scatterometry,  such  as  quick  reaction  time,  sensitivity  and  large  frequency  bandwidth. 
Their  disadvantage  is  their  large  size,  which  makes  difficult  to  move  the  detector’s  arm.  We  needed  a  measurement  con¬ 
figuration  in  which  the  detector  is  immobile  and  this  measurement  configuration  is  ^-scanning. 


Fig.  2  Schematic  description  of  the  ellipsometric  scatterometer.  Two 
versions  of  this  scatterometer  are  in  use:  the  incident  angle  (0)  and 
the  azimuth  (c|>)  scanning  scatterometers.  The  addition  of  a  phase 
modulator  (represented  in  faded  hues)  converts  the  experimental  ar¬ 
rangement  into  the  phase-modulation  scatterometer. 
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Fig.  3  The  sensitivity  increases  when  the  wavelength  decreases. 


3.  UV  SCATTEROMETRY 

In  the  past  we  used  successfully  He-Ne  lasers  (X=633  nm  wavelength,  red  color  light)  as  light  sources  for  the  scat- 
terometry  optical  measurements.  He-Ne  lasers  have  many  useful  properties  such  as  power  stability  and  good  coherence 
and  they  have  low  cost.  These  properties  are  necessary  for  good  measurement  precision.  Other  light  sources  generally 
do  not  have  such  high  power  stability  and  coherence  and  they  are  more  expensive.  For  UV  wavelengths  domain  there 
are  supplementary  problems  such  as  lack  of  high  quality  optical  components  (polarizers  and  phase  retarders)  that  are 
integral  components  of  the  scatterometer  (see  Fig.  2).  However,  light  sources  of  shorter  wavelengths  have  the  advantage 
of  making  the  measurables  more  sensitive  with  respect  to  changes  of  the  gratings  parameters.  The  determination  preci¬ 
sion  of  the  grating  parameters  is  made  up  by  the  measurement  precision  of  the  measurables  and  the  sensitivity  of  the 

measurables  with  respect  to  changes  of  the  grating  parameters,  which  can  be  uncoupled  to  a  certain  extent  3.  Going  to 
shorter  wavelength  causes  the  measurement  precision  to  decrease,  as  is  explained  above  and  is  shown  quantitatively  in 
Eq.  1  below. 

a2  =  1 .0  x  1 0“8  + 1 .3  xl(T5  R2 ,  X=633  nm,  (1.1) 

a2  =  2.0  x  1 0“6  + 1 .0  x  10“4  R2 ,  ?i=442  nm,  (1 .2) 

g2=  1.0x10'5+5.5x10'4  R2  ,  A,=325  nm.  (1.3) 

where  R  is  the  measurable,  generally  a  reflectometric  quantity.  The  sensitivity,  however,  increases,  and  at  a  certain  de¬ 
gree  of  miniaturization  of  the  gratings  the  growth  in  sensitivity  compensates  and  surpasses  the  loss  in  measurement  pre¬ 
cision,  as  one  can  see  in  Fig.  3.  The 
miniaturization  trend  in  microlitho¬ 
graphy  eventually  forced  us  to  con¬ 
vert  the  scatterometer  to  shorter 
wavelengths. 

First  we  went  down  to  442  nm  (blue 
color  light),  using  a  He-Cd  laser. 

This  conversion  does  not  require 
dramatic  changes  in  the  experimental 
arrangement.  The  polarizers  used  for 
He-Ne,  being  broad  range,  could  still 
be  used  at  442  nm.  We  had  to  pur¬ 
chase  new  phase  retarders  though, 
but  their  quality  was  comparable 
with  the  quality  of  the  phase  retard¬ 
ers  for  632  nm. 

It  turns  out,  however,  in  the  condi¬ 
tions  of  ever  shrinking  grating  ge¬ 
ometry,  for  samples  with  pitches 
below  200  nm,  even  a  wavelength  of 
442  nm  may  not  be  enough.  The 
condition  for  having  good  sensitivity, 
as  it  will  be  detailed  in  the  next  sec¬ 
tion,  is  to  have  grating  anomalies, 
and  these  anomalies  appear  when  the 
wavelength  is  at  most  twice  the  pitch. 
Therefore  we  had  to  go  further  on  the 
wavelength  scale  down  to  the  UV  domain,  namely  to  325  nm,  another  line  of  the  He-Cd  laser.  Here  we  have  encoun¬ 
tered  supplementary  difficulties.  Both  the  polarizers  and  the  phase  retarders  had  to  be  changed  and  custom  made  for  325 
nm  and  they  don’t  have  the  high  quality  of  the  optical  components  for  visible  light  and  they  are  expensive.  We  had  to 
replace  also  the  silicon  photodiode  detector  with  a  photomultiplier  tube.  The  large  size  of  the  PMT  requires  keeping  the 
detector  arm  immobile  and  this  forced  us  to  add  azimuth  rotation  capability  to  the  scatterometer.  We  have  an  illustration 
of  UV  scatterometry  application  in  Fig.  4.  The  sample  examined  here  was  labeled  “XL3”  and  the  results  of  the  fitting 
are  shown  in  the  caption. 


XL3T60P 0A0L442 
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Fig.  4  Example  of  fitting  of  data  obtained  through  UV  scatterometry 
measurements  (k=325  nm).  The  measurement  configuration  is 
scanning  at  0=60°,  input  polarization  P=0°,  output  polarization  A=0°. 
The  results  of  the  fitting  are  lw=117.9  nm,  hg=566.6  nm,  hARc=174.3 
nm.  The  experimental  data  is  plotted  as  a  dotted  curve  and  the  best 
theoretical  fit  as  continuous  curve. 


4.  GRATING  ANOMALIES 

One  important  concrete  aspect  in  the  sensitivity  considerations  is  the  presence  of  grating  anomalies.  There  are  various 
types  of  anomalies  but  they  all  cause  a  rapid  variation  of  the  light  reflectance  with  respect  to  the  grating  parameters, 
which  is  another  way  of  saying  that  they  increase  the  sensitivity  of  the  scatterometry  measurements.  Actually  the  main 
concrete  reason  for  the  loss  of  sensitivity  when  the  geometry  of  microlithographic  structures  shrinks  is  precisely  the  ab¬ 
sence  of  anomalies.  The  search  for  optimum  sensitivity,  done  according  to  the  SAF  algorithm  shows  that  the  optimum 
sensitivity  is  reached  when  the  0-scan  or  (|)-scan  intersects  as  much  as  possible  an  anomaly,  as  seen  in  Fig.  5.  By  de¬ 
creasing  the  wavelength  down  to  the  UV  domain,  the  anomalies  reappear  and  the  sensitivity  is  boosted  up.  There  is  one 
specific  type  of  anomaly  with  which  we  concerned  ourselves,  the  last  to  disappear  when  the  geometry  is  shrinking.  This 
anomaly  is  the  most  likely  to  help  in  boosting  the  sensitivity  in  the  conditions  of  small  pitch  samples.  Moreover  this 
anomaly  cause  a  sharper  variation  of  the  reflectance  versus  the  grating  parameter  than  the  anomaly  caused  by  the  emer¬ 
gence  of  the  first  reflection  order.  This  is  the  same  as  saying  that  this  anomaly  has  a  stronger  impact  on  the  sensitivity 
than  the  first  order  emergence  anomaly,  as  it  can  be  seen  in  Fig.  5.  In  the  past  we  used  a  simplified  model  to  account  for 
the  existence  of  this  anomaly,  in  which  we  used  Effective  Medium  Theory  approximation  in  the  zeroth  order.  In  this 
model  the  anomaly  was  due  to  a  Fabry-Perot  resonance  of  the  +1  diffracted  order  in  the  grating.  This  approximate  model 
predicted  well  the  angular  position  of  the  anomaly  (0,  ([>)  but  did  predict  correctly  whether  the  anomaly  appears  or  not. 

A  rigorous  analysis  of  the  anomaly  can  be  made  with  the  help  of  Modal  Analysis.  It  turns  out  that  this  anomaly  is  indeed 
a  Fabry-Perot  resonance  but  of  the  +1  modal  order,  and  only  when  the  electric  field  between  the  ridges  is  not  evanes¬ 
cent.  The  values  of  6  and  0  at  which  the  resonance  occurs  can  be  calculated  from  the  dispersion  equation  of  Modal 
Analysis,  the  equation  that  determines  the  allowed  modes  in  the  grating, 


cos 


[k{  /w]cos[/c2  (A  -  /w)] 


—  +  ^-^-  sin[/c1/w]sin[/c2  (A-/w)]  =  cos(&JC>/J  A) ,  (2) 

2  £  fcr)  /Cl 


where  Kj  and  k2  are  the  wave  vectors  of  the  mode  n  in  the  ridge  medium  (photoresist)  and  the  medium  between  the 
ridges  (air)  respectively, 


ki = i£'M 


(8) 


Introducing  Eq.  (8)  in  Eq.  (2)  we  obtain  two  equations  in  two  variables,  6 ,  0  corresponding  to  the  TE  and  TM 
cases.  Solving  these  equations  we  obtain  the  two  curves  in  Fig.  4,  the  red  and  blue  ones.  There  is  a  supplementary  con¬ 
dition  that  must  be  fulfilled  in  order  to  have  resonance,  namely  the  electric  transversal  field  between  ridges  must  be  non- 
evanescent.  This  condition,  together  with  the  distribution  of  the  fields  inside  the  grating  illustrated  in  Fig.  5,  explains 
why  the  Fabry-Perot  resonance  occurs  only  for  mode  +1. 

In  actual  measurements  we  adjust  the  measurement  configuration  of  the  scatterometer  so  that  this  anomaly  occurs  and 
hence  we  have  optimum  sensitivity. 
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Fig.  5  The  Fabry-Perot  anomaly  contour  calculated  with  the  dispersion  equation 
for  sample  XL6.  The  thick  grey  strip  represents  the  peak  areas  of  the  partial 

derivative  \dRpp/dlw\ .  The  anomaly  due  to  the  emergence  of  the  +1  reflection 

order  has  much  less  impact  on  the  sensitivity  than  the  Fabry-Perot  anomaly. 
The  importance  of  the  anomaly  is  shown  by  the  fact  that  the  optimum  0- 
scanning  and  the  optimum  ^-scanning  measurement  configurations  are  those 
that  intersects  as  much  as  possible  the  ridge  of  the  anomaly. 


5.  CONCLUSION 

In  this  paper  are  demonstrated  the  nanoscale  capabilities  of  scatterometry,  an  optical  metrology  technique  for  the  deter¬ 
mination  of  the  grating  parameters.  Lately  we  concentrated  on  improving  and  expanding  the  existing  capabilities  of 
scatterometry  at  University  of  New  Mexico.  Notably,  for  both  ellipsometric-scatterometry  and  phase -modulation  scat¬ 
terometry  we  are  now  currently  using  shorter  wavelength  laser  sources,  namely  we  switched  from  He-Ne  laser  (633  nm) 
to  He-Cd  laser  (442  nm  and  325  nm).  We  are  doing  now  what  we  call  “blue  and  UV”  scatterometry.  Related  to  the  con- 


version  to  UV  but  also  with  an  independent  value  of  its  own,  we  added  to  the  scatterometer  the  ability  to  perform  c|>- 
scanning  measurements.  This  capability  adds  new  degrees  of  freedom  to  the  sensitivity  optimization  procedure. 

Because  the  sensitivity  analysis  formalism  leads  us  to  the  conclusion  that  sensitivity  is  provided  mainly  by  anomalies,  a 
rigorous  analysis  of  a  certain  type  of  anomaly,  the  most  likely  to  occur  for  our  type  of  samples,  was  made  using  Modal 
Analysis.  It  turns  out  this  is  a  Fabry-Perot  resonance  of  the  +1  modal  order  when  the  fields  between  the  ridges  are  not 
evanescent.  Our  analysis  allows  for  the  prediction  of  the  anomaly  position.  This  procedure  allows  us  to  find  the  opti¬ 
mum  measurement  configuration  much  faster  than  the  SAF  formalism  used  in  the  past. 
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Nanoheteroepitaxy  has  recently  been  proposed  as  a  technique  for  significantly  extending  the 
thickness  of  pseudomorphic  growth  in  mismatched  heterostructures.  This  letter  reports  the 
experimental  application  of  nanoheteroepitaxy  for  the  growth  of  GaN  on  patterned  (111)  oriented 
silicon-on-insulator  substrates  by  organometallic  vapor  phase  epitaxy.  Transmission  electron 
microscopy  reveals  that  the  defect  concentration  decays  rapidly  away  from  the  heterointerface  as 
predicted  by  nanoheteroepitaxy  theory.  The  melting  point  of  the  nanoscale  islands  is  found  to  be 
significantly  reduced,  enhancing  substrate  compliance  and  further  reducing  the  strain  energy  in  the 
GaN  epitaxial  layer.  ©  2000  American  Institute  of  Physics.  [S0003-695 1(00)02607-3] 


Nanoheteroepitaxy1  (NHE)  exploits  three-dimensional 
stress  relief  mechanisms  that  are  available  to  nanoscale  is¬ 
lands  to  reduce  the  strain  energy  in  lattice-mismatched  ma¬ 
terial  systems.  The  NHE  approach  consists  of  patterning  a 
substrate  into  a  two-dimensional  array  of  10-300  nm  sized 
nucleation  islands  and  then  selectively  growing  epitaxial  ma¬ 
terial  vertically  on  the  islands  (Fig.  1,  point  a).  This  is  fol¬ 
lowed  by  lateral  overgrowth  to  achieve  coalescence  (Fig.  1, 
point  b).  In  contrast  to  current  epitaxial  lateral  overgrowth,2 
and  pendeo3  approaches,  where  the  patterning  is  on  the  scale 
of  1-10  jam,  the  nanoscale  patterning  in  NHE  allows  en¬ 
hanced  substrate  compliance  mechanisms  to  operate.  For  ex¬ 
ample,  in  addition  to  strain  partitioning  similar  to  that  found 
in  planar  compliant  structures,4  stress  in  the  NHE  sample 
decays  exponentially  away  from  the  heterointerface  with  a 
characteristic  decay  length  proportional  to  the  diameter  of 
the  island.  Strain  partitioning  and  stress  decay  interact  syn- 
ergistically  to  significantly  lower  the  strain  energy  in  nano- 
structural  lattice  mismatched  material  systems.  NHE  theory 
predicts  that  mismatch  dislocation  formation  can  be  elimi¬ 
nated  from  materials  systems  with  a  lattice  mismatch  in  the 
range  of  0%-4%.  In  material  systems  with  larger  lattice  mis¬ 
match  such  as  GaN  on  Si  (20%  lattice  mismatch)  defects  are 
probably  unavoidable,  however  as  we  show  below,  NHE  re¬ 
sults  in  a  significant  reduction  in  the  local  defect  density  in 
nanostructural  GaN/Si  islands. 

Nanoheteroepitaxy  was  investigated  using  organometal¬ 
lic  vapor  phase  epitaxy  (OMVPE)  growth  of  GaN  on  the 
patterned  surface  silicon  layer  of  commercial  (111)  silicon- 
on-insulator  (SOI)  wafers  manufactured  by  the  separation  by 
implantation  of  oxygen  (SIMOX)  process.  The  SOI  wafers 
were  patterned  using  interferometric  lithography  and  reactive 
ion  etching5,6  to  form  a  square  two-dimensional  array  of  sili¬ 
con  islands  on  top  of  a  field  of  Si02  as  illustrated  in  Fig.  1. 
The  height  of  the  islands  was  40  nm  while  the  diameter 
varied  between  80  and  300  nm.  Island  periodicity  was  360 


and  900  nm.  GaN  was  then  selectively  grown  on  the  silicon 
islands  using  a  three  step  sequence  consisting  of  an  in  situ 
cleaning  at  a  temperature  of  878  °C  for  5  min  under  a  hydro¬ 
gen  atmosphere,  a  low  temperature  (543  °C)  nucleating  layer 
deposition,  and  finally  main  epilayer  growth  at  high  tempera¬ 
ture  (1031  °C).  This  yielded  a  GaN  island  thickness  of  100- 
200  nm.  The  exposed  Si02  layer  in  the  SOI  served  as  a 
convenient  mask  for  the  selective  growth  of  GaN  on  silicon. 
The  SOI  structure  allowed  efficient  implementation  of  the 
patterning  and  selective  growth  requirements  of  the  NHE 
approach. 

In  addition  to  exploiting  three-dimensional  stress  relief 
mechanisms  due  to  geometrical  effects,  nano  structuring  can 
also  enable  other  stress  relief  effects  due  to  the  large  surface- 
to- volume  ratio  of  nanoscale  structures.  For  example,  in  ma¬ 
terial  systems  in  which  the  epitaxial  material  has  a  larger 
melting  point  than  the  substrate,  compliance  can  be  actively 
enhanced  due  to  softening  of  the  nanoscale  substrate  islands 
at  growth  temperatures  well  below  the  substrate  bulk  melting 
point.  Evidence  of  substrate  softening  is  shown  in  Fig.  2 
which  compares  the  size  and  shape  of  silicon  nanometer 
sized  islands  (a)  without  and  (b)  with  heat  treatment  at 
1110°C  for  10  min.  Figure  2  shows  that  mass  transport  has 
occurred  due  to  melting  in  the  heat-treated  sample.  This  in¬ 
dicates  that  the  melting  point  for  these  silicon  islands  has 
been  reduced  from  its  bulk  value  of  1412  °C  to  less  than 
1110°C.  The  in  situ  cleaning  temperature  previously  used 
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FIG.  1 .  Shows  the  patterned  SOI  substrate  with  islands  created  in  the  upper 
Si  layer.  The  selective  growth  of  GaN  is  also  illustrated.  A  defected  region 
is  located  at  the  heterointerface. 
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FIG.  2.  Comparison  of  silicon  morphologies  on  patterned  SOI  substrate 
before  (a)  and  after  (b)  heating  to  1110  °C.  The  change  of  shape  indicates 
that  the  silicon  islands  have  melted. 

for  GaN  growth  on  sapphire7  is  1100°C,  however,  because 
of  this  observed  melting  we  have  now  reduced  the  in  situ 
clean  temperature  to  878  °C.  A  reduction  of  melting  point  is 
anticipated  from  recent  results  on  nanophase  materials  which 
has  shown  that  the  materials  properties  (including  the  melt¬ 
ing  point)  of  nanometer  scale  particles  are  significantly 
modified.8  In  the  case  of  NHE  this  reduced  melting  point  can 
be  utilized  during  growth  to  soften  the  silicon  islands  (i.e., 
reduce  their  elastic  modulus)  and  enhance  substrate  compli¬ 
ance.  We  call  this  effect  active  compliance. 

While  not  explicitly  discussed  in  the  original  NHE 
model,1  active  compliance  can  be  incorporated  in  the  strain 
partitioning  equations  by  adjusting  the  elastic  compliance  ra¬ 
tio,  K ,  given  by 

K=Yepi(l-vsuh)/[Ysuh(l-vQpi)l  (1) 

where  Y  is  Young’s  modulus  and  v  the  Poisson’s  ratio  in  the 
epitaxial  material  and  the  substrate  materials.  At  low  tem¬ 
peratures,  well  below  the  actual  melting  point  of  the  epitaxial 
and  substrate  materials,  the  Young’s  moduli  of  these  materi¬ 
als  are  equal  to  their  respective  bulk  values.  For  a  higher 
growth  temperature,  near  the  reduced  melting  point  of  the 
substrate  island,  Tsub  will  be  reduced  causing  K  to  be  in¬ 
creased  above  its  bulk  value,  K0  .  The  impact  of  active  com¬ 
pliance  on  strain  partitioning  is  illustrated  in  Fig.  3,  which 
shows  the  effect  that  increasing  K  has  on  the  normalized 
strain,  e/eT,  in  the  epitaxial  and  substrate  layers,  where  eT  is 
the  total  mismatch  strain.  It  is  not  known  at  this  time  by  how 
much  the  value  of  K  can  be  increased  beyond  K0  ,  however, 
to  illustrate  the  active  compliance  effect,  an  increased  value 
of  5 K0  was  arbitrarily  chosen  for  comparison  in  Fig.  3.  The 
graph  shows  that  active  compliance  enhances  strain  parti- 


FIG.  3.  Plots  the  normalized  interfacial  strain  in  the  epilayer  and  substrate 
vs  epilayer  thickness.  Without  active  compliance  (solid  lines)  the  final 
amount  of  strain  in  the  epilayer  and  substrate  are  similar.  With  active  com¬ 
pliance  (dashed  lines)  much  more  of  the  total  strain  is  transferred  to  the 
substrate. 


FIG.  4.  Cross-sectional  TEM  micrographs  of  GaN  grown  on  Si  islands  on 
the  patterned  SOI  substrate.  Both  figures  show  defects  at  the  heterointerface 
but  high  quality  GaN  away  from  the  interface.  The  electron  diffraction  pat¬ 
tern  reveals  the  epitaxial  relationship  between  GaN  and  Si. 


tioning  by  increasing  the  partitioning  at  a  given  thickness 
and  increasing  the  ultimate  degree  of  strain  partitioning.  In 
addition,  active  compliance  works  to  directly  lower  the  total 
strain  energy  in  the  system  when  Tsub  is  reduced  because 
strain  energy  is  proportional  to  the  Young’s  modulus.1 

Island  size  is  a  critical  parameter  in  NHE  as  it  controls 
both  the  elastic  modulus  through  the  actual  melting  point  of 
the  substrate  island  and  the  characteristic  decay  length  of  the 
strain  energy.  During  selective  epitaxy,  the  island  size  and 
spacing  will  also  control  the  effective  growth  rate  at  the  is¬ 
land. 

Cross-section  transmission  electron  microscopy  (XTEM) 
micrographs  of  GaN  growth  on  two  silicon  islands,  one  80 
nm  in  diameter  and  the  other  280  nm  in  diameter,  are  shown 
in  Fig.  4.  In  both  cases  the  defects  are  concentrated  near  the 
GaN/Si  heterointerface  and  the  defect  density  decays  rapidly 
to  vanish  at  approximately  20-50  nm  from  the  interface.  In 
Fig.  4(a),  strain  field  contours  can  be  observed  in  the  defect- 
free  region.  The  general  shape  of  the  strain  contours  indi¬ 
cates  that  the  strain  originates  at  the  GaN/Si  heterointerface. 
The  contour  lines  broaden  and  separate,  away  from  the  het¬ 
erointerface,  indicating  a  decaying  strain  field  in  agreement 
with  the  predictions  of  NHE  theory.  Fateral  growth  of  GaN, 
extending  beyond  the  edges  of  the  silicon  island,  is  also  ob- 
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FIG.  5.  High-resolution  TEM  micrographs  of  (a)  GaN/Si  heterointerface 
and  (b)  GaN  away  from  the  interface. 

served.  The  electron  diffraction  pattern  from  the  sample  in 
Fig.  4(a)  (inset)  reveals  the  relative  crystallographic  orienta¬ 
tion  of  GaN  and  Si  as  GaN[0001]IISi[lll].  Thus  the  silicon 
island  is  acting  as  an  epitaxial  template.  Figure  4(b)  shows  a 
threading  dislocation  in  the  GaN,  indicating  that  the  mecha¬ 
nism  that  creates  threading  dislocations  was  not  suppressed 
sufficiently  to  prevent  the  formation  of  the  threading  dislo¬ 
cation  in  this  particular  island.  Figures  4(a)  and  4(b)  also 
show  that  GaN  grows  preferentially  on  the  (111)  surface  of 
the  silicon  island,  suggesting  that  selective  growth  via  crys¬ 
tallographic  orientation  (i.e.,  use  surface  patterning  instead 
of  using  an  oxide  mask)  may  be  achievable  for  this  material 
system. 

High-resolution  XTEM  (Fig.  5)  reveals  stacking  faults  in 
the  GaN  near  the  interface  [Fig.  5(a)]  and  a  highly  defected 
silicon  layer  at  the  interface.  Segmented  silicon  lattice  planes 
lying  within  the  2.0-nm-thick  defected  layer,  in  some  areas 
penetrating  as  much  as  three  lattice  planes  (0.95  nm)  into  the 
layer,  indicate  that  the  layer  is  composed  of  silicon,  implying 
that  much  of  the  mismatch  strain  is  being  taken  up  by  the 


silicon.  Figure  5(b)  shows  that  GaN  is  locally  of  high  quality 
away  from  the  interface. 

In  this  experimental  demonstration  of  nanoheteroepit¬ 
axy,  results  are  presented  on  the  selective-area  OMVPE 
growth  of  GaN  on  a  novel,  patterned  SOI  substrate.  A  re¬ 
duced  melting  point  is  observed  for  nanoscale  silicon  islands 
enabling  active  compliance  effects  that  work  to  further  en¬ 
hance  strain  partitioning  and  reduce  strain  energy.  Electron 
microscopy  reveals  that  the  defect  concentration  decays  rap¬ 
idly  away  from  the  GaN/Si  heterointerface,  in  agreement 
with  NHE  theory.  Beyond  this  defected  region,  the  GaN  ap¬ 
pears  to  be  of  high  quality.  Current  work  is  focusing  on 
evaluating  the  quality  of  the  material  grown  by  the  NHE 
approach  including  the  coalescence  region  and  other  effects 
such  as  differential  thermal  expansion  and  will  be  discussed 
elsewhere. 

Partial  support  for  this  work  was  provided  by  the  Air 
Force  Office  of  Scientific  Research.  XTEM  images  were  ac¬ 
quired  with  a  JEOL  2010  high-resolution  analytical  electron 
microscope  at  the  University  of  New  Mexico. 
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Nanoheteroepitaxy  is  a  fundamentally  new  epitaxial  approach  that  utilizes  three-dimensional  stress 
relief  mechanisms  available  to  nanoscale  hetero structures  to  eliminate  defects  provided  the  island 
diameter  is  below  a  critical  value  2 lc.  Analysis  shows  that  2/c— (  15-30)  X  the  critical  thickness 
hc .  In  the  case  of  GaAs  on  Si  (—4%  misfit),  2/c— 40nm.  In  material  systems  such  as  GaN  on  Si 
(—20%  misfit),  where  the  misfit  is  much  larger  and  interfacial  defects  are  unavoidable,  the 
nanoheteroepitaxial  structure  is  shown  to  reduce  the  formation  and  propagation  of  threading  defects. 
Nanostructured  substrate  parameters  that  impact  growth  are  discussed  and  interferometric 
lithography  is  introduced  as  a  method  for  fabrication  of  large-area  substrates  for  nanoheteroepitaxy. 
Si  nanoisland  diameters  as  small  as  20  nm  are  demonstrated.  Scanning  and  transmission  electron 
microscopy  data  of  GaN  grown  on  Si  (via  organometallic  vapor  phase  epitaxy)  shows  reduced 
threading  defects  in  nanostructured  samples  compared  to  growth  on  planar  substrates. 
Photoluminescence  intensity  data  of  nanostructured  samples  is  enhanced  by  —100  X  as  compared  to 
planar-growth  samples.  ©  2000  American  Vacuum  Society.  [S0734-211X(00)  11206-5] 


I.  INTRODUCTION 

The  heterogeneous  integration  of  compound  semiconduc¬ 
tors  and  devices  onto  Si  substrates  is  a  long  sought  goal  that, 
if  achieved,  would  have  a  major  impact  on  the  functionality 
of  integrated  circuits  by  combining  improved  speed,  noise, 
and  optical  performance  with  the  massive  density  of  Si  very 
large  scale  integration.  However  one  of  the  major  obstacles 
to  achieving  this  goal  has  been  the  lack  of  a  metallurgically 
compatible  (lattice  parameter,  chemical  interactions,  and 
thermal  expansion)  compound  semiconductor  with  suitable 
optical  and  electronic  properties  and  a  mature  processing 
technology.  Primarily  due  to  its  processing  maturity  and  su¬ 
perior  optical  and  electronic  properties,  much  research  has 
focused  on  developing  a  strategy  for  integrating  GaAs  with 
Si  although  this  system  has  a  lattice  mismatch  of  —4%,  ther¬ 
mal  expansion  mismatch  of  —60%,  and  a  polarity 
mismatch.1  By  far  the  dominant  issue  in  the  monolithic  inte¬ 
gration  of  GaAs  on  Si  substrates  has  been  the  formation  and 
propagation  of  large  densities  of  line  and  areal  defects,  prin¬ 
cipally  dislocations,  stacking  faults,  and  cracks,  arising  from 
the  lattice  mismatch.2  While  material  quality  and  device  per¬ 
formance  has  improved  over  the  years,  a  persistent  problem 
has  been  poor  long-term  reliability  due  to  instability  of  non- 
radiative  defects  that  propagate  during  room  temperature  cw 
operation,  especially  in  devices  such  as  lasers  where  there  is 
significant  heat  generation  during  device  operation.3,4 

Recently  developed  heterogeneous  integration  strategies 
that  address  mismatch  issues  have  resulted  in  a  renewed  re¬ 
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search  emphasis  on  integrating  compound  semiconductors 
with  Si.  For  example,  compliant5,6  substrates  mitigate  strain 
energy  buildup  by  partitioning  the  strain  between  the  epil- 
ayer  and  the  substrate.  Epitaxial  lateral  overgrowth7,8  reduces 
defect  density  by  changing  growth  direction  and  eliminating 
the  atomic  connection  between  the  two  materials.  Nanohet¬ 
eroepitaxy  (NHE)  is  a  fundamentally  new  approach  that 
takes  advantage  of  three-dimensional  stress  relief  mecha¬ 
nisms  available  to  nanometer  sized  heterostructures  to  reduce 
strain  energy.9,10  In  NHE,  growth  is  selectively  initiated  on  a 
dense  array  of  nanosized  nuclei  [Fig.  1(b)].  After  a  thin  layer 
is  grown,  typically  less  than  100  nm,  the  epilayer  becomes 
strain  free  [see  Fig.  1(b)].  At  this  point,  growth  conditions 
are  modified  to  encourage  lateral  growth  and  coalescence  of 
islands  into  a  continuous  epitaxial  film.  Theory  shows  that 
for  lattice  mismatches  up  to  4.5%  (this  range  includes  many 
of  the  important  III-V  on  Si  combinations  including  GaAs), 
the  strain  energy  can  be  below  that  required  to  form  defects 
for  readily  achievable  nucleus  dimensions.  Even  in  more 
highly  mismatched  systems,  such  as  GaN  on  Si  (20%  misfit), 
the  local  presence  of  free  surfaces  in  the  novel  NHE  structure 
impacts  the  character  of  the  defects  and  leads  to  improved 
material  quality. 

II.  THEORY  OF  NHE 

The  additional  stress  relief  mechanisms  available  to 
nanoscale  islands  as  compared  to  a  conventional  planar  ep¬ 
ilayer  are  illustrated  in  Fig.  1.  In  a  conventional  planar  struc¬ 
ture  [Fig.  1(a)],  the  epilayer  can  elastically  deform  vertically 
(i)  to  relieve  part  of  the  mismatch  stress.  However,  because 
only  part  of  the  stress  is  relieved,  the  strain  energy  in  the 
epilayer  increases  linearly  with  increasing  epilayer  thickness 
up  to  the  point  where  it  becomes  energetically  favorable  to 
create  interfacial  misfit  dislocations  (v).  In  contrast,  in 
nanoscale  islands  [Fig.  1(b)]  the  substrate  and  epilayer  are 
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Fig.  2.  Strain  energy  as  a  function  of  epilayer  thickness  in  the  GaAs/Si 
material  system  for  conventional  growth,  compliant  substrate  growth,  and 
NHE  growth.  Only  the  NHE  growth  (40  nm  diam  islands)  is  below  the 
screw  dislocation  nucleation  threshold  (also  shown)  for  all  him  thicknesses. 


Fig.  1 .  Schematic  comparison  of  conventional  (large-area)  heteroepitaxy  (a) 
and  nanoheteroepitaxy  (b).  Nanoheteroepitaxy  allows  3D  strain  relief 
mechanisms  that  are  unavailable  in  large-area  growth. 


able  to  deform  vertically  (i)  and  laterally  (ii).  Moreover,  the 
strain  decays  exponentially  away  from  the  interface  in  both 
materials.  This  helps  to  reduce  both  the  strain  in  the  epilayer 
and  the  strain  energy  in  each  nanoisland.  In  many  cases, 
strain-free  material  is  achieved  after  only  —100  nm  of 
growth.  If  mismatch  dislocations  are  formed  in  the  nanois¬ 
land,  they  can  glide  or  climb  to  an  island  edge  (iii),  assuming 
the  glide  plane  is  suitably  oriented.  In  conventional  planar 
heteroepitaxy,  dislocations  are  unlikely  to  reach  the  sample 
edge  and  instead  nucleate  threading  defects  (iv). 


A.  Calculation  of  the  strain  energy 

The  areal  strain  energy  associated  with  a  film  of  thickness 
/zepi  in  conventional  planar  epitaxy  is  given  by11 

Y  2 

^CON-  |  _  v  eApi  >  (1) 


where  Y  is  Young’s  modulus,  v  is  Poisson’s  ratio,  and  sT 
=  2|aepi— asub|/(aepi+^sub)  is  the  lattice  mismatch.  <2epi  and 
asub  are  the  equilibrium  lattice  parameters  of  the  epilayer  and 
substrate.  Due  to  strain  partitioning  (and  neglecting  any 
bending  stresses)  the  areal  strain  energy  in  planar  compliant 
substrates  is  the  sum  of  the  strain  energies  in  the  epilayer  and 
substrate,  given  by5 
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where  £epi  and  ssub  are  the  partitioned  strain  in  the  epilayer 
and  substrate  and  hsuh  is  the  thickness  of  the  substrate.  sepi 
and  esub  are  functions  of  the  elastic  moduli  and  thickness  of 
the  epilayer  and  substrate.12 


The  areal  strain  energy  in  NHE  structures  is  the  sum  of 
the  strain  energies  in  the  epilayer  and  substrate  due  to  strain 
partitioning,  as  in  the  compliant  substrate  case.  However,  the 
strain  energy  calculation  must  also  take  into  account  strain 
decay  in  the  NHE  structure  [see  Fig.  1(b)].  The  areal  strain 
energy  in  each  nanoisland  was  previously9  shown  to  be 
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where  £eipo  and  £sub()  are  the  strain  in  the  epilayer  and  sub¬ 
strate  materials  at  the  heterointerface,  respectively,  and  are 
called  the  interfacial  partitioned  strains.  The  parameters 
and  /z  j[lb  in  Eq.  (3)  account  for  the  strain  decay  and  describe 
the  effective  thickness  of  material  that  is  contributing  to  the 
integrated  strain  energy  in  the  epilayer  and  substrate,  respec¬ 
tively. 

A  plot  of  strain  energy  as  a  function  of  epilayer  thickness 
for  different  substrate  configurations  is  given  in  Fig.  2  for  the 
case  of  GaAs  growth  on  silicon.  Also  shown  in  Fig.  2  is  the 
energy  associated  with  the  formation  of  a  screw  dislocation. 
The  curve  labeled  “Conventional”  is  for  a  1  mm  thick  pla¬ 
nar  substrate.  Thin  planar  substrates  that  enable  compliance 
and  strain  partitioning  effects  are  represented  by  the  curve 
labeled  “Compliant.”  These  substrates  were  assumed  to 
have  a  compliant  layer  thickness  of  50  nm.  The  nanohet- 
eroepitaxial  configuration  is  represented  by  the  curve  labeled 
“NHE  model.”  The  energy  associated  with  a  screw  disloca¬ 
tion  increases  as  the  logarithm  of  the  epilayer  thickness  up  to 
the  point  where  the  epilayer  thickness  is  equal  to  the  radius 
of  the  island  (/  =  20nm).  Beyond  this  point,  the  screw  dis¬ 
location  energy  is  constant  and  controlled  by  /  because  the 
edge  of  the  sample  is  the  nearest  free  surface  and  the  dis¬ 
tance  to  this  free  surface  controls  the  screw  dislocation  en¬ 
ergy.  The  strain  energy  in  the  thick  planar  substrate  ( hsuh 
=  1  mm)  increases  linearly  with  increasing  epilayer  thick¬ 
ness.  Notice  that  even  at  an  epilayer  thickness  of  /zepi 
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=  1  nm,  the  strain  energy  in  the  conventional  substrate  is 
equal  to  or  greater  than  the  screw  dislocation  energy.  This 
means  that  it  is  energetically  favorable  to  create  dislocations 
in  such  epilayers.  Although  thin  compliant  substrates  ( hsuh 
=  50nm)  are  designed  to  reduce  the  strain  energy,  the  en¬ 
ergy  reduction  in  these  substrates  is  insufficient  to  eliminate 
dislocations.  Moreover,  at  epilayer  thicknesses  below  2  nm, 
the  thin  substrate  is  incompliant  and  behaves  much  like  a 
conventional  substrate. 

In  contrast,  the  strain  energy  in  the  NHE  configuration 
(2  /  =  40  nm,  hsu b=  50  nm)  is  below  the  screw  dislocation  en¬ 
ergy  at  all  epilayer  thicknesses,  making  it  energetically  un¬ 
favorable  for  screw  dislocations  to  form.  For  thick  NHE 
films,  the  strain  energy  reaches  an  asymptote,  below  the 
screw  dislocation  energy,  as  the  film  relaxes  to  its  normal 
lattice  constant.  Throughout  the  thickness  range,  the  strain 
energy  of  the  NHE  sample  is  less  than  the  screw  dislocation 
energy,  indicating  that  GaAs  of  arbitrary  thickness  can  be 
coherently  grown  on  silicon,  provided  the  surface  is  nano- 
structured  with  islands  having  a  diameter  of  ^40  nm. 

Equation  (3)  can  be  greatly  simplified  for  the  case  when 
/zepi  and  hsuh  are  >1  and  K=  1.  This  simplified  equation  can 
then  be  compared  to  criteria  for  coherent  growth  on  planar 
substrates  such  as  the  criteria  put  forth  by  People  and  Bean11 
for  the  critical  thickness  hc ,  at  which  it  becomes  energeti¬ 
cally  favorable  for  dislocations  to  be  formed  in  strained  ep¬ 
itaxial  layers.  [K=  1  implies  that  Yevi/(l  —  vevi)  =  Ysuh/(l 
—  vsuh)  =  Y/(  1  —  v)].  In  this  limit,  the  interfacial  partitioned 
strains  reduce  to 
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The  prefactors  [1  — sech(k/)]  in  and  hf*b  normally  vary 
from  0.78  to  0.91  corresponding  to  Poisson’s  ratio  varying 
from  0.1  to  0.5  and  can  be  taken  as  —  1  as  a  worst  case 
approximation.9  In  addition,  the  factors  [1  —  exp(  —  irh/l)\  re¬ 
duce  to  unity  for  hepi  and  hsub  large.  This  simplifies  the  ef¬ 
fective  thicknesses  to 
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Fig.  3.  Theoretical  critical  layer  thickness  hc  and  critical  island  diameter  2  lc 
of  Ge^Sii-*  alloys  grown  on  silicon  vs  misfit.  Also  shown  is  the  ratio 
2  lclhc. 
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where  b  is  the  magnitude  of  the  Burger’s  vector,  a  is  the 
lattice  parameter,  and  l  has  been  replaced  by  lc  to  represent 
the  critical  island  radius  beyond  which  it  is  energetically 
favorable  to  generate  dislocations.  Figure  3  compares  2 lc  to 
hc  as  a  function  of  misfit  using  the  Ge^Sq^  on  Si  material 
system  as  an  example.  Also  shown  is  the  ratio  of  2 lc/hc 
which  is  obtained  by  dividing  Eq.  (7)  by  hc : 


2lc  _  A  _  ln(lc/b) 

l^  =  47THhJb)' 


(8) 


Both  2 lc  and  hc  decrease  as  the  lattice  mismatch  increases. 
However,  2  lc  is  between  15  and  30  times  larger  than  hc 
depending  on  the  misfit.  Figure  3  shows  how  the  lateral  di¬ 
mensions  accessible  by  lithography  can  be  used  to  control 
the  strain  energy  and,  thereby,  the  quality  of  growth  in  the 
vertical  direction.  This  gives  a  wholly  new  degree  of  free¬ 
dom  in  hetero structure  growth. 


Equation  (3)  can  now  be  written  in  the  simplified  form 


^NHE- 


(6) 


Equation  (6)  shows  that  the  strain  energy  in  the  NHE  sample 
is  finite  and  proportional  to  the  island  radius  /.  Equation  (6) 
describes  the  asymptote  that  the  NHE  approaches  on  the 
right  edge  of  Fig.  2. 

The  energy  balance  approach  taken  by  People  and  Bean11 
for  determining  the  onset  of  misfit  dislocation  states  “that 
interfacial  misfit  dislocations  will  be  generated  when  the  ar¬ 
eal  strain  energy  exceeds  the  self-energy  of  an  isolated  dis¬ 
location.”  Equating  Eq.  (6)  and  area  energy  density  associ¬ 
ated  with  an  isolated  screw  dislocation9  yields 


B.  Defect  management 

The  dashed  line  in  Fig.  3  is  an  extrapolation  of  the  2  lc 
curve  and  it  indicates  that  for  material  systems  with  lattice 
mismatch  greater  than  —8%,  the  critical  diameter  drops  be¬ 
low  —  10  nm  and  will  be  increasingly  difficult  to  achieve  for 
large-area  substrates  within  current  lithographic  and  process 
capabilities.  However,  one  crucial  aspect  of  NHE  theory  is 
that  the  strain  decays  exponentially  away  from  the  interface 
into  both  the  epilayer  and  substrate  with  characteristic  length 
2// 77  [see  Fig.  1(b)].  This  means  that  all  of  the  strain  energy 
is  contained  in  a  region  near  the  interface.  Material  grown 
beyond  this  region  is  essentially  stress-free  inhibiting  the  for¬ 
mation  of  defects  in  epitaxial  layers  not  only  during  growth, 
but  also  when  device  fabrication  and  operation  stress  the 
material.  This  is  potentially  important  for  high  performance 
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Table  I.  Aspects  of  nanostructure  patterning  controllable  by  interferometric 
lithography. 


Nanoisland 

parameter 

NHE  impact 

IL  approach 

Accessible 

range 

Pattern 

periodicity 

Film  coalescence 

Incident  angles 
{p  =  X/[2  sin(0)]}, 
nonlinear  IL  and 
integration  of  optical 
and  interferometric 
lithographies 

>75  nm 

Size 

Island  growth, 
strain  partitioning, 
active  compliance 

Vary  exposure  flux, 
use  photoresist 
and  other  process 
nonlinearities. 

to  20  nm 

Pattern 

Film  coalescence 

Number  and 

Rectangular, 

symmetry 

orientation  of  beams 

hexagonal,... 

Localized 

Thermal  expansion 

Integration  of 

Fully  arbitrary 

growth  areas 

strain,  integration 

conventional  optical 

to  optics  reso¬ 

with  Si  circuitry 

and  IL,  hard  masks. 

lution  limits 

and  long  term  reliability  of  devices  under  high  power  or  high 
current  density  applications.  Moreover,  in  cases  where  inter¬ 
facial  defects  cannot  be  avoided  (e.g.,  GaN  on  Si),  the  geo¬ 
metrical  structure  of  NHE  provides  free  surfaces  close  to  the 
interface  suitable  for  interfacial  dislocation  termination. 
(This  is  in  contrast  to  a  planar  structure  which  provides  a  free 
surface  essentially  only  at  the  top  of  the  epilayer.)  This  helps 
to  inhibit  the  formation  of  threading  defects  since  disloca¬ 
tions  must  terminate  upon  themselves  in  a  loop,  on  another 
dislocation,  or  at  a  free  surface.13  These  defect  management 
effects  are  clearly  observed  in  the  growth  of  GaN  on  nano- 
structured  (111)  Silicon  on  insulator  (SOI)  as  presented  in 
Sec.  IV. 

III.  NANOSTRUCTURED  SUBSTRATE 
PARAMETERS  FOR  NHE 

Clearly,  island  diameter  is  an  important  parameter  in 
NHE.  However,  other  parameters  such  as  island  spacing  or 
pitch  are  also  expected  to  play  an  important  role.  Table  I  lists 
four  nanostructured  substrate  parameters  along  with  how 
they  can  be  controlled  using  interferometric  lithography 
(IL), 14-17  an  inexpensive,  large-area,  highly  uniform  pattern¬ 
ing  technology. 

Island  spacing  or  pitch  is  also  a  critical  parameter  for 
NHE  since  it  impacts  the  growth  rate  at  the  island  and  deter¬ 
mines  the  lateral  dimension  needed  for  coalescing  the  islands 
into  a  continuous  layer.  We  have  observed  for  example  that 
reducing  the  pitch  from  900  to  360  nm  shortened  the  growth 
time  and  improved  coalescence  of  GaN  grown  on  nanostruc¬ 
tured  SOI  substrates.  The  limiting  pitch  for  conventional  IL 
is  — X/2.  To  date,  most  of  our  experiments  have  been  carried 
out  at  wavelengths  of  364  nm  (Ar  ion  laser)  or  355  nm  [third 
harmonic  of  yttrium-aluminum-garnet  (YAG)]  with  a  lim¬ 
iting  period  of  —180-200  nm.  This  wavelength  range  was 
chosen  to  be  compatible  with  well-developed,  commercially 
available  i- line  photoresists.  With  the  advent  of  availability 
of  193  nm  resists  developed  for  ArF  based  photolithography, 
we  are  extending  our  experiments  to  —125  nm  pitches  with  a 
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Fig.  4.  Square  and  hexagonal  arrays  of  —20  nm  diam  SOI  nanoisland 
samples  (360  nm  pitch)  fabricated  by  interferometric  lithography. 


fifth  harmonic  YAG  source  at  213  nm.  Nonlinear  processes18 
to  further  extend  lithographic  capabilities  to  higher  density 
(smaller  pitch)  patterns  have  also  been  demonstrated.  The 
basic  idea  follows  from  the  realization  that  the  fundamental 
limit  of  IL  is  on  the  pitch,  not  on  the  linewidth  or  critical 
dimension  (CD).  First  a  periodic  array  of  small  features  with 
CD<p/4  with  p  the  pitch  is  transferred  into  a  sacrificial  layer 
(e.g.,  a  Si3N4  film).  In  a  second  lithographic  step,  an  identical 
pattern  is  defined  with  a  phase  shift  of  p/2  and  transferred 
into  the  sacrificial  layer.  The  result  is  a  pattern  with  one-half 
the  pitch,  doubling  the  density. 

Pattern  symmetry  may  also  play  an  important  role  in  coa¬ 
lescence.  For  example,  a  hexagonal  symmetry  is  expected  to 
yield  superior  coalesce  of  wurtzite  GaN  crystals  compared  to 
a  rectangular  symmetry.  Pattern  symmetry  can  be  controlled 
by  varying  the  exposure/development  and  pattern  transfer 
parameters.  Complex  two-dimensional  (2D)  patterns  can  be 
generated  in  a  single  exposure  with  multiple  input  beams. 
Specifically,  three-beam  and  three  exposure  geometries  lead 
to  hexagonal  symmetries  and  four  and  five  beam  geometries 
give  2D  hole  arrays.19  Combined  with  multiple  resist  tonali¬ 
ties,  image  reversal,  multilayer  resists,  and  the  possibilities 
created  with  additional  sacrificial  hard-mask  layers,  there  is  a 
very  large  flexibility  to  provide  a  wide  variety  of  nanostruc¬ 
ture  geometries.  Figure  4  shows  nanostructured  SOI  sub¬ 
strates  with  rectangular  and  hexagonal  symmetries  with  —20 
nm  diam  islands. 

The  ability  to  define  localized  growth  areas  will  address 
issues  related  to  thermal  expansion  mismatch  as  well  as  in¬ 
tegration  with  silicon  circuitry.  Isolated  areas  of  nanostruc¬ 
tures  can  be  generated  by  combining  IL  with  conventional 
optical  lithography  to  define  larger  areas. 

A  major  advantage  of  IL  is  its  scalability  and  low  cost 
manufacturability.  In  contrast  to  e-beam  lithography  ap¬ 
proaches,  which  are  able  to  reach  these  same  scales,  IL  offers 
a  critical  parallel-printing,  large-area  capability.  Applying 
the  large-field,  parallel-printing  advantages  of  optical  lithog¬ 
raphy,  IL  easily  provides  macroscopic  sample  areas  (^1- 
100  cm2)  with  a  high  density  of  nanostructures.  This  will  be 
critical  both  for  optimization  of  NHE  across  a  broad  param¬ 
eter  space  and  for  eventual  device  fabrication. 
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Fig.  5.  Plan  view  SEM  images  of  —140  nm  thick  GaN  epitaxial  samples 
grown  on:  (a)  planar  (lll)Si;  (b)  planar  (111)  SOI;  and  (c)  on  a  —70  nm 
diam  island  nonostructured  (111)  SOI  sample. 


Fig.  6.  Plan  view  SEM  image  of  —400  nm  thick  GaN  samples  grown  on:  (a) 
planar  (lll)Si;  (b)  planar  (111)  SOI;  and  (c)  nanostructured  (111)  SOI 
samples. 


IV.  RESULTS 

An  initial  investigation  of  NHE  was  carried  out  using  or- 
ganometallic  vapor  phase  epitaxy  growth  of  GaN  on  nano¬ 
structured  (111)  SOI  wafers  manufactured  by  the  separation 
by  implantation  of  oxygen  process.  For  comparative  pur¬ 
poses,  GaN  was  also  grown  on  planar  (111)  silicon  and  pla¬ 
nar  (111)  SOI.  The  general  procedure  consisted  of  first  nano¬ 
structuring  the  surface  silicon  layer  of  the  SOI  substrate.  This 
was  followed  by  cleaning  the  substrates  to  remove  impurities 
and  native  oxide.  All  three  types  of  substrates  were  then 
loaded  into  the  growth  chamber.  GaN  was  then  grown  on  the 
samples  using  a  three-step  sequence  consisting  of:  (1)  an  in 
situ  clean  at  a  temperature  of  878  °C  for  5  min  under  a 
hydrogen/nitrogen  atmosphere;  (2)  a  low  temperature 
(543  °C)  nucleating  layer  deposition;20  and  (3)  the  main  ep- 
ilayer  growth  at  high  temperature  (1031  °C).  The  samples 
were  characterized  using  scanning-electron  microscopy 
(SEM),  transmission-electron  microscopy  (TEM),  and  pho¬ 
toluminescence  (PL). 

Figure  5  shows  plan  view  SEM  micrographs  of  GaN 
grown  to  a  nominal  thickness  of  140  nm  on  the  three  types  of 
substrates.  The  SEM  image  of  GaN  grown  on  the  planar 
(111)  Si  [Fig.  5(a)]  clearly  shows  etch  pits  and  dark  spots 
indicative  of  crystal  defects.  The  GaN  grown  on  the  planar 
(111)  SOI  [Fig.  5(b)]  also  has  etch  pits  and  a  connected 
network  of  dark  lines  indicative  of  grain  boundaries.  In  con¬ 
trast,  Fig.  5(c)  shows  an  array  of  truncated  six-sided  pyramid 
shaped  crystals  typical  of  wurtzite  GaN.  No  grain  boundaries 
or  etch  pits  are  observed  indicating  that  the  array  is  com¬ 
posed  of  high  quality  single  crystals.  The  diameter  of  the 
crystals  is  approximately  300  nm. 

GaN  was  grown  to  a  nominal  thickness  of  400  nm  in 
order  to  coalesce  the  array  of  GaN  crystals  into  a  continuous 
layer.  Figure  6(a)  is  a  plan  view  SEM  of  GaN  grown  on 
planar  (111)  Si  while  Fig.  6(b)  shows  GaN  grown  on  planar 
(111)  SOI.  These  SEMs  look  similar  to  those  of  the  corre¬ 
sponding  140  nm  samples.  In  contrast,  Fig.  6(c)  shows  evi¬ 
dence  that  significant  (although  still  incomplete)  coalescence 
has  occurred  after  400  nm  of  growth  for  the  NHE  sample. 
Figure  7  shows  cross-sectional  diffraction-contrast  images  of 
the  three  types  of  samples  after  400  nm  of  growth.  Figure 
7(a)  clearly  shows  threading  dislocations  in  the  GaN  layer 
while  no  dislocations  are  observable  in  the  silicon  substrate. 
The  GaN  grown  on  the  planar  (111)  SOI  is  also  highly  de¬ 
fected,  however  the  defect  character  is  different  as  shown  in 


Fig.  7(b).  The  defects  in  the  SOI  sample  appear  to  be  mis- 
oriented  grain  boundaries  with  possible  cubic  GaN 
nanophases  at  the  interface.  Figure  7(c)  is  a  cross-sectional 
image  of  the  coalescence  region.  Small  looped  defects  can  be 
observed  near  the  coalescence  region.  While  grain  bound¬ 
aries  were  found  in  other  coalescence  regions,  this  result  is 


GaN 

(a) 

Si 

200  nm 

Fig.  7.  Cross-section  diffraction-contrast  images  of  the  same  samples  as 
shown  in  Fig.  6. 
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Fig.  8.  Room  temperature,  band  gap  PL  intensity  of  GaN  grown  on  the 
substrates  indicated  as  a  function  of  GaN  thickness.  Note  both  the  much 
larger  intensity  for  the  NHE  sample  and  the  decay  in  the  PL  intensity  for  the 
bulk  film  cases  indicating  an  increase  in  nonradiative  defects  for  the  thicker 
growths.  The  PL  intensity  for  a  GaN  on  sapphire  sample  is  also  shown. 


important  because  it  establishes  that  it  is  possible  to  achieve 
coalescence  without  grain  boundaries  or  defects  propagating 
through  the  epilayer. 

Room  temperature  PL  intensity  results  at  the  —365  nm 
band  edge  are  plotted  in  Fig.  8  for  GaN  grown  on  nanostruc- 
tured  and  planar  substrates.  The  244  nm  wavelength  pump 
power  density  was  —700  mW/cm2.  The  data  show  that  the 
PL  intensity  on  the  nanostructured  substrate  was  not  only 
brighter  (by  almost  2  orders  of  magnitude),  but  its  brightness 
remained  fairly  constant  with  him  thickness  while  the  inten¬ 
sity  on  the  planar  (111)  silicon  and  (111)  SOI  substrates  was 
much  weaker  and  decreased  with  increasing  thickness  above 
140  nm.  PL  spectra  also  showed  almost  no  mid-gap  PL 
(‘‘yellow  band”)  for  the  NHE  samples  while  this  band  was 
much  more  intense  in  the  bulk  silicon  and  SOI  samples.  A 
sample  with  surface  relief  such  as  the  NHE  sample  is  ex¬ 
pected  to  be  brighter  than  a  sample  with  a  smooth  planar 
surface  because  the  scattering  from  the  structure  reduces  the 
impact  of  total  internal  reflection  for  a  high-index  material 
such  as  GaN.  However,  these  effects  should  be  decreased  for 
the  thicker,  more  coalesced,  sample.  The  fact  that  the  inten¬ 
sity  remained  relatively  constant  in  the  thickness  range  from 
140  to  400  nm  in  the  NHE  samples  suggests  that  the  brighter 
intensity  is  not  solely  attributable  to  these  scattering  effects 
but  also  reflects  a  high  quantum  efficiency  for  the  NHE 
sample. 

The  decrease  in  the  intensity  for  the  planar  (111)  silicon 
and  SOI  substrates  is  possibly  due  to  increased  defect  density 
and  accompanying  nonradiative  recombination  in  these  ma¬ 
terials  due  to  the  increased  stresses  characteristic  of  growth 
on  misfit  planar  structures.  In  contrast,  the  fact  that  the  in¬ 
tensity  in  the  nanostructured  sample  is  much  stronger  and 
remains  relatively  constant  indicates  that  this  material  is  of 
higher  quality  and  does  not  become  defected  as  the  thickness 
increases.  This  is  supported  by  SEM  and  TEM  data.  The  PL 
data  support  nanoheteroepitaxial  theory,  which  states  that  the 
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material  is  essentially  stress  free  beyond  the  characteristic 
length  of  decay.  In  addition,  the  PL  intensity  of  the  nano¬ 
structured  sample  was  also  —25  times  brighter  as  compared 
to  that  of  a  thick  GaN  him  grown  on  planar  sapphire21  as  is 
also  shown  in  Fig.  8. 

V.  SUMMARY 

We  have  introduced  a  novel  heterostructure  growth  tech¬ 
nique,  nanoheterepitaxy,  which  shows  great  promise  in  gain¬ 
ing  fundamental  insights  into  nanocry stalline  growth  and  ad¬ 
vancing  toward  the  grand  challenge  of  integrating  additional 
functionality  with  silicon  circuitry.  The  critical  island  diam¬ 
eter  2 lc  has  been  established  to  be  —(15-30 )hc  .  The  prox¬ 
imity  of  the  free  surface  at  the  island  edge  makes  the  nucle- 
ation  and  propagation  of  defects  fundamentally  different 
from  the  traditional  growth  case  where  the  top  of  the  grow¬ 
ing  him  is  the  closest  free  surface. 

Interferometric  lithography  is  proposed  as  a  flexible  and 
cost-effective  technique  for  fabricating  the  large-area  uni¬ 
form  nanoisland  substrate  arrays  required  by  NHE.  SOI  is¬ 
land  diameters  down  to  20  nm  on  a  360  nm  pitch  are  dem¬ 
onstrated  using  IL  and  nonlinear  develop/etch  processes. 

Initial  experiments  of  NHE  growth  of  GaN  on  SOI  shows 
an  improvement  in  material  quality  versus  growth  on  planar 
substrates.  The  SEM  and  TEM  results  show  a  dramatic  re¬ 
duction  in  threading  defect  concentration  and  show  promise 
for  coalescence  of  the  GaN  nanoislands  into  a  continuous 
him.  PL  measurements  show  that  the  emission  from  the 
NHE  samples  is  much  stronger  (—  100X)  and  exhibits  much 
less  midgap,  defect-related  emission. 

These  preliminary  results  are  very  encouraging  and  sug¬ 
gest  that  NHE  may  play  a  role  in  advancing  toward  this 
semiconductor-materials  challenge.  Much  more  work  is  nec¬ 
essary  to  completely  assess  the  potential  of  this  new  tech¬ 
nique  over  a  variety  of  heterostructure  material  systems,  and 
to  extend  these  results  from  early  materials  studies  to  device 
investigations. 
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Initial  growth  studies  of  GaAs  on  an  array  of  Si  islands  nanostructured  on  (100) 
oriented  silicon-on-insulator  substrates  show  that  growth  occurs  through  a 
mixture  of  selective-area  and  3D  growth  modes.  An  optimum  initiation  growth 
temperature  must  tune  the  growth  conditions  to  the  geometry  of  the  seed  array 
so  that  selective-area  control  is  maintained  while  defect  density  is  minimized. 
The  optimum  temperature  for  a  square  array  of  Si  islands,  500  nm  in  pitch,  and 
100  nm  to  280  nm  in  diameter,  is  ~600°C.  This  temperature  yields  single-crystal 
nucleation  on  each  Si  island  while  maintaining  selective-area  growth  mode 
control.  Transmission  electron  microscope  (TEM)  analysis  of  optimized  and  non- 
optimized  grown  GaAs/Si  heterostructures  show  that  they  accommodate 
0.4-0. 7%  strain.  Further  reduction  in  stacking-fault  defects  attributed  to  side 
wall  growth  may  be  possible  through  masking  of  side  wall  or  annealing. 

Keywords:  Heteroepitaxy,  GaAs  on  Si,  growth  initiation,  lattice  mismatch 


INTRODUCTION 

Nanoheteroepitaxy  has  recently  been  proposed  as  a 
technique  for  significantly  reducing  defect  density  in 
lattice  mismatched  heteroepitaxy. 12  Nanoheteroepit¬ 
axy  incorporates  a  number  of  defect  reduction  tech¬ 
niques  including:  substrate  patterning,  lateral  epi¬ 
taxial  overgrowth,3-5  strain  partitioning  in  thin  com¬ 
pliant  substrates,67  and  epitaxial  necking,8  which 
exploits  free  surfaces  to  terminate  dislocations.  In 
addition,  nanoheteroepitaxy  exploits  three-dimen¬ 
sional  stress  relief  mechanisms  that  are  available 
only  in  nanoscale  heterostructures,  where  the  strain 
energy  density  decays  rapidly  away  from  the  inter¬ 
face.9  For  example,  nanoheteroepitaxy  theory  pre¬ 
dicts  that  growth  of  GaAs  on  Si  (~4%  misfit)  should 
support  a  coherent  interface  provided  the  island  size 
is  below  the  critical  island  diameter  (21c)  of  40  nm.2 
Nanoheteroepitaxy  addresses  highly  lattice  mis¬ 
matched  and  polarity  mismatched  systems  that  are 
expected  to  grow  in  Stranski-Krastanov  or  Volmer- 
Weber  mode.  One  example  is  the  growth  of  GaAs  on  Si 
which  becomes  three-dimensional  (3D)  at  an  early 
stage  of  film  growth  (<7  nm)  to  minimize  the  strain 
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energy.10  3D  growth  is  usually  undesirable  because  it 
leads  to  morphological  problems  and  one  approach  to 
limit  its  effects  has  been  to  employ  a  low-temperature 
buffer  layer  on  planar,1112  patterned,13  and  porous14 
substrates  to  achieve  smooth  continuous  films.  This 
approach  has  been  successfully  applied  to  GaN 
heteroepitaxy,15  where  atomic-force  microscopy  mea¬ 
surements  show  that  a  low  temperature  buffer  is  also 
3D  but  on  a  very  fine  scale.  This  finer  scale  of  3D 
growth  is  presumably  related  to  the  reduced  surface 
mobility  of  the  growth  species  at  the  lower  tempera¬ 
ture.  However,  a  tradeoff  appears  to  exist  between 
low  defect  density  and  smooth  film  morphology,10 
where  the  finer  scale  3D  growth  results  in  a  smoother 
film  but  can  cause  a  higher  defect  density  as 
misorientation  between  the  grains  will  lead  to  dislo¬ 
cations  and  low  angle  grain  boundaries  when  they 
coalesce.  In  nanoheteroepitaxy,  the  seed  diameter  is 
in  the  nanoscale,  in  the  range  of  the  domain  size  of 
crystals  grown  at  normal  temperatures.  Therefore,  it 
may  be  possible  to  grow  an  ordered  array  of  single¬ 
crystal  and,  perhaps,  coherent  GaAs  crystals  on  Si 
using  nanoheteroepitaxy.  This  paper  describes  obser¬ 
vations  of  the  initial  stages  of  nanoheteroepitaxial 
growth  of  GaAs  on  nanoscale-patterned  silicon-on- 
insulator  substrates. 
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EXPERIMENTAL 

Epitaxial  GaAs  was  grown  on  nanostructured  sili¬ 
con  islands  patterned  on  (100)  oriented  silicon-on- 
insulator  substrates  using  organometallic  vapor-phase 
epitaxy.  Four  stages  of  the  growth  process  were  iden¬ 
tified  as  (1)  growth  initiation,  (2)  lateral  growth,  (3) 
coalescence,  and  (4)  planarization.  This  study  empha¬ 
sizes  stages  (1),  (2)  and,  in  particular,  addresses  the 
effect  that  growth  initiation  has  on  the  quality  of 
GaAs  during  lateral  growth. 

The  surface  silicon  layer  of  (100)  oriented  silicon- 
on-insulator  substrates  was  patterned  using  inter¬ 
ferometric  lithography,  a  maskless,  large-area  pat¬ 
terning  technology.  For  details  on  interferometric 
lithography  please  refer  to  Ref.  16.  The  initial  thick¬ 
ness  of  the  surface  silicon  layer  was  200  nm.  The 
thickness  of  the  oxide  layer  was  370  nm.  The  surface 
silicon  layer  was  thinned  to  110  nm  by  an  oxidation 
and  hydrofluoric  acid  ( HF )  etching  sequence.  A  square 
array  of  silicon  islands  was  then  patterned  on  the 
surface  silicon  layer.  The  pitch  of  the  array  was  500 
nm  and  the  diameter  of  the  islands  varied  between 
100  and  280  nm  from  batch  to  batch.  Figure  1  contains 
a  plan  view  SEM  image  of  a  nanostructured  ( 100 )sili- 
con-on-insulator  (SOI)  wafer.  The  inset  in  Fig.  1  is  a 
schematic  cross-sectional  view  of  the  nanostructured 
(lOO)SOI  illustrating  the  different  layers  in  the  sub¬ 
strate.  Unintentional  variations  in  processing  caused 
the  shape  of  the  islands  to  vary  from  diamond  to  circular 
(see  Fig.  2);  however,  for  consistency  the  lateral  extent 
or  size  of  the  islands  is  referred  to  as  the  diameter. 

Arsine  and  trimethylgallium  were  used  as  precur¬ 
sors  for  group  V  and  III  sources,  respectively.  Reactor 
parameters  common  to  all  stages  were  the  reactor 
pressure  (100  torr),  the  carrier  gas  (hydrogen),  and 
the  total  gas  flow  rate  (5.9  slm).  The  arsine  flow  rate 
was  kept  constant  at  35  seem  in  all  growth  stages 
(1)  to  (4). 

Outgassing  and  oxide  desorption,  which  is  nor¬ 
mally  performed  at  a  temperature  of  900-1000°C  11 
was  limited  to  860°C  for  10  min  in  hydrogen  due  to 
gross  morphological  changes  that  occur  in 
nanostructured17  and  porous18  silicon  samples  at  tem¬ 
peratures  above  ~1000°C  and  750°C,  respectively. 
The  temperature  was  then  reduced  in  preparation  of 
the  growth  initiation  stage.  In  order  to  avoid  anti¬ 
phase  domain  boundaries,  an  arsenic  terminated 
silicon  surface12  was  created  by  exposing  silicon  to 
arsine  at  a  rate  of  35  seem  for  2  min  immediately  prior 
to  growth  of  GaAs.  In  this  stage,  four  different  growth 
temperatures  were  tested  between  430°C  to  730°C  to 
study  the  effects  of  growth  of  this  stage  on  material 
grown  in  later  stages.  The  trimethylgallium  flow  rate 
was  between  1  and  2.4  seem.  This  gave  a  V/III  molar 
ratio  of  -580  (this  ratio  is  calculated  based  on  the 
input  flows  of  the  constituents.  The  reduced  decompo¬ 
sition  rate  of  arsine  at  the  lower  growth  temperature 
no  doubt  reduced  the  actual  V/III  ratio  during  the 
lower  temperature  growths.)  The  initial  growth  rate 
(stage  1)  was  -15  nm/min.  During  lateral  growth 


Fig.  1 .  Plan  view  SEM  image  of  Si  array  of  nanoislands  patterned  on 
(100)  oriented  silicon-on-insulator.  Inset  is  cross-sectional  schematic 
of  nanostructured  (100)  silicon-on-insulator  substrate. 


Fig.  2.  Plan  view  SEM  images  of  GaAs  nucleation  on  silicon  islands  at 
a  growth  initiation  temperature  of  (a)  433°C,  (b)  51 9°C,  (c)  605°C,  and 
(d)  734°C.  All  scale  bars  indicate  200  nm. 


(stage  2)  the  temperature  was  605°C  and  the 
trimethylgallium  flow  rate  was  8  seem. 

A  Hitachi  S-800  field-emission,  scanning  electron 
microscope  (SEM)  operating  at  20  kV  and  a  JOEL  2010 
transmission  electron  microscope  (TEM)  operating  at 
200  kV  were  used  to  characterize  the  GaAs  films. 

RESULTS 

Figure  2  shows  the  normal  incidence  SEM  images 
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Temperature  (°C) 

Fig.  3.  Average  inter-island  distance  of  GaAs  nuclei  on  planar  Si  and 
migration  length  of  GaAs  on  Si02  versus  temperature. 


Fig.  4.  Plan  view  SEM  image  of  an  array  of  GaAs  crystals  deposited 
on  nanostructured  (100)  oriented  silicon-on-insulator.  Approximately 
1 00  nm  of  material  was  deposited  at  a  temperature  of  605°C.  Coales¬ 
cence  of  two  pairs  of  crystals  occurred  during  GaAs  growth  and  is  not 
due  to  the  patterning. 


of  GaAs  nucleation  behavior  after  approximately  15  nm 
of  deposition  on  nanostructured  silicon  islands  at 
substrate  temperatures  of  Ts  =  433,  519,  605,  and 
734°C.  At  a  low  substrate  temperature  of  433°C, 
multiple  (5  to  8)  GaAs  crystals  with  grain  size  ap¬ 
proximately  25  nm  in  diameter  nucleate  on  each 
silicon  island  as  shown  in  Fig.  2a.  As  the  temperature 
increases  the  number  of  nuclei  per  silicon  island 
decreases,  until  at  605°C  only  one  GaAs  crystal  nucle¬ 
ates  per  silicon  island  (Fig.  2c).  However  if  the  tem¬ 
perature  is  too  high  [734°C  nucleation  occurs  only  on 
some  silicon  islands  (see  Fig.  2d)].  It  appears  that 


there  is  indeed  a  tendency  to  3D  growth;  however,  the 
size  and  spacing  of  the  nuclei  is  influenced  by  surface 
mobility,  which  is  controlled  largely  by  surface  diffu¬ 
sion  and  therefore  growth  temperature.  Figure  3 
plots  data  from  related  experiments,  where  the  aver¬ 
age  inter-island  distance  of  GaAs  nuclei  was  mea¬ 
sured  for  growth  on  planar  Si(100)  as  a  function  of 
temperature.  The  temperature  dependence  of  island 
spacing  is  quite  strong  and  indicates  that  nucleation 
is  diffusion  controlled.10  This  figure  also  plots  the 
surface  migration  length  of  GaAs  on  Si02  from  Ref. 
19.  Clearly,  the  surface  migration  length  of  GaAs  on 
Si02  is  for  all  temperatures  much  larger  than  the 
pitch  of  the  nanostructured  islands  (500  nm)  so  the 
growth  species  has  an  adequate  migration  length  for 
selective  growth  as  indicated  in  Fig.  2.  In  contrast,  the 
average  inter-island  distance  for  GaAs  on  Si  is  of  the 
same  order  as  the  silicon  island  diameter  and  pitch. 
This  is  consistent  with  the  results  of  Fig.  2,  where,  at 
low  growth  temperature,  the  GaAs  has  a  low  mobility 
on  Si  and  multiple  nuclei  are  observed  on  each  sub¬ 
strate  island.  While  at  high  temperature,  the  GaAs 
mobility  is  much  higher  and  the  growth  species  may 
have  too  much  mobility.  For  example,  at  734°C,  the 
average  inter-island  distance  is  1700  nm  and  growth 
selectivity  is  lost  because  the  large  surface  migration 
length  leaves  some  of  the  islands  denuded.  This  sug¬ 
gests  that  there  is  an  optimum  temperature  for  achiev¬ 
ing  selective  growth  in  the  nanoheteroepitaxial  (NHE ) 
process.  Under  optimum  conditions  the  growth  spe¬ 
cies  will  have  adequate  mobility  to  reach  a  Si  island 
and  to  create  a  single  nucleus  on  each  Si  island,  but 
they  will  not  have  excessive  mobility  that  can  leave 
some  Si  islands  without  any  GaAs  at  all.  Based  on 
Figs.  2  and  3,  this  optimum  temperature  is  approxi¬ 
mately  605°C.  Figure  4  shows  an  array  of  GaAs  NHE 
nuclei  grown  on  silicon  islands  after  -100  nm  of 
deposition  at  605°C.  Some  coalescing  can  be  observed 
in  the  image.  Well-aligned  rectangular  faceting  can 
also  be  observed  indicative  of  epitaxy  although  sev¬ 
eral  crystal  habits  are  present. 

Cross-sectional  TEM  analysis  of  NHE  GaAs  nuclei 
on  Si  islands  reveals  that  the  temperature  during 
growth  initiation  has  a  strong  effect  on  the  defect 
structure.  Figure  5  shows  cross-sectional,  diffraction 
contrast  images  taken  along  the  [Oil]  zone  axis.  This 
figure  compares  samples  that  were  all  grown  at  605°C 
during  the  lateral  stage  (stage  2)  but  had  a  different 
temperature  during  the  growth  initiation  stage  ( stage 
1).  All  samples  had  -100  nm  of  GaAs  deposition. 
Figure  5a  (stage  1  at433°C)and  5b  (stage  1  at519°C) 
were  typical  of  several  GaAs/Si  islands  observed  that 
had  a  growth  initiation  at  either  433°C  or  519°C.  In 
the  island  in  Fig.  5a  stacking  faults  can  be  observed. 
Notice  that  moire  fringes  in  the  Si  area,  caused  by  the 
superposition  of  the  GaAs  and  Si  lattices,  can  also  be 
observed.  The  average  period,  x,  of  the  moire  pattern 
was  measured  at  5.33  nm  near  the  right-hand  side  of 
the  Si  island.  At  the  top  GaAs/Si  interface  the  average 
period  was  5.79  nm.  Using  a  simple  analysis,  it  is 
speculated  that  the  periods  correspond  to  lattice  mis- 
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Fig.  5.  Cross-section  diffraction-contrast  images  of  GaAs/Si  heterostructures  after  100  nm  of  GaAs  deposition  at  605°C  during  the  lateral  growth 
stage  but  with  a  growth  initiation  temperature  of  (a)  433°C,  (b)  519cC,  and  (c)  605°C.  Inset  diffraction  patterns  in  (b)  and  (c)  were  taken  from  the 
areas  indicated  by  circles.  No  silicon  island  is  observed  in  (b)  because  the  thinned  sample  was  from  an  area  adjacent  to  the  island.  All  scale  bars 
indicate  70  nm. 


matches  of  e  =  dSi((m/T  =  3.6%  ±  0.1%  at  the  edge  and 
3.3%  ±  0.1%  at  the  interface,  where  dSi[0U]  =  0.192  nm 
is  the  lattice  plane  spacing  of  Si  in  the  [011]  direction. 
Thus  the  materials  are  accommodating  0.4-0. 7% 
strain.  The  diffraction  pattern  in  Fig.  5b  was  taken 
approximately  from  the  area  indicated  by  the  circle 
and  contains  two  sets  of  diffraction  spots  indicating 
that  twinning  has  occurred.  In  addition  a  low-angle 
grain  boundary  runs  diagonally  across  the  island  as 
revealed  by  difference  in  contrast  in  the  GaAs.  In 
contrast,  the  material  with  a  growth  initiation  tem¬ 
perature  of  605°C  (Fig.  5c)  had  no  low-angle  grain 
boundaries  although  stacking  faults  can  still  be  ob¬ 
served.  Figure  5c  contains  two  diffraction  patterns. 
One  pattern  taken  approximately  from  the  defected 
area  contains  streaks,  confirming  that  the  defects  are 
stacking  faults.  Another  diffraction  pattern  taken 
approximately  from  the  GaAs/Si  interface  area  con¬ 
tains  two  sets  of  spots,  one  created  by  the  GaAs  and 
the  other  by  the  Si.  Analysis  of  the  diffraction  pattern 
taken  from  the  interface  shows  a  0.8C  misalignment 
between  GaAs[100]  and  SiflOO]  directions.  Measure¬ 
ment  of  the  distance  from  the  origin,  000,  to  the  002 
spots  show  a  -3.5%  lattice  mismatch  agreeing  well 
with  the  moire  fringe  in  Fig.  5a.  Finally,  inspection  of 
Fig.  5a  and  c  show  that  GaAs  grows  both  on  the  top 
Si(100)  surface  and  on  the  side  walls  of  the  silicon 
islands.  The  image  in  Fig.  5c  was  typical  of  many 
GaAs/Si  islands  observed,  showing  that  most  of  the 
defects  originate  from  the  side  walls. 

CONCLUSIONS 

F  our  stages  of  the  growth  process  were  identified  as 
(1)  growth  initiation,  (2)  lateral  growth,  (3)  coales¬ 
cence,  and  (4)  planarization.  Growth  occurs  through 
a  mixture  of  selective-area  growth  (driven  by  the 


differential  surface  mobility  of  GaAs  on  Si02  versus 
Si),  and  3D  growth  of  GaAs  on  Si  (controlled  by 
surface  diffusion  and  strain  energy  considerations). 
Due  to  the  high  surface  mobility  of  GaAs  on  Si02, 
GaAs  grows  selectively  on  the  nanostructured  silicon 
islands  and  not  on  the  Si02  mask  at  all  temperatures 
tested.  The  temperature  of  the  initial  growth  controls 
the  surface  diffusion  length  and  ideally  should  be 
optimized  so  that  single  crystal  nucleation  occurs  on 
each  silicon  island.  If  the  temperature  is  too  high, 
single  crystal  nucleation  occurs  only  on  a  fraction  of 
the  total  silicon  islands  with  other  islands  left  de¬ 
nuded.  If  the  temperature  is  too  low,  multiple-crystal 
nucleation  will  occur  on  each  island  leading  to  grain 
boundaries  and  twinning.  The  optimum  temperature 
for  the  nanostructured  substrates  utilized  in  this 
study  is  approximately  605°C. 

The  GaAs/Si  heterostructures  are  partially  strained 
to  0.4-0. 7%,  although  the  data  does  not  reveal  whether 
strain  relaxation  occurred  through  misfit  disloca¬ 
tions  or  three-dimensional  strain  relief  mechanisms. 
TEM  images  show  that  GaAs  grows  both  on  the  top 
Si(  100)  and  side  wall  surfaces  and  indicate  that  stack¬ 
ing  faults  originate  from  the  side  walls.  Covering  the 
side  walls  with  a  mask  to  prevent  growth  might  be 
helpful  in  eliminating  the  stacking  faults.  Alterna¬ 
tively,  annealing  might  also  reduce  the  defects;  how¬ 
ever,  due  to  gross  morphological  changes,  the  anneal¬ 
ing  temperature  must  be  kept  below  850°C. 
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Initial  growth  studies  of  GaAs  on  an  array  of  Si  islands  nanostructured  on  (100) 
oriented  silicon-on-insulator  substrates  show  that  growth  occurs  through  a 
mixture  of  selective-area  and  3D  growth  modes.  An  optimum  initiation  growth 
temperature  must  tune  the  growth  conditions  to  the  geometry  of  the  seed  array 
so  that  selective-area  control  is  maintained  while  defect  density  is  minimized. 
The  optimum  temperature  for  a  square  array  of  Si  islands,  500  nm  in  pitch,  and 
100  nm  to  280  nm  in  diameter,  is  ~600°C.  This  temperature  yields  single-crystal 
nucleation  on  each  Si  island  while  maintaining  selective-area  growth  mode 
control.  Transmission  electron  microscope  (TEM)  analysis  of  optimized  and  non- 
optimized  grown  GaAs/Si  heterostructures  show  that  they  accommodate 
0.4-0. 7%  strain.  Further  reduction  in  stacking-fault  defects  attributed  to  side 
wall  growth  may  be  possible  through  masking  of  side  wall  or  annealing. 

Keywords:  Heteroepitaxy,  GaAs  on  Si,  growth  initiation,  lattice  mismatch 


INTRODUCTION 

Nanoheteroepitaxy  has  recently  been  proposed  as  a 
technique  for  significantly  reducing  defect  density  in 
lattice  mismatched  heteroepitaxy. 12  Nanoheteroepit¬ 
axy  incorporates  a  number  of  defect  reduction  tech¬ 
niques  including:  substrate  patterning,  lateral  epi¬ 
taxial  overgrowth,3-5  strain  partitioning  in  thin  com¬ 
pliant  substrates,67  and  epitaxial  necking,8  which 
exploits  free  surfaces  to  terminate  dislocations.  In 
addition,  nanoheteroepitaxy  exploits  three-dimen¬ 
sional  stress  relief  mechanisms  that  are  available 
only  in  nanoscale  heterostructures,  where  the  strain 
energy  density  decays  rapidly  away  from  the  inter¬ 
face.9  For  example,  nanoheteroepitaxy  theory  pre¬ 
dicts  that  growth  of  GaAs  on  Si  (~4%  misfit)  should 
support  a  coherent  interface  provided  the  island  size 
is  below  the  critical  island  diameter  (21c)  of  40  nm.2 
Nanoheteroepitaxy  addresses  highly  lattice  mis¬ 
matched  and  polarity  mismatched  systems  that  are 
expected  to  grow  in  Stranski-Krastanov  or  Volmer- 
Weber  mode.  One  example  is  the  growth  of  GaAs  on  Si 
which  becomes  three-dimensional  (3D)  at  an  early 
stage  of  film  growth  (<7  nm)  to  minimize  the  strain 


(Received  January  21,  2001;  accepted  March  24,  2001) 


energy.10  3D  growth  is  usually  undesirable  because  it 
leads  to  morphological  problems  and  one  approach  to 
limit  its  effects  has  been  to  employ  a  low-temperature 
buffer  layer  on  planar,1112  patterned,13  and  porous14 
substrates  to  achieve  smooth  continuous  films.  This 
approach  has  been  successfully  applied  to  GaN 
heteroepitaxy,15  where  atomic-force  microscopy  mea¬ 
surements  show  that  a  low  temperature  buffer  is  also 
3D  but  on  a  very  fine  scale.  This  finer  scale  of  3D 
growth  is  presumably  related  to  the  reduced  surface 
mobility  of  the  growth  species  at  the  lower  tempera¬ 
ture.  However,  a  tradeoff  appears  to  exist  between 
low  defect  density  and  smooth  film  morphology,10 
where  the  finer  scale  3D  growth  results  in  a  smoother 
film  but  can  cause  a  higher  defect  density  as 
misorientation  between  the  grains  will  lead  to  dislo¬ 
cations  and  low  angle  grain  boundaries  when  they 
coalesce.  In  nanoheteroepitaxy,  the  seed  diameter  is 
in  the  nanoscale,  in  the  range  of  the  domain  size  of 
crystals  grown  at  normal  temperatures.  Therefore,  it 
may  be  possible  to  grow  an  ordered  array  of  single¬ 
crystal  and,  perhaps,  coherent  GaAs  crystals  on  Si 
using  nanoheteroepitaxy.  This  paper  describes  obser¬ 
vations  of  the  initial  stages  of  nanoheteroepitaxial 
growth  of  GaAs  on  nanoscale-patterned  silicon-on- 
insulator  substrates. 
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EXPERIMENTAL 

Epitaxial  GaAs  was  grown  on  nanostructured  sili¬ 
con  islands  patterned  on  (100)  oriented  silicon-on- 
insulator  substrates  using  organometallic  vapor-phase 
epitaxy.  Four  stages  of  the  growth  process  were  iden¬ 
tified  as  (1)  growth  initiation,  (2)  lateral  growth,  (3) 
coalescence,  and  (4)  planarization.  This  study  empha¬ 
sizes  stages  (1),  (2)  and,  in  particular,  addresses  the 
effect  that  growth  initiation  has  on  the  quality  of 
GaAs  during  lateral  growth. 

The  surface  silicon  layer  of  (100)  oriented  silicon- 
on-insulator  substrates  was  patterned  using  inter¬ 
ferometric  lithography,  a  maskless,  large-area  pat¬ 
terning  technology.  For  details  on  interferometric 
lithography  please  refer  to  Ref.  16.  The  initial  thick¬ 
ness  of  the  surface  silicon  layer  was  200  nm.  The 
thickness  of  the  oxide  layer  was  370  nm.  The  surface 
silicon  layer  was  thinned  to  110  nm  by  an  oxidation 
and  hydrofluoric  acid  ( HF )  etching  sequence.  A  square 
array  of  silicon  islands  was  then  patterned  on  the 
surface  silicon  layer.  The  pitch  of  the  array  was  500 
nm  and  the  diameter  of  the  islands  varied  between 
100  and  280  nm  from  batch  to  batch.  Figure  1  contains 
a  plan  view  SEM  image  of  a  nanostructured  ( 100 )sili- 
con-on-insulator  (SOI)  wafer.  The  inset  in  Fig.  1  is  a 
schematic  cross-sectional  view  of  the  nanostructured 
(lOO)SOI  illustrating  the  different  layers  in  the  sub¬ 
strate.  Unintentional  variations  in  processing  caused 
the  shape  of  the  islands  to  vary  from  diamond  to  circular 
(see  Fig.  2);  however,  for  consistency  the  lateral  extent 
or  size  of  the  islands  is  referred  to  as  the  diameter. 

Arsine  and  trimethylgallium  were  used  as  precur¬ 
sors  for  group  V  and  III  sources,  respectively.  Reactor 
parameters  common  to  all  stages  were  the  reactor 
pressure  (100  torr),  the  carrier  gas  (hydrogen),  and 
the  total  gas  flow  rate  (5.9  slm).  The  arsine  flow  rate 
was  kept  constant  at  35  seem  in  all  growth  stages 
(1)  to  (4). 

Outgassing  and  oxide  desorption,  which  is  nor¬ 
mally  performed  at  a  temperature  of  900-1000°C  11 
was  limited  to  860°C  for  10  min  in  hydrogen  due  to 
gross  morphological  changes  that  occur  in 
nanostructured17  and  porous18  silicon  samples  at  tem¬ 
peratures  above  ~1000°C  and  750°C,  respectively. 
The  temperature  was  then  reduced  in  preparation  of 
the  growth  initiation  stage.  In  order  to  avoid  anti¬ 
phase  domain  boundaries,  an  arsenic  terminated 
silicon  surface12  was  created  by  exposing  silicon  to 
arsine  at  a  rate  of  35  seem  for  2  min  immediately  prior 
to  growth  of  GaAs.  In  this  stage,  four  different  growth 
temperatures  were  tested  between  430°C  to  730°C  to 
study  the  effects  of  growth  of  this  stage  on  material 
grown  in  later  stages.  The  trimethylgallium  flow  rate 
was  between  1  and  2.4  seem.  This  gave  a  V/III  molar 
ratio  of  -580  (this  ratio  is  calculated  based  on  the 
input  flows  of  the  constituents.  The  reduced  decompo¬ 
sition  rate  of  arsine  at  the  lower  growth  temperature 
no  doubt  reduced  the  actual  V/III  ratio  during  the 
lower  temperature  growths.)  The  initial  growth  rate 
(stage  1)  was  -15  nm/min.  During  lateral  growth 


Fig.  1 .  Plan  view  SEM  image  of  Si  array  of  nanoislands  patterned  on 
(100)  oriented  silicon-on-insulator.  Inset  is  cross-sectional  schematic 
of  nanostructured  (100)  silicon-on-insulator  substrate. 


Fig.  2.  Plan  view  SEM  images  of  GaAs  nucleation  on  silicon  islands  at 
a  growth  initiation  temperature  of  (a)  433°C,  (b)  51 9°C,  (c)  605°C,  and 
(d)  734°C.  All  scale  bars  indicate  200  nm. 


(stage  2)  the  temperature  was  605°C  and  the 
trimethylgallium  flow  rate  was  8  seem. 

A  Hitachi  S-800  field-emission,  scanning  electron 
microscope  (SEM)  operating  at  20  kV  and  a  JOEL  2010 
transmission  electron  microscope  (TEM)  operating  at 
200  kV  were  used  to  characterize  the  GaAs  films. 

RESULTS 

Figure  2  shows  the  normal  incidence  SEM  images 
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Temperature  (°C) 

Fig.  3.  Average  inter-island  distance  of  GaAs  nuclei  on  planar  Si  and 
migration  length  of  GaAs  on  Si02  versus  temperature. 


Fig.  4.  Plan  view  SEM  image  of  an  array  of  GaAs  crystals  deposited 
on  nanostructured  (100)  oriented  silicon-on-insulator.  Approximately 
1 00  nm  of  material  was  deposited  at  a  temperature  of  605°C.  Coales¬ 
cence  of  two  pairs  of  crystals  occurred  during  GaAs  growth  and  is  not 
due  to  the  patterning. 


of  GaAs  nucleation  behavior  after  approximately  15  nm 
of  deposition  on  nanostructured  silicon  islands  at 
substrate  temperatures  of  Ts  =  433,  519,  605,  and 
734°C.  At  a  low  substrate  temperature  of  433°C, 
multiple  (5  to  8)  GaAs  crystals  with  grain  size  ap¬ 
proximately  25  nm  in  diameter  nucleate  on  each 
silicon  island  as  shown  in  Fig.  2a.  As  the  temperature 
increases  the  number  of  nuclei  per  silicon  island 
decreases,  until  at  605°C  only  one  GaAs  crystal  nucle¬ 
ates  per  silicon  island  (Fig.  2c).  However  if  the  tem¬ 
perature  is  too  high  [734°C  nucleation  occurs  only  on 
some  silicon  islands  (see  Fig.  2d)].  It  appears  that 


there  is  indeed  a  tendency  to  3D  growth;  however,  the 
size  and  spacing  of  the  nuclei  is  influenced  by  surface 
mobility,  which  is  controlled  largely  by  surface  diffu¬ 
sion  and  therefore  growth  temperature.  Figure  3 
plots  data  from  related  experiments,  where  the  aver¬ 
age  inter-island  distance  of  GaAs  nuclei  was  mea¬ 
sured  for  growth  on  planar  Si(100)  as  a  function  of 
temperature.  The  temperature  dependence  of  island 
spacing  is  quite  strong  and  indicates  that  nucleation 
is  diffusion  controlled.10  This  figure  also  plots  the 
surface  migration  length  of  GaAs  on  Si02  from  Ref. 
19.  Clearly,  the  surface  migration  length  of  GaAs  on 
Si02  is  for  all  temperatures  much  larger  than  the 
pitch  of  the  nanostructured  islands  (500  nm)  so  the 
growth  species  has  an  adequate  migration  length  for 
selective  growth  as  indicated  in  Fig.  2.  In  contrast,  the 
average  inter-island  distance  for  GaAs  on  Si  is  of  the 
same  order  as  the  silicon  island  diameter  and  pitch. 
This  is  consistent  with  the  results  of  Fig.  2,  where,  at 
low  growth  temperature,  the  GaAs  has  a  low  mobility 
on  Si  and  multiple  nuclei  are  observed  on  each  sub¬ 
strate  island.  While  at  high  temperature,  the  GaAs 
mobility  is  much  higher  and  the  growth  species  may 
have  too  much  mobility.  For  example,  at  734°C,  the 
average  inter-island  distance  is  1700  nm  and  growth 
selectivity  is  lost  because  the  large  surface  migration 
length  leaves  some  of  the  islands  denuded.  This  sug¬ 
gests  that  there  is  an  optimum  temperature  for  achiev¬ 
ing  selective  growth  in  the  nanoheteroepitaxial  (NHE ) 
process.  Under  optimum  conditions  the  growth  spe¬ 
cies  will  have  adequate  mobility  to  reach  a  Si  island 
and  to  create  a  single  nucleus  on  each  Si  island,  but 
they  will  not  have  excessive  mobility  that  can  leave 
some  Si  islands  without  any  GaAs  at  all.  Based  on 
Figs.  2  and  3,  this  optimum  temperature  is  approxi¬ 
mately  605°C.  Figure  4  shows  an  array  of  GaAs  NHE 
nuclei  grown  on  silicon  islands  after  -100  nm  of 
deposition  at  605°C.  Some  coalescing  can  be  observed 
in  the  image.  Well-aligned  rectangular  faceting  can 
also  be  observed  indicative  of  epitaxy  although  sev¬ 
eral  crystal  habits  are  present. 

Cross-sectional  TEM  analysis  of  NHE  GaAs  nuclei 
on  Si  islands  reveals  that  the  temperature  during 
growth  initiation  has  a  strong  effect  on  the  defect 
structure.  Figure  5  shows  cross-sectional,  diffraction 
contrast  images  taken  along  the  [Oil]  zone  axis.  This 
figure  compares  samples  that  were  all  grown  at  605°C 
during  the  lateral  stage  (stage  2)  but  had  a  different 
temperature  during  the  growth  initiation  stage  ( stage 
1).  All  samples  had  -100  nm  of  GaAs  deposition. 
Figure  5a  (stage  1  at433°C)and  5b  (stage  1  at519°C) 
were  typical  of  several  GaAs/Si  islands  observed  that 
had  a  growth  initiation  at  either  433°C  or  519°C.  In 
the  island  in  Fig.  5a  stacking  faults  can  be  observed. 
Notice  that  moire  fringes  in  the  Si  area,  caused  by  the 
superposition  of  the  GaAs  and  Si  lattices,  can  also  be 
observed.  The  average  period,  x,  of  the  moire  pattern 
was  measured  at  5.33  nm  near  the  right-hand  side  of 
the  Si  island.  At  the  top  GaAs/Si  interface  the  average 
period  was  5.79  nm.  Using  a  simple  analysis,  it  is 
speculated  that  the  periods  correspond  to  lattice  mis- 
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Fig.  5.  Cross-section  diffraction-contrast  images  of  GaAs/Si  heterostructures  after  100  nm  of  GaAs  deposition  at  605°C  during  the  lateral  growth 
stage  but  with  a  growth  initiation  temperature  of  (a)  433°C,  (b)  519cC,  and  (c)  605°C.  Inset  diffraction  patterns  in  (b)  and  (c)  were  taken  from  the 
areas  indicated  by  circles.  No  silicon  island  is  observed  in  (b)  because  the  thinned  sample  was  from  an  area  adjacent  to  the  island.  All  scale  bars 
indicate  70  nm. 


matches  of  e  =  dSi((m/T  =  3.6%  ±  0.1%  at  the  edge  and 
3.3%  ±  0.1%  at  the  interface,  where  dSi[0U]  =  0.192  nm 
is  the  lattice  plane  spacing  of  Si  in  the  [011]  direction. 
Thus  the  materials  are  accommodating  0.4-0. 7% 
strain.  The  diffraction  pattern  in  Fig.  5b  was  taken 
approximately  from  the  area  indicated  by  the  circle 
and  contains  two  sets  of  diffraction  spots  indicating 
that  twinning  has  occurred.  In  addition  a  low-angle 
grain  boundary  runs  diagonally  across  the  island  as 
revealed  by  difference  in  contrast  in  the  GaAs.  In 
contrast,  the  material  with  a  growth  initiation  tem¬ 
perature  of  605°C  (Fig.  5c)  had  no  low-angle  grain 
boundaries  although  stacking  faults  can  still  be  ob¬ 
served.  Figure  5c  contains  two  diffraction  patterns. 
One  pattern  taken  approximately  from  the  defected 
area  contains  streaks,  confirming  that  the  defects  are 
stacking  faults.  Another  diffraction  pattern  taken 
approximately  from  the  GaAs/Si  interface  area  con¬ 
tains  two  sets  of  spots,  one  created  by  the  GaAs  and 
the  other  by  the  Si.  Analysis  of  the  diffraction  pattern 
taken  from  the  interface  shows  a  0.8C  misalignment 
between  GaAs[100]  and  SiflOO]  directions.  Measure¬ 
ment  of  the  distance  from  the  origin,  000,  to  the  002 
spots  show  a  -3.5%  lattice  mismatch  agreeing  well 
with  the  moire  fringe  in  Fig.  5a.  Finally,  inspection  of 
Fig.  5a  and  c  show  that  GaAs  grows  both  on  the  top 
Si(100)  surface  and  on  the  side  walls  of  the  silicon 
islands.  The  image  in  Fig.  5c  was  typical  of  many 
GaAs/Si  islands  observed,  showing  that  most  of  the 
defects  originate  from  the  side  walls. 

CONCLUSIONS 

F  our  stages  of  the  growth  process  were  identified  as 
(1)  growth  initiation,  (2)  lateral  growth,  (3)  coales¬ 
cence,  and  (4)  planarization.  Growth  occurs  through 
a  mixture  of  selective-area  growth  (driven  by  the 


differential  surface  mobility  of  GaAs  on  Si02  versus 
Si),  and  3D  growth  of  GaAs  on  Si  (controlled  by 
surface  diffusion  and  strain  energy  considerations). 
Due  to  the  high  surface  mobility  of  GaAs  on  Si02, 
GaAs  grows  selectively  on  the  nanostructured  silicon 
islands  and  not  on  the  Si02  mask  at  all  temperatures 
tested.  The  temperature  of  the  initial  growth  controls 
the  surface  diffusion  length  and  ideally  should  be 
optimized  so  that  single  crystal  nucleation  occurs  on 
each  silicon  island.  If  the  temperature  is  too  high, 
single  crystal  nucleation  occurs  only  on  a  fraction  of 
the  total  silicon  islands  with  other  islands  left  de¬ 
nuded.  If  the  temperature  is  too  low,  multiple-crystal 
nucleation  will  occur  on  each  island  leading  to  grain 
boundaries  and  twinning.  The  optimum  temperature 
for  the  nanostructured  substrates  utilized  in  this 
study  is  approximately  605°C. 

The  GaAs/Si  heterostructures  are  partially  strained 
to  0.4-0. 7%,  although  the  data  does  not  reveal  whether 
strain  relaxation  occurred  through  misfit  disloca¬ 
tions  or  three-dimensional  strain  relief  mechanisms. 
TEM  images  show  that  GaAs  grows  both  on  the  top 
Si(  100)  and  side  wall  surfaces  and  indicate  that  stack¬ 
ing  faults  originate  from  the  side  walls.  Covering  the 
side  walls  with  a  mask  to  prevent  growth  might  be 
helpful  in  eliminating  the  stacking  faults.  Alterna¬ 
tively,  annealing  might  also  reduce  the  defects;  how¬ 
ever,  due  to  gross  morphological  changes,  the  anneal¬ 
ing  temperature  must  be  kept  below  850°C. 
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Selective  growth  of  GaAs  on  a  nanoscale  Si02-pattemed  GaAs(OOl)  substrate  by  mo¬ 
lecular  beam  epitaxy  is  reported.  Reduction  of  the  lateral  dimensions  of  a  Si02  pattern 
below  the  surface  diffusion  length  of  an  incident  Ga  atom  results  in  preferential  migration 
from  the  Si02  surface  to  stable  bonding  configurations  at  nearby  open  GaAs  substrate 
surfaces.  This  intrinsic  selectivity  is  achieved  under  high  growth  temperature  with  low 
growth  rate  where  surface  migration  on  the  Si02  surface  is  highly  extended.  A  large-area 
nanoscale  Si02  pattern  is  realized  by  interferometric  lithography.  A  330-nm  period  two- 
dimensional  array  of  GaAs  disks  having  a  height  of  40  nm  and  a  diameter  of  about  240 
nm,  selectively  grown  on  GaAs(OOl)  substrates  is  presented.  Variation  of  the  nanoscale 
selective  growth  mode  depending  on  the  diameter  of  GaAs  disks  is  also  observed. 
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I.  Introduction 

Selective  growth  potentially  will  have  a  strong  impact 
on  device  and  circuit  applications.  Since  Cho  and  Reinhart 
first  demonstrated  shadow-masked,  isolated-area  growth,1 
several  attempts  to  realize  selective  growth  in  molecular 
beam  epitaxy  (MBE)  have  been  reported.1'9  Except  for  the 
work  using  mechanical  masks,1,2  these  experiments  rely  on 
patterning  of  a  substrate  or  of  a  deposited  masking  material 
and  can  be  divided  into  two  categories:3'9  those  dependent  on 
process  technology  and  those  relying  on  material  properties 
such  as  surface  mobility  of  incident  atoms  and  their  reaction 
with  a  mask. 

The  first  category  includes  direct  deposition  on  a  mask- 
patterned  substrate  with3  or  without4  a  subsequent  lift-off 
step  or  a  self-masking  substrate  taking  an  advantage  of  topo¬ 
logical  shadowing  in  MBE.5  The  second  category  utilizes 
variation  in  the  physical  or  chemical  properties  of  a  material 
system  consisting  of  a  mask,  an  epilayer,  and  a  substrate. 
An  example  of  the  latter  category  is  providing  several  facets 
on  a  substrate  upon  which  an  incident  atom  has  different 
sticking  coefficient  and  surface  mobility.  These  approaches 
are  not  perfectly  selective  but  lead  to  a  lateral  thickness 
variation.  Several  facet-dependent  selective  growths  on  pla¬ 
nar  and  nonplanar  substrates  have  been  reported.6'8  Selective 
group  IV  MBE  using  chemical  reactions  between  an  incident 
Si  or  Ge  atom  and  a  Si02  mask  to  produce  SiO  or  GeO 
volatilized  from  the  mask  can  be  included  in  this  category.9 

A  new  selective  growth  mechanism  basically  different 
from  those  previously  mentioned  is  available  for  a  mask- 
patterned  substrate  if  the  lateral  dimension  of  the  mask  is 
reduced  to  less  than  a  surface  diffusion  length  of  the  incident 
atom.  Atoms  incident  on  the  mask  material  will  then  mi¬ 


grate  to  nearby  unmasked  substrate  regions  where  favorable 
bonding  sites  are  available.  That  is,  reduction  of  a  mask 
dimension  below  the  surface  diffusion  length  on  the  mask 
material  can  result  in  a  nanoscale  selective  growth  (NSG) 
mode. 

Si02  has  been  widely  used  as  a  mask  material,  but  the 
surface  diffusion  length,  LD  Si02,  of  an  Ga  atom  incident  on 
Si02  surface  has  not  been  well  studied.  The  surface  diffusion 
length  on  a  GaAs  surface,  LDGaAs  critically  depends  on  the 
growth  conditions,  ranging  from  20  -  to  30-nm  up  to  a  few 
pm.4, 7,10,11  The  surface  diffusion  length  of  a  Ga  atom  is  dif¬ 
ferent  for  epitaxial  GaAs-on-GaAs  and  non-epitaxial  GaAs- 
on-Si02.  Comparing  the  adsorption  mechanisms  for  these 
two  cases,  we  speculate  that  a  Ga  atom  will  have  a  higher 
surface  mobility  but  a  shorter  surface  lifetime  on  a  Si02 
than  on  a  GaAs  surface.  If  nanoscale  lithography  is  applied 
to  Si02  patterning,  LD  Si02  can  be  experimentally  character¬ 
ized  and  the  possibility  of  NSG  also  can  be  examined.  Re¬ 
cently,  there  has  been  remarkable  progress  in  optical  and 
electron-beam  lithographies.  Among  them,  interferometric 
lithography  (IL)  is  very  promising  as  a  low-cost,  reliable, 
and  scalable  technology  for  deep  sub- 100  nm  scales.12,13  In 
this  work,  we  report  the  first  demonstration  of  NSG  of 
GaAs  on  a  Si02-pattemed  substrate  prepared  by  IL.  We  pre¬ 
sent  a  330  nm-period  2-dimensional  (2-D)  array  of  GaAs 
disks  with  a  height  of  40  nm  and  diameters  of  about  240 
nm,  selectively  grown  on  a  Si02-pattemed  GaAs(001)  sub¬ 
strate  by  MBE.  We  also  report  the  dependence  of  NSG  on 
the  diameter  of  the  GaAs  disks. 
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Figure  1  A  schematic  diagram  of  interferometric  lithogra¬ 
phy. 

II.  Nanoscale  pattern  processing  and  MBE 
growth 

As  shown  in  Fig.  1,  the  basic  structure  for  processing 
consists  of  a  negative  photoresist  (PR)  film,  a  40  nm-thick 
Si02  film,  and  a  GaAs(OOl)  substrate.  A  single  longitudinal 
and  transverse  mode,  frequency  tripled  YAG  laser  at  a  355- 
nm  wavelength  was  used  as  an  ultraviolet  (UV)  light  source. 
In  IL,  as  illustrated  in  Fig.  1,  a  direct  and  a  mirror-reflected 
UV  beam  are  used  to  produce  a  330-nm  period  interference 
pattern  on  the  PR  film.  To  make  a  2-D  array  of  holes,  a 
double  exposure  with  a  right  angle  rotation  of  the  sample 
was  performed.  Fig.  2(a)  and  (b)  are  a  45°-tilted  and  a  cross- 
sectional  scanning  electron  microscope  (SEM)  image  of  the 
2-D  array  of  holes  generated  on  the  PR  film.  In  Fig.  2(b), 
standing  wave  effects  produce  a  sinusoidal  profile  along  the 
vertical  direction  in  the  PR  film. 

After  development,  the  sample  was  dry-etched  to  transfer 
the  PR  pattern  of  Fig.  2  (a)  through  the  Si02  film.  Figure  3 
(a)  shows  a  45° -tilted  SEM  image  of  a  2-D  array  of  holes 
transferred  onto  the  Si02  film.  These  holes  provide  access 
for  Ga  atoms  to  the  substrate  surface  for  epitaxial  growth.  In 
this  work,  our  concern  is  the  lateral  dimensions  of  the  Si02 
mask.  While  the  diameter  of  the  holes,  d ,  ranges  from  200  - 
280  nm,  the  lateral  width,  LM ,  between  the  holes  in  the  Si02 


Figure  2(a)  A  45°tilted  and  (b)  a  cross-sectional  SEM  image 
of  a  2-D  array  of  holes  generated  in  a  negative  PR  film. 


Figure  3  45°-tilted  SEM  images  of  (a)  a  2-D  array  of  holes 
transferred  through  the  Si02  films  by  dry  etching,  (b)  the 
MBE  as-grown  sample,  and  (c)  a  2-D  array  of  GaAs  disks 
obtained  by  diluted  HF  treatment  for  removal  of  the  Si02 
pattern.  In  (a),  white  arrows  indicate  the  maximum  and  the 
minimum  LM 

mask  ranges  from  -  90  nm  at  a  minimum  to  a  ^  230  nm  at 
a  maximum  as  indicated  by  white  arrows  in  Fig.  3  (a). 
These  values  are  within  the  range  reported  forLDGaAs. 4,7,10,11 

On  the  Si02  pattern  of  Fig.  3  (a),  MBE  growth  corre¬ 
sponding  to  a  30-nm  thick  homogeneous  GaAs  deposition 
was  performed.  In  this  work,  to  enhance  the  surface  diffusion 
length  of  Ga  atoms,  low  growth  rate  at  high  growth  tem¬ 
perature  was  employed.  The  flux  of  Ga  atoms  was  controlled 
to  achieve  a  growth  rate  of  0.25  monolayer  (ME)  s'1,  which 
was  separately  calibrated  by  reflection  high  energy  electron 
diffraction  (RHEED)  oscillation.  The  growth  temperature 
was  set  to  630°C.  This  is  very  close  to,  but  does  not  exceed, 


the  Ga  desorption  temperature.14  The  growth  temperature 
was  monitored  by  optical  pyrometry.  Although  about  50% 
of  the  surface  of  the  patterned  substrate  was  covered  by  Si02, 
the  thermocouple  and  the  pyrometer  showed  consistent  tem¬ 
perature  control  compared  with  a  bare  GaAs  substrate.  This 
means  that  the  real  temperatures  were  close  to  630° C.  In 
consideration  of  this  high  growth  temperature  of  GaAs  depo¬ 
sition,  the  As/Ga  flux  ratio  was  adjusted  to  about  40  in 
beam  equivalent  pressure. 

III.  Results  and  discussion 

Figure  3(b)  shows  a  45° -tilted  SEM  image  of  the  as- 
grown  sample.  This  figure  provides  important  experimental 
evidence  of  selective  growth.  First  of  all,  from  the  compari¬ 
son  of  Fig.  3(b)  with  Fig.  3(a),  we  find  a  change  of  topog¬ 
raphy.  This  is  because  the  holes  were  filled  by  GaAs 
whereas  the  Si02  regions  do  not  show  any  apparent  deposi¬ 
tion.  If  the  deposition  had  occurred  unselectively  over  the 
whole  sample,  the  topography  of  the  Si02  pattern  of  Fig. 
3(a)  would  have  been  maintained  during  growth.  No  ring 
patterns  appeared  in  the  in  situ  RHEED,  implying  no  for¬ 
mation  of  poly  crystalline  GaAs  on  the  Si02  surface.  These 
observations  show  that  selective  growth  of  GaAs  was  epi¬ 
taxially  achieved  only  inside  the  2-D  array  of  holes  shown  in 
Fig.  3(a)  of  which  the  bottoms  are  GaAs(OOl)  surfaces. 

Dipping  the  as-grown  sample  in  a  diluted  HF  solution 
strips  the  Si02  mask  leaving  only  the  selectively  deposited 
GaAs  disks  on  the  substrate.  Figure  3(c)  shows  a  45° -tilted 
SEM  image  of  2-D  array  of  GaAs  disks.  Figure  4  presents  a 
top-SEM  image  of  a  single  GaAs  disk.  In  Fig.  4,  the  GaAs 
disk  has  a  circular  shape  without  any  apparent  facets  in  the 
growth  plane.  Side-view  SEM  shown  in  Fig.  5,  however, 
reveals  several  different  cross  sections  such  as  a  circular  sec¬ 
tion  [(a)],  a  triangle  [(b)],  and  a  trapezoid  [(c)  and  (d)].  As  is 
discussed  in  detail  below,  this  variability  reflects  changes  in 
the  hole  diameter  resulting  from  pattern  nonuniformity. 
Among  these,  as  shown  in  Fig.  5(c)  which  has  a  bottom 
diameter  similar  to  that  of  Fig.  4,  a  trapezoidal  profile  with 
a  rounded  top  edge  is  the  most  frequently  observed.  The  disk 
height  of  Fig.  5(c)  is  approximately  40  -  45  nm,  almost  30 
-  50%  greater  than  the  30-nm  thickness  predicted  by  the 
deposition  time  and  the  Ga  flux.  This  trapezoidal  side-view 
implies  that  Ga  atoms  incident  near  the  boundary  between 


Figure  4  A  top-view  SEM  image  of  a  single  GaAs  disk  of  d  ~ 
240  nm. 


100  nm 


Figure  5  Side-view  SEM  images  of  GaAs  disks  having  dif¬ 
ferent  diameters  of  (a)  ~  180  nm,  (b)  and  (c)  ~  240  nm,  and 
(d)  -  330  nm. 

GaAs  and  Si02  surfaces  migrate  to  a  (001)  plane  at  the  top 
surface  of  the  disk  to  find  a  more  stable  adsorption  site  as 
shown  in  Fig.  5(c).6  In  Fig.  4,  a  dark  ring  with  a  width  of 
about  10-nm  appears  on  the  periphery  of  the  disk.  This  was¬ 
n't  observed  in  the  side-view  SEM  image  of  Fig.  5(c)  but 
can  be  seen  as  a  shadow  of  a  shallow  groove  around  each 
disk  in  Fig.  3(c).  The  origin  of  these  rings  is  not  clear  but 
they  may  be  due  to  some  loss  of  As  atoms  from  the  sub¬ 
strate  surface  very  close  to  the  edge  of  the  Si02  mask.  Such 
a  region  is  not  exposed  to  an  As-rich  environment  and  can 
lose  As  atoms  at  high  growth  temperature. 

We  also  investigated  the  behavior  of  Ga  atoms  on  a  un- 
pattemed  Si02  surface  during  growth.  This  is  very  important 
because  the  sticking  coefficient  of  a  Ga  atom  on  a  Si02  sur¬ 
face  at  the  given  growth  condition  must  be  examined.  Fig¬ 
ure  6  shows  a  45° -tilted  SEM  image  of  a  unpatterned  Si02 
surface  for  the  same  growth  conditions  as  the  Si02-pattemed 
substrate.  Rather  than  a  uniform  deposition,  isolated  islands 
of  GaAs  are  formed  and  are  randomly  distributed  on  the  Si02 
surface. 

However,  it  should  be  noted  that  Fig.  6  gives  not  a  real 
but  an  apparent  sticking  coefficient,  SGa.Si02,  of  a  Ga  atom 
on  a  Si02  surface.  Here,  we  define  SGa.Si02  at  a  given  growth 
condition  as  a  ratio  of  the  area  covered  by  GaAs  in  a  unit 
area  of  Si02  to  the  unit  area.  The  island  generation  observed 
in  Fig.  6  provides  seed  formation  for  poly  crystalline  GaAs 
growth.  The  growth  of  GaAs  on  the  unpattemed  Si02  sur¬ 
face  is  quasi-isotropic.  In  other  words,  once  a  seed  atom 
finds  a  favorable  site  on  a  Si02  surface,  growth  proceeds  not 
only  vertically  but  also  laterally.  Lateral  growth,  however, 
is  not  entirely  due  to  adsorption  of  Ga  atoms  on  a  Si02 
surface  but  partly  due  to  preferred  bonding  between  itinerant 
Ga  atoms  and  Ga  or  As  atoms  already  captured  at  the  seed 


Figure  6  A  45°tilted  SEM  image  of  a  unpatterned  Si02  sur¬ 
face  treated  with  same  deposition  under  the  same  growth 


condition  (see  the  text).  The  inset  is  a  side-view  SEM  image 
of  a  GaAs  island  formed  on  the  unpatterned  Si02  mask. 

site.  Because  of  this,  if  growth  is  continued  further,  the 
Si02  surface  would  eventually  be  covered  by  GaAs  and  SGa_ 
si02  would  seem  to  be  close  to  unity  regardless  of  the  initial 
sticking  coefficient.  That  is,  the  apparent  SGa_SiQ2  depends  on 
deposition  time  as  well  as  temperature  and  can  be  very  dif¬ 
ferent  from  the  initial  sticking  coefficient.  To  measure  the 
initial  sticking  coefficient,  submonolayer  deposition  with 
extremely  low  incident  Ga  flux  is  required  so  that  an  intera¬ 
tomic  spacing  of  Ga  atoms  on  Si02  surface  is  greater  than 
LD-Si02 t0  avoid  an  encounter  between  a  newly  incident  atom 
and  pre-existing  ones.  In  this  non-epitaxial  deposition,  how¬ 
ever,  apparent  sticking  coefficient  is  more  meaningful  since 
experimental  extraction  of  initial  sticking  coefficient  is  very 
difficult. 

In  the  inset  SEM  image  of  Fig.  6,  the  height  of  the 
GaAs  island  is  about  30  nm  which  is  very  close  to  the 
thickness  expected  from  deposition  time  and  Ga  flux.  In 
Fig.  6,  however,  distribution  of  GaAs  islands  is  sparse,  im¬ 
plying  a  low  SGa_siQ2  for  30  nm-thick  deposition.  The  island 
density  is  about  100  ±  20  qm'2  with  an  average  spacing 
between  GaAs  islands  of  about  100  nm.  Since  the  diameter 
of  an  island,  shown  in  the  inset  to  Fig.  5,  is  about  50  nm, 
5* Ga-Si02  ~  0-19.  By  multiplying  SGa.Si02  by  0.67  from  the 
consideration  of  the  ellipsoidal  shape  of  an  GaAs  island  [the 

volume  of  a  GaAs  island  =  2;ra2Z?/3  where  a  ~  25  nm  and  b 
~  30  nm],  the  fraction  of  incident  Ga  atoms  which  stick  on 
the  unpatterned  Si02  surface  for  30  nm-thick  deposition  is 
about  0.13.  This  implies  a  significant  loss  of  incident  Ga 
atoms.  In  spite  of  the  low  SGa_Si02,  approximately  13%  of 
the  incident  Ga  atoms  still  remain  on  the  unpatterned  Si02 
surface  to  form  GaAs  islands.  Comparing  Fig.  3(b)  with  Fig 
6  shows  a  very  important  direct  evidence  confirming  NSG. 
In  Fig.  3(b),  no  GaAs  islands  appear  on  the  patterned  Si02 
mask.  According  to  our  definition,  the  apparent  SG a.SiG2  on  a 
nanoscale  patterned  Si02  surface  is  zero  for  these  growth 
conditions.  Seed  formation  similar  to  Fig.  6  also  could 
occur  on  the  patterned  Si02  region.  But,  the  nanoscale  Si02 
pattern  having  lateral  dimensions  less  than  LD  Si02  suppresses 
GaAs  seed  formation  by  providing  Ga  atoms  a  chance  to 
migrate  and  bond  to  Ga  or  As  atoms  on  a  nearby  GaAs  sur¬ 
face.  Generally,  a  low  sticking  coefficient  implies  a  short 
surface  lifetime.  As  previously  mentioned,  however,  incident 
Ga  atoms  can  have  a  high  surface  mobility  on  the  Si02  sur¬ 


face  and  some  of  them  travel  a  distance  long  enough  to  reach 
the  nearest  GaAs  surface  before  desorption.  Therefore,  this 
comparison  confirms  that  the  selective  growth  originated 
from  the  migration  of  Ga  atoms  across  nanoscale  Si02  pat¬ 
terns.  Since  the  direction  of  movement  of  an  incident  Ga 
atom  on  a  Si02  surface  is  random,  LD  Si02  is  comparable  to 
or  greater  than  the  maximum  LM  for  our  growth  conditions. 

Although  some  of  Ga  atoms  incident  on  the  patterned 
Si02  region  move  to  adjacent  GaAs  surfaces,  the  volume  of 
deposited  GaAs  indicates  that  not  all  Ga  atoms  incident  on 
the  Si02  surface  are  captured.  As  a  rough  calculation,  the 
height  of  a  GaAs  disk,  h ,  of  Fig.  5(c)  is  at  most  40  -  50  nm 
which  is  not  too  different  from  the  volume  of  a  30-nm  thick 
cylindrical  disk  with  the  same  bottom  area  expected  from 
homogeneous  unpatterned  growth.  This  can  be  explained  by 
a  low  5'Ga.Si02-  In  addition,  the  ratio  of  the  bottom  area  of  a 
GaAs  disk  shown  in  Fig.  4  and  Fig.  5(c)  to  a  single  period 
area,  y,  is  only  about  0.42.  By  controlling  the  dry  etch 
time,  however,  d  can  be  varied  from  180  nm  to  330  nm  or 
more  so  that  y  can  change  from  0.23  to  0.79  or  more.  Here, 
d  greater  than  the  period  is  obtained  by  Si02-overetching 
leading  to  complete  removal  of  a  Si02  film  between  nearest 
neighbor  holes,  igure  7  shows  45° -tilted  SEM  images  ob¬ 
tained  from  as-grown  samples  having  different  hole  diame¬ 
ters  but  the  same  growth  conditions.  Figure  5(a)  shows  a 
side-view  SEM  image  of  the  as-grown  sample  taken  after 
HF  treatment  which  has  a  diameter  similar  to  those  of  the 
GaAs  disks  in  Fig.  7  (a).  Reduction  of  d  means  a  decrease  of 
y  and,  as  a  result,  an  increased  contribution  of  Ga  atoms 
migrating  from  the  Si02  surface  to  the  volume  of  the  GaAs 
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Figure  7  45°tilted  SEM  images  obtained  from  the  as-grown 
samples  having  d  of  (a)  about  180  nm  and  (b)  more  than  330 
nm. 


disk.  For  d  ~  180  nm  shown  in  Fig.  7  (a)  and  Fig.  5  (a),  y 
approaches  0.23.  As  d  is  decreased,  the  dominant  profile  in 
side  view  SEM  varies  from  a  trapezoid  to  a  circular  section 
as  shown  in  Fig.  5.  However,  the  overall  shape  of  GaAs 
disks  in  Fig.  7  (a)  is  not  spherical.  The  change  of  shape 
seems  to  be  result  from  distortions  of  the  holes  in  the  Si02 
film  which  generally  occurs  in  underetching  for  small  d. 
This  could  be  induced  by  different  etch  rate  in  the  center  re¬ 
gion  which  is  faster  than  that  near  the  periphery  of  a  hole, 
resulting  in  U-shaped  profile  of  the  Si02  film  of  which  the 
thickness  close  to  the  opening  is  too  thin  to  be  controlled 
properly.  These  distortions  can  cause  irregular  facet  genera¬ 
tion  along  the  periphery  or  may  prevent  GaAs  disks  from 
achieving  a  (001)  top-surface  facet.  In  Fig.  7  (b),  the  Si02 
film  was  overetched  and  mask  regions  between  the  nearest 
neighbor  holes  disappeared.  Therefore,  the  remaining  Si02 
mask  forms  a  2-D  array  of  approximately  square  posts  rather 
than  a  2-D  array  of  holes  as  seen  in  Fig.  3(b)  and  Fig.  7(a). 

Figure  8  shows  the  observed  relation  between  d  and  h  for 
GaAs  disks  measured  from  side-view  SEM.  The  horizontal 
dotted  line  represents  the  nominal  thickness  estimated  from 
Ga  flux  and  deposition  time.  As  shown  in  Fig.  5,  h  in¬ 
creases  with  decreasing  d.  When  d  ~  180  nm  or  y  -  0.23,  h 
is  about  65  nm.  This  represents  a  relative  increase  of  vol¬ 
ume  of  a  GaAs  disk  with  decreasing  y ,  implying  an  addi¬ 
tional  contribution  of  Ga  atoms  migrating  from  Si02  re¬ 
gions  to  the  volume  of  the  GaAs  disk.  In  a  rough  calcula¬ 
tion  from  Fig.  8,  about  a  40  %  increase  in  the  volume  of  a 
GaAs  disk  was  observed  when  y  ~  0.23  using  the  same  el¬ 
lipsoidal  scheme.  For  a  circular- sectioned  GaAs  disk  of  h  ~ 
65  nm  and  d  ~  180  nm,  about  12  %  of  the  Ga  atoms  inci¬ 
dent  on  the  patterned  Si02  surface  in  an  unit  period  area  then 
must  stick  to  the  patterned  Si02  surface  and  migrate  to  an 
open  GaAs  surface,  to  explain  the  40%  volume  increase. 
The  volume  calculation  of  a  GaAs  disk  includes  crude  as¬ 
sumptions  such  as  perfect  ellipsoidal  shape,  but,  12%  is 


Figure  8  Relation  between  the  diameter  ( d )  and  the  height  ( h ) 
of  GaAs  disks  measured  from  side-view  SEM.  The  vertical 
and  the  horizontal  dotted  line  represent  the  pattern  period 
and  the  nominal  thickness  of  GaAs  estimated  from  Ga  flux 
and  deposition  time,  respectively 


close  to  the  fraction  of  Ga  atoms  that  remain  on  the  unpat¬ 
terned  Si02  surface.  The  relative  increment  of  the  volume  of 
a  GaAs  disk  with  decreasing  d,  therefore,  also  supports  NSG 
induced  by  preferential  migration  of  Ga  atoms. 

Figure  7(a)  provides  the  upper  limit  of  TD,Si02-  As  in¬ 
dicated  by  white  arrows,  some  islands  similar  to  the  GaAs 
islands  formed  on  unpatterned  Si02  surface  were  observed 
although  their  areal  density  is  not  enough  to  affect  the  con¬ 
tribution  of  migrating  Ga  atoms  in  the  volume  calculation. 
The  formation  of  GaAs  island  on  a  patterned  Si02  surface 
means  that  its  LM  is  no  longer  short  enough  for  NSG.  The 
low  density  implies  initiation  of  island  formation  for  the 
given  Lm.  A  diameter  of  d  ~180  nm  corresponds  to  maxi¬ 
mum  Lm  ~  290  nm.  At  growth  temperature  of  630 °C  with 
growth  rate  of  0.25  MLs  1  therefore,  230  nm  <  LDSi02  <  290 
nm  on  a  330-nm  period  2-D  Si02-pattemed  GaAs(001)  sub¬ 
strate.  This  value  is  much  less  than  LDSi02  ~  4  -  6  pm  in 
[110]  direction  extrapolated  for  similar  growth  conditions 
which  is  generally  less  than  that  in  [1-10]  direction.11  This 
small  LD  Si02  explains  the  absence  of  selective  growth  mode 
in  pm-scale  Si02  mask  patterns.  On  the  other  hand,  this  is 
more  than  twice  the  average  spacing  of  GaAs  islands  on  the 
unpatterned  Si02  surface.  Once  the  Ga  atoms  stick  on  and 
move  from  the  patterned  Si02  surface  to  open  GaAs  surface, 
they  are  no  longer  available  to  play  a  role  in  seeding  a  GaAs 
island  for  additional  arriving  atoms.  This  means  that  the 
steady  state  of  NSG  during  growth  consists  of  a  continuous 
flow  of  Ga  atoms  from  the  patterned  Si02  surface  to  the 
open  GaAs  surface  keeping  the  patterned  Si02  surface  clean 
and  without  seed  atoms.  This  explains  the  realization  of 
NSG  in  spite  of  LDSi02  which  is  greater  than  the  average 
spacing  between  GaAs  islands  observed  from  Fig.  6. 

As  previously  mentioned,  trapezoidal,  triangular,  and  cir¬ 
cular-sectional  shapes  were  observed  in  side-view  SEM.  The 
side-views  shown  in  Fig.  5(a)  and  (c)  correspond  to  Fig.  7(a) 
and  Fig.  3(b),  respectively.  Of  the  various  side-view  shapes, 
the  circular- sectional  shape  of  Fig.  5(a)  seems  to  result  when 
imperfect  dry  etching  gives  noncircular  hole  patterns  for 
small  diameter  holes.  In  case  of  the  mid- size  GaAs  disks 
shown  in  Fig.  5(c),  the  pattern  is  very  close  to  circles  as 
seen  in  Fig.  4  and  no  particular  facets  are  developed  on  the 
side- walls.  Also,  the  trapezoidal  shape  is  dominant  in  this 
case.  However,  a  few  triangular  GaAs  disks  in  side-view  (or 
conical  GaAs),  as  shown  in  Fig.  5(b),  were  observed  among 
the  trapezoidal  ones.  This  implies  that  the  migration  of  Ga 
atoms  from  the  region  near  the  boundary  between  GaAs  and 
Si02  surfaces  to  a  (001)  top  surface  of  the  open  GaAs  region 
generates  trapezoidal  shape  in  side-view  at  the  initial  stage  of 
growth  and  is  continued  until  the  (001)  top  surface  disap¬ 
pears.  If  growth  is  continued,  then  most  of  trapezoidal  GaAs 
disks  will  be  converted  to  triangular  ones.  It  is  interesting 
to  imagine  what  would  happen  if  the  growth  is  continued 
further.  We  deposited  about  100  nm  of  GaAs  on  a  Si02- 
patterned  substrate  similar  to  Fig.  3(a).  It  should  be  noted 
that  the  thickness  of  100  nm  is  about  2.5  times  greater  than 
the  thickness  of  the  Si02  film.  In  this  case,  most  of  GaAs 
disks  have  a  form  of  a  cylindrical  shape  based  on  the  hole 
pattern  with  a  rounded  conical  top.  No  lateral  growth  was 
observed  for  this  100-nm  deposition.  If  rectangular  or  stripe 
type  patterns  which  can  produce  some  particular  facets  dur¬ 
ing  growth  are  employed,  a  nanoscale  lateral  growth  mode 
for  MBE  possibly  could  be  realized. 


In  this  work,  we  enhanced  LD  Si02  by  using  a  low  Ga  flux 
with  high  growth  temperature.  These  conditions,  however, 
induce  a  low  SGa_Si02  different  from  the  sticking  coefficient  of 
Ga  atoms  on  a  GaAs  surface  which  is  almost  unity  under 
our  growth  conditions.14  Basically,  the  low  SGa.Si02  results  in 
an  incomplete  selective  growth  mode  which  results  in  GaAs 
islands  on  the  unpatterned  Si02  surface.  This  also  provides  a 
favorable  situation  for  migration  of  Ga  atoms.  By  fabricat¬ 
ing  a  Si02  mask  with  LM  smaller  than  LDSi02,  perfectly  se¬ 
lective  growth  is  achieved  on  a  nanoscale  Si02-pattemed 
substrate. 

NSG  is  radically  different  from  previously  reported  selec¬ 
tive  growth  techniques  in  several  aspects.  First  of  all,  the 
process  is  very  simple.  Neither  special  wafer  treatments  nor 
sophisticated  lithography  technologies  are  required.  A  single 
Si02  film  on  a  substrate  patterned  by  IL  is  sufficient.  Mi¬ 
gration  enhanced  epitaxy  may  be  helpful  for  enhanced  LDSi02. 
Control  of  the  interference  pattern  in  IL  will  allow  explora¬ 
tion  of  a  wide  range  of  patterns  extending  from  the  mi¬ 
croscale  to  the  nanoscale  regime. 

The  most  significant  potential  of  NSG  is  in  its  applica¬ 
tion  to  heteroepitaxy.  It  is  known  that  the  critical  thickness 
of  a  lattice-mismatched  heterostructure  is  enhanced  if  the 
growth  area  is  reduced.15'18  In  heteroepitaxy,  an  unselectively 
deposited  homogeneous  epilayer  can  be  replaced  by  selec¬ 
tively  grown  2-D  array  of  nanoscale  disks  offering  both 
strain  relief  and  quantum  confinement.  In  this  work,  we  pre¬ 
sented  NSG  with  a  hole  diameter  of  about  -  240  nm.  But, 
IL  and  refined  processing  can  generate  patterns  of  sub- 100- 
nm-diameter  holes,  approaching  the  lateral  dimensions  of 
typical  self-assembled  quantum  dots.19  NSG,  therefore,  sug¬ 
gests  the  possibility  of  growing  various  kinds  of  epilayers 
beyond  their  conventional  critical  thicknesses  and  of  process- 
arranged  quantum  dots  with  well-controlled  size  and  spatial 
distribution.  Although  there  are  still  several  problems  to  be 
overcome  such  as  optimization  of  growth  temperature,  im¬ 
provement  of  lithography  resolution,  and  reliable  process¬ 
ing,  NSG  presents  a  new  frontier  for  strained-layer  MBE 
technology.  Presently,  NSG  of  In/Ja^As  and  In  As  on  a 
GaAs  substrate  with  sub-100-nm  diameter  hole  patterns  is 
underway. 

IV.  Summary  and  conclusion 

Homoepitaxial  nanoscale  selective  growth  (NSG)  on  a 
Si02-pattemed  GaAs(OOl)  substrate  realized  by  interferomet¬ 
ric  lithography  (IL)  has  been  reported.  On  a  Si02  surface  for 
which  the  lateral  dimension  is  less  than  a  Ga  atom  surface 
diffusion  length,  the  Ga  atom  either  desorbs  or  migrates  to 
nearby  GaAs  surfaces  and  is  incorporated  into  a  local  epi¬ 
taxial  growth.  With  NSG,  a  330-nm  period,  two¬ 


dimensional  array  of  GaAs  disks  having  a  height  of  40  nm 
and  a  diameter  of  about  240  nm  has  been  successfully  grown 
on  a  GaAs(001)  substrate  by  MBE.  This  selectivity  is 
achieved  under  the  growth  temperature  of  630° C  with  the 
growth  rate  of  0.25  ML  s'1  where  surface  migration  length 
of  a  Ga  atom  on  Si02  surface  is  extended  up  to  230  -  290 
nm.  Conclusively,  this  work  demonstrates  that  selective 
growth  is  possible  in  MBE  by  understanding  preferential 
migration  of  a  Ga  atom  for  mask  dimensions  less  than  the 
surface  diffusion  length. 
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Single-dot- wide  rows  of  InAs  quantum  dots  (QDs)  aligned  along  a  [01 T]  direction  on  a 
180-nm-period  nanoscale-patterned  (nanopatterned)  GaAs(100)  substrate  are  reported.  The 
nanopatterned  substrate  is  realized  by  interferometric  lithography  along  with  the  selective  growth 
mode  of  GaAs.  Orientation-dependent  migration  and  incorporation  of  In  atoms  from  (lll)A  to  (100) 
facets  on  the  nanopatterned  substrate  localizes  QD  formation  exclusively  along  a  30-40-nm-wide 
(100)  facet  defined  by  neighboring  (lll)A-type  facets  within  each  period.  These  aligned  QDs  form 
face-to-face  multi-QDs  analogous  to  multi-quantum-well  structures,  in  a  one-dimensional 
configuration.  Spatially  controlled  formation  of  QDs  with  an  improved  size  uniformity  on  the 
nanopatterned  substrate  is  presented.  ©  2001  American  Institute  of  Physics. 

[DOI:  10.1063/1.1409947] 


Recent  progress  in  the  field  of  semiconductor  nanostruc¬ 
tures  shows  strong  promise  of  realizing  the  characteristics 
required  for  next-generation  electronic  and  optoelectronic 
devices  and  for  basic  research  in  low-dimensional  physics.  In 
particular,  self-assembled  quantum  dots  (QDs)  have  been  in¬ 
tensively  studied  as  a  result  of  their  fabrication-free  forma¬ 
tion  leading  to  high-quality,  low-dimensional  nanoscale 
structures.1 

In  spite  of  the  advantages  resulting  from  the  self- 
assembled  formation  mechanism,  the  inherent  size  fluctua¬ 
tion  and  random  spatial  distribution  are  critical  issues  that 
are  not  yet  adequately  controlled.  Several  attempts  relying  on 
patterned  growth  have  employed  formation  of  steps  or  par¬ 
ticular  facets,  to  enable  orientation-dependent  migration  and 
incorporation  (ODMI)  of  In  atoms.2-9 

ODMI  is  mass  transport  during  growth  from  one  facet  to 
another  one  caused  by  different  adsorption  kinetics  corre¬ 
sponding  to  alternative  surface  bonding  configurations.  For 
molecular  beam  epitaxy  (MBE),  it  is  known  that  Ga  and  In 
atoms  have  a  property  of  ODMI  from  a  (n  1 1) A -type  facet  to 
a  (100)  facet  (n=  1  or  3),  resulting  in  a  faster  growth  rate  on 
a  (100)  facet  than  on  a  (^11) A -type  facet  when  both  are 
exposed  to  identical  beam  fluxes.10-13  ODMI,  then,  can  also 
affect  QD  formation.  Although  several  investigations  of 
ODMI  on  QD  formation  with  a  patterned  substrate  have  been 
reported,  the  patterns  reported  so  far  have  had  scales  beyond 
the  lateral  dimension  of  typical  self-assembled  QDs,  LSQ. 
Generally,  LSQ~ 20-30nm  for  InAs  QDs  on  a  GaAs 
substrate.1  Recent  developments  in  lithography  now  allow 
the  possibility  of  large-area  nanoscale  patterns  with  critical 
dimensions  approaching  LSQ.  Especially,  large-area  inter¬ 
ferometric  lithography  (IL)  is  emerging  as  a  low-cost,  reli¬ 
able,  and  scalable  technology  for  sub-100-nm  scales.14 

In  this  work,  we  demonstrate  a  nanopatterned 
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GaAs(100)  substrate  fabricated  with  a  180-nm-period  ID 
stripe  pattern  by  IL  followed  by  selective  MBE  growth  to 
form  the  (lll)A-type  facets.  Each  period  of  the  pattern  con¬ 
sists  of  a  30-40-nm-wide  (100)  facet  along  a  [01 F]  direction 
sandwiched  between  (lll)A-type  facets.  We  report  growth  of 
ID  rows  of  InAs  multi-QDs  aligned  on  these  (100)  facets  by 
ODMI.  We  also  present  photoluminescence  (PL)  of  these 
InAs  multi-QDs  and  compare  it  with  PL  of  QDs  randomly 
generated  on  an  unpatterned  substrate. 

The  critical  steps  are:  (1)  to  prepare  a  GaAs  substrate 
patterned  with  nanoscale  (lll)A  and  (100)  facets;  and  (2)  to 
grow  InAs  QDs  self-aligned  to  the  pattern  by  MBE.  Figure  1 
shows  the  major  steps  of  the  process.  A  180-nm-period  ID 
pattern  is  generated  on  a  25-nm-thick  Si02  film  by  IL  and 
dry  etching.  The  pattern  was  aligned  along  the  [011]  direc¬ 
tion  of  the  substrate.  Figure  2  presents  a  45°-tilted  scanning 
electron  microscopy  (SEM)  image  of  a  Si02-patterned  sub¬ 
strate  corresponding  to  Fig.  1(a).  The  width  of  each  Si02 
stripe  shown  in  the  inset  of  Fig.  2  is,  less  than  50  nm,  com¬ 
parable  to  LSq  . 

Formation  of  the  nanopatterned  substrate  is  accom¬ 
plished  by  MBE  growth  of  GaAs  on  the  Si02-patterned  sub¬ 
strate  under  selective  growth  conditions  with  a  zero  sticking 
coefficient  of  a  Ga  atom  on  a  Si02  surface  as  evidenced  by 
the  lack  of  any  GaAs  growth  on  an  unpatterned  Si02  film. 
For  certain  growth  conditions  such  as  high  growth  tempera¬ 
ture  at  extremely  low  growth  rate,  the  adsorption  behavior  of 
a  Ga  atom  on  the  Si02  surface  results  in  a  zero  sticking 
coefficient  while  still  keeping  the  sticking  coefficient  on  the 
GaAs  surface  close  to  unity.  If  the  growth  temperature  is 
above  615  °C  when  the  growth  rate  is  less  than  0.1  mono- 
layer  (ML)  s-1,  we  found  that  the  sticking  coefficient  of  a 
Ga  atom  on  an  unpatterned  Si02  surface  becomes  zero.  To 
build  up  (100)  and  (lll)A  facets  on  the  Si02-patterned  sub¬ 
strate,  selective  growth,  instead  of  chemical  etching,  was  em¬ 
ployed  in  this  work.  As  shown  in  Fig.  1(b),  selective  growth 
results  in  the  formation  of  an  isosceles  triangular  cross  sec¬ 
tion  on  an  open  GaAs  stripe  separated  by  Si02  stripes.  This 
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FIG.  1.  Schematic  diagram  of  the  process  consisting  of  (a)  Si02  patterning 
by  IL  and  dry  etching,  (b)  formation  of  isosceles  triangular- shaped  GaAs  by 
selective  growth,  and  (c)  formation  of  QDs  self-aligned  on  a  nanopatterned 
substrate  prepared  in  (b)  at  cross-sectional  view. 
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is  because  Ga  atoms  incident  near  the  boundary  between 
GaAs  and  Si02  surfaces  move  to  an  inner  side  of  the  open 
GaAs  stripe  to  find  a  more  stable  adsorption  site.10  The  re¬ 
sulting  sidewalls  developed  by  this  movement  are  (lll)A- 
type  facets.  Therefore,  selective  growth  produces  (lll)A- 
type  facets  keeping  the  (100)  facet  protected  by  the  Si02 
stripe  mask. 

To  completely  form  (lll)A-type  facets,  deposition  cor¬ 
responding  to  42  nm  of  GaAs  was  performed  on  the 
Si02-patterned  substrate  of  Fig.  2  at  the  growth  conditions 
mentioned  above.  Dipping  the  as-grown  sample  in  HF  sim¬ 
ply  removes  the  Si02  stripe  mask  and  the  cross-sectional 
SEM  image  shown  in  Fig.  3(b)  reveals  an  isosceles  triangu¬ 
lar  cross  section  as  predicted.  In  Fig.  3(b),  the  widths  of 
nanoscale  (lll)A  sidewalls  and  (100)  bottom  facets  are  about 
90-110  and  30-40  nm,  respectively.  These  distances  are 
much  less  than  the  reported  surface  migration  length  of  an  In 
atom  under  similar  growth  conditions  and  enable  ODMI 
across  the  entire  (lll)A-type  facets.11,12  The  45°-tilted  SEM 
image  shown  in  Fig.  3(a)  reveals  that  sidewalls  along  the 
pattern  are  not  perfect  (lll)A  type  because  of  nonuniformi¬ 
ties  of  the  Si02  pattern  but  are  sufficient  to  activate  ODMI  as 
seen  below. 

On  the  nanopatterned  substrate  shown  in  Fig.  3,  deposi¬ 
tion  of  InAs  was  performed  at  510  °C  with  a  nominal  growth 
rate  of  0.05  MLs-1  so  that  InAs  QDs  are  formed  only  on 
(100)  facets  by  ODMI  as  shown  in  Fig.  1(c).  The  total 
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FIG.  2.  45°-tilted  SEM  image  of  a  Si02-patterned  GaAs(100)  substrate.  The 
inset  shows  30-40-nm-wide  Si02  mask  stripes  comparable  to  LSQ  . 
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FIG.  3.  (a)  45°-tilted  and  (b)  cross-sectional  SEM  image  of  a  nanopatterned 
GaAs(100)  substrate  prepared  by  selective  growth. 

amount  of  InAs  supplied  corresponds  to  about  1.5-2. 5  ML. 
Figure  4  shows  atomic-force  microscopy  (AFM)  images  ob¬ 
tained  from  the  as-grown  sample.  As  shown  in  Fig.  4(a), 
InAs  QDs  were  generated  but  aggregated  only  along  the  bot¬ 
tom  faces.  No  QDs  were  located  on  the  sidewalls.  This  con¬ 
firms  that  mass  transport  of  In  atoms  from  (lll)A  to  the 
(100)  facet  due  to  ODMI  occurs  during  growth. 

The  dot  density  across  the  entire  area  measured  from 
Fig.  4(a)  is  about  1.2±0.2X  1010cm~2.  This  value  is  about 
40% -50%  of  dot  densities  of  random-generated  QDs  on  a 
unpatterned  substrate  grown  under  the  same  growth  condi¬ 
tions  reported  in  patterned  growth.  ODMI  effectively  re¬ 
stricts  the  area  available  for  QD  formation  to  the  (100)  facet 
within  each  period.  In  Figs.  4(a)  and  4(b),  most  of  the  QDs 
seem  to  be  squeezed  by  adjacent  sidewalls.  These  QDs  are 
apparently  very  close  to  each  other  and  build  up  ID  rows  of 
multi-QDs  analogous  to  multi-quantum- well  structures.  Fig¬ 
ure  4(b)  reveals  that  most  of  the  aligned  QDs  of  Fig.  4(a) 
have  AFM-convolved  lateral  dimensions  in  the  range  of 
30-33  nm  which  is  highly  uniform  compared  with  typical 
random-generated  QDs  on  an  unpatterned  substrate.  In  the 
sideviews  shown  in  Figs.  4(c)  and  4(d),  the  aligned  QDs 
have  a  height  of  about  5-7  nm,  almost  the  same  as  that  of  a 
typical  QD  grown  on  a  unpatterned  substrate.1 

Room-temperature  PL  of  the  aligned  QDs  on  a  nanopat¬ 
terned  substrate  and  of  randomly  generated  QDs  on  an  un¬ 
patterned  substrate,  grown  at  the  same  growth  conditions,  are 
presented  in  Fig.  5.  These  PL  spectra  were  measured  with 
He-Ne  laser  excitation  (633  nm)  and  a  77-K-cooled  Ge  de¬ 
tector.  In  Fig.  5,  the  aligned  QDs  on  the  nanopatterned  sub¬ 
strate  have  a  lower  integrated  PL  intensity  than  the  randomly 
generated  QDs  on  the  unpatterned  substrate,  supporting  the 
lower  dot  density  on  the  nanopatterned  substrate  previously 
mentioned.  PL  spectra  from  the  aligned  QDs  and  the  ran¬ 
domly  generated  QDs  of  Fig.  5  have  peak  wavelengths  at 
1559  and  1564  nm  or  0.795  and  0.793  eV,  respectively.  It  is 
known  that  air-exposed  InAs  QDs  are  under  lower  compres¬ 
sion  than  capped  ones  and  have  a  PL  peak  around  these 
values  and  also  have  a  relatively  large  full  width  at  half 
maximum  (FWHM)  compared  with  capped  QDs.15  However, 
it  should  be  noted  that  the  FWHM  of  the  PL  spectrum  of 

QDs  on  a  nanopatterned  substrate  is  about  97  meV,  which  is 
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FIG.  5.  Room-temperature  PL  spectra  of  the  aligned  QDs  on  a  nanopat- 
terned  substrate  and  the  randomly  generated  QDs  on  an  unpatterned  sub¬ 
strate. 

In  this  work,  we  have  reported  ID  single-dot- wide  rows 
of  InAs  QDs  aligned  along  the  [011]  direction  on  a  180-nm- 
period  nanopatterned  GaAs(001)  substrate  prepared  by  IL 
and  selective  MBE.  QD  formation  forced  by  ODMI  is  exclu¬ 
sively  observed  along  a  30-40-nm-wide  (100)  facet  defined 
by  neighboring  (lll)A-type  facets  within  each  period.  These 
aligned  QDs  form  face-to-face  multi-QDs  in  a  ID  configu¬ 
ration  suggesting  the  possibility  of  dot-to-dot  coupling.  They 
also  show  improved  size  uniformity  in  AFM,  which  is  con¬ 
firmed  by  a  26%  reduced  FWHM  of  the  PF  spectrum  com¬ 
pared  with  that  of  randomly  generated  QDs. 
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FIG.  4.  AFM  images  of  aligned  QDs  (a)  with  2.5  jam  scan  range,  and  (b) 
with  its  5  X  magnification.  Sideviews  of  the  AFM  image  of  (b)  seen  from  (c) 
the  top  and  (d)  the  bottom. 

26%  less  than  the  FWHM  of  131  meV  of  the  PF  for  ran¬ 
domly  generated  QDs  on  an  unpatterned  substrate.  Reduc¬ 
tion  of  the  FWHM  generally  implies  improved  size  unifor¬ 
mity  and  confirms  the  AFM  result  of  Fig.  4. 

Although  most  of  the  aligned  QDs  in  the  ID  single  row 
of  Fig.  4(a)  appear  adjacent,  the  tip-limited  resolution  of  the 
AFM  does  not  allow  precise  measurement  of  nearest- 
neighbor  distances.  It  is  not  clear  whether  they  are  close 
enough  to  expect  coupling  between  adjacent  QDs. 

The  reduced  FWHM  of  Fig.  5  suggests  an  advantage  for 
aligned  QDs  grown  on  a  nanopatterned  substrate  in  device 
applications  lowering  the  threshold  for  laser  diodes.  Figures 
4  and  5,  therefore,  show  important  experimental  results  that 
ODMI  spatially  forces  QD  formation  to  be  aligned  along 
30-40-nm-wide  (100)  facets  and  that  those  aligned  QDs 
form  ID  multi-QDs  on  a  nanopatterned  substrate  with  im¬ 
proved  size  uniformity. 
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Introduction  to  the  Feature  Section  on  Growth  of 
Heterostructure  Materials  on  Nanoscale  Substrates 


HETEROSTRUCTURED  materials  are  important — and 
now  often  indispensable — enablers  of  a  wide  range  of 
electronic  and  optoelectronic  semiconductor  devices.  Hetero¬ 
junctions  play  a  dominant  role  in  quantum  effect  devices  by 
providing  carrier  and  photon  confinement,  enabling  tunneling 
transport,  and  providing  a  broad  gain  spectrum. 

The  papers  in  this  Feature  Section  provide  a  snapshot  of  some 
new  approaches  to  heteroepitaxy  that  exploit  micro-  and  nano¬ 
scale  features,  created  by  both  “bottom-up”  self-assembly  and 
“top-down”  lithographic  techniques.  These  approaches  promise 
further  significant  device  improvements  by  relaxing  the  tradi¬ 
tional  restrictions  imposed  by  defect  formation  in  non-lattice- 
matched  material  systems  and  they  will  eventually  allow  highly 
mismatched  semiconductors  to  be  integrated  monolithically. 

Drucker  discusses  the  growth  of  self-assembled  Ge:Si 
quantum  dots,  investigating  both  the  morphological  and 
optical/electronic  properties  as  a  function  of  growth  param¬ 
eters.  Ichikawa  investigates  ^10-nm  nanostructures,  in  the 
same  material  system,  that  are  defined  using  ultrathin  SiC>2 
technology.  Buttard  et  al.  also  investigate  the  Ge:Si  system 
using  molecular  hydrophobic  bonding  of  ultra-thin  Si  layers 
to  provide  self-assembled  nanoscale  dislocation  networks, 
confined  to  the  interfacial  region,  leading  to  a  periodic  strain 
field  that  extends  to  the  growth  interface  and  controls  the 
nucleation  of  the  Ge  quantum  dots.  Roskowski  et  al.  extend 
the  understanding  of  pendeo-epitaxy  for  the  GaN:SiC  system 


to  achieve  marked  reductions  in  the  dislocation  density  in  this 
technologically  important  system.  Hersee  et  al.  investigate  the 
use  of  nanoheteroepitaxy — growth  on  lithographically  defined 
nanoscale  seeds,  taking  advantage  of  both  strain  partitioning 
between  the  growing  film  and  the  nanoscale  substrate  and  the 
local  presence  of  a  free  surface  at  the  nanoscale  boundaries,  to 
modify  defect  formation  in  the  GaAs:Si  and  GaN:Si  systems. 

These  approaches  are  evolving  and  maturing.  As  they  move 
from  materials  science  studies  to  device  structures,  they  promise 
to  improve  the  performance  of  existing  devices  and  to  offer  new 
capabilities  and  new  functionality  in  electronics  and  optoelec¬ 
tronics. 
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Abstract — We  describe  an  ongoing  study  of  nanoheteroepitaxy 
(NHE),  the  use  of  nanoscale  growth-initiation  areas  for  the  inte¬ 
gration  of  highly  mismatched  semiconductor  materials.  The  con¬ 
cept  and  theory  of  NHE  is  briefly  described  and  is  followed  by  a 
discussion  of  the  design  and  fabrication  by  interferometric  lithog¬ 
raphy  of  practical  sample  structures  that  satisfy  the  requirements 
of  NHE.  Results  of  NHE  growth  of  GaAs-on-Si  and  GaN-on-Si  are 
described,  following  the  NHE  process  from  nucleation  through  to 
coalescence.  Micro-Raman  measurements  indicate  that  the  strain 
in  partially  coalesced  NHE  GaN-on-Si  films  is  <0.1  GPa. 

Index  Terms — Coalescence,  GaN,  MOCVD,  nanoheteroepitaxy, 
Raman,  selective  growth. 


I.  Introduction 

EPITAXIAL  crystal  growth  has  developed  from  a  par¬ 
adigm  of  large-area  film  growth,  where  the  formation 
and  propagation  of  defects  in  the  growing  film  has  restricted 
the  materials  used  to  nearly  lattice  matched  systems.  There 
has  been  a  recent  increase  in  interest  in  selective- area  and 
lateral  overgrowth  on  patterned  substrates,  [l]-[4]  because  of 
the  potential  for  fabricating  low-defect  density  heterostruc- 
tures  [5],  [6].  Of  particular  interest  are  materials  such  as  GaN, 
for  which  lattice  matched  substrates  are  not  available.  The 
integration  of  such  materials  [7]  has  a  huge  potential  payoff, 
for  example,  in  increasing  the  functionality  of  Si-based  inte¬ 
grated  circuits  through  the  integration  of  III-V  based  optical 
and  electronic  device  technologies.  Selective- area  growth  also 
offers  new  solutions  to  difficult  processing  challenges,  such 
as  contacting  the  base  region  of  Al/GaN/GaN  heterojunction 
bipolar  transistors  [8]. 

Although  most  research  on  patterned  growth  has  concentrated 
on  patterning  on  the  <l-/xm  scale,  recent  theoretical  [9],  [10] 
and  experimental  results  [11]  indicate  that  novel  strain  relief 
mechanisms  are  available  when  the  substrate  is  patterned  at 
the  nanoscale.  These  mechanisms,  active  when  the  scale  of  the 
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Fig.  1.  Nanoheteroepitaxy  concept.  Epilayer  nucleates  selectively  onto 
nanoscale  substrate  islands  and  coalescence  occurs  once  strain  free  material 
is  obtained. 

nanostructure  approaches  the  critical  thickness,  offer  new  ap¬ 
proaches  for  dealing  with  the  stress  associated  with  lattice  and 
thermal  mismatches  [12].  Nanometer- scale  structures  are  also 
of  interest  in  their  own  right.  Quantum-dot  structures  are  being 
investigated  particularly  as  gain  media  for  semiconductor  lasers 
[13],  and  nanoscale  patterning  [14]  provides  a  potential  route  to 
an  ordered,  monosized  distribution  of  dots  that  will  have  impor¬ 
tant  device  implications. 

This  paper  discusses  an  ongoing  study  of  nanoheteroepitaxy 
(NHE)  [10],  which  exploits  the  three  dimensional  strain  relief 
mechanisms  that  are  available  to  nanoscale  nuclei,  in  a  practical 
and  scalable  technology.  Crystal  growth  on  patterned  substrates 
[15]  is  quite  different  from  that  observed  on  planar  surfaces  [16] 
and  much  remains  to  be  understood.  In  patterned  substrates, 
three-dimensional  (3-D)  nuclei  are  spatially  ordered  with  a 
fixed  density  as  opposed  to  the  random  distribution,  in  both 
nucleus  size  and  nearest  neighbor  distance,  that  is  characteristic 
of  self  assembled  planar  samples.  Faceting  of  the  nuclei  is 
mainly  determined  by  surface  energy  and  not  interfacial  energy, 
and  coarsening  due  to  Ostwald  ripening  is  less  frequently 
observed  in  ordered  arrays.  Coalescence  of  the  nuclei,  which 
is  necessary  to  obtain  useful  macroscopic  areas  of  epilayer,  is 
critically  dependent  on  the  initial  faceting  behavior.  In  NHE, 
coalescence  occurs  late  in  the  growth  stage,  after  the  nuclei 
have  grown  sufficiently  large  to  contact  neighboring  islands, 
rather  than  by  the  physical  movement  of  small  islands  [17].  We 
show  later  that  even  when  there  is  a  large  lattice  mismatch 
an  excellent  coalescence  can  be  achieved.  However,  if  the 
structure  is  nonuniform  competing  facets  can  cause  growth 
instabilities  and  lead  to  rough  morphologies  [1],  [18]. 
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(a)  (b)  (c) 

Fig.  2.  (a)  SOI-based  NHE  sample,  (b)  Patterned  oxide  NHE  substrate,  (c)  Plan  view  of  grouped  nanoscale  pattern,  where  within  the  group  each  nucleus  is  at 

nanoscale  and  during  growth  there  is  coalescence  between  members  of  the  same  group.  However,  the  large  group-to-group  spacing  means  that  neighboring  groups 
do  not  coalesce. 


A  facile,  large-area  nanopatterning  technique  is  a  require¬ 
ment  of  NHE.  Interferometric  lithography  [19]  provides  large 
areas  of  nanostructured  material  (scales  as  small  as  10  nm  and 
densities  to  1011  cm-2  over  areas  of  >10  cm2)  with  a  large 
flexibility  in  the  pattern  characteristics  (size,  density,  array,  and 
nanoelement  symmetry,  area,  etc.).  The  current  state-of-the-art 
in  nanopatterning  is  discussed  along  with  projections  for  devel¬ 
oping  smaller  features  and  denser  arrays  for  NHE  substrates. 

II.  NHE 

NHE  is  a  novel  approach  [10]  to  the  heteroepitaxy  of  highly 
mismatched  materials.  NHE  uses  selective  epitaxy  to  nucleate 
nanosize  epitaxial  islands  onto  a  substrate  that  is  patterned 
with  nanoscale  features  (Fig.  1).  The  growth  is  selective  and 
the  small  size  of  the  growth  nucleus  allows  both  the  substrate 
and  epilayer  to  deform  in  three  dimensions  in  response  to 
lattice  mismatch  stress.  The  stress  and  strain  associated  with 
lattice  mismatch  at  the  heterojunction  decays  exponentially 
away  from  the  heterointerface  [9]  with  a  characteristic  length 
on  the  order  of  the  diameter  of  the  nanoscale  nucleus.  The  strain 
energy  associated  with  the  growing  film  saturates  with  film 
thickness  as  the  film  reaches  its  unstressed  lattice  parameters; 
if  this  saturation  value  can  be  kept  below  the  energy  required 
for  a  defect  to  form,  strain-free  and  defect-free  material  can 
be  grown.  In  addition,  the  local  availability  of  a  free  surface  at 
the  edges  of  the  nanosize  islands  modifies  the  defect  formation 
and  propagation  kinetics  and  can,  in  some  cases,  confine  the 
defects  to  a  local  region  at  the  initial  interface  in  the  epitaxial 
structure  [11]. 

III.  NHE  Substrate  Preparation 

Fig.  2(a)  shows  the  sample  structure  used  in  our  initial  NHE 
experiments,  based  on  an  silicon-on-insulator  (SOI)  substrate 
where  the  top  silicon  layer  is  thinned  to  <100  nm  and  then 
patterned  using  interferometric  lithography  (described  later). 
During  MOCVD  growth  of  GaAs  or  GaN  films,  the  Si02 
in  the  SOI  provides  excellent  growth  selectivity,  allowing 
growth  only  on  the  Si  islands.  For  this  structure,  the  substrate 
islands  are  patterned  at  nanoscale  and  NHE  theory  predicts 
that  a  significant  portion  (approximately  half)  of  the  mismatch 


strain  energy  will  be  taken  up  by  the  substrate  [10].  While  this 
SOI-based  NHE  sample  structure  represents  an  ideal  substrate 
in  terms  of  the  practical  implementation  of  NHE  theory, 
high-quality  (111)  SOI  (as  required  for  GaN  growth)  is  much 
less  readily  available  than  bulk  (111)  Si. 

Recently,  we  have  evaluated  an  alternative  NHE  substrate 
structure  shown  in  Fig.  2(b).  Here,  a  thermal  oxide  is  grown  onto 
a  bulk  silicon  substrate  and  then  patterned  using  interferometric 
lithography.  This  structure  offers  the  key  properties  required 
for  NHE,  Si  versus  Si02  growth  selectivity  and  a  nanoscale 
growth  nucleus;  however,  the  substrate  compliance  is  reduced  to 
essentially  zero  because  it  is  bulk  material.  We  anticipate  that,  in 
this  case,  the  mismatch  strain  energy  will  be  accommodated 
almost  entirely  by  the  epitaxial  layer  and  this  structure  is 
equivalent  to  that  first  discussed  by  Luryi  and  Suhir  [9]. 
While  this  structure  will  exhibit  higher  strain  energy  than 
that  shown  in  Fig.  2(a),  it  has  an  important  advantage  in  that 
it  uses  standard  silicon  processing  and  is  straightforward  to 
fabricate  over  large  substrate  areas.  The  modification  of  the 
defect  formation  due  to  the  local  availability  of  free  surfaces 
remains  active.  Furthermore,  this  approach  allows  NHE  to  be 
performed  on  refractory  substrates  such  as  SiC  and  sapphire, 
which  are  difficult  to  pattern  directly. 

A  coalesced  NHE  epilayer  will  also  exhibit  strain  due  to 
thermal  mismatch  between  the  epilayer  and  substrate.  In  many 
cases,  e.g.,  GaN  on  Si,  this  is  sufficient  to  crack  the  epilayer.  We 
show  later  that  the  effects  of  thermal  strain  can  be  mitigated  by 
restricting  the  size  of  the  heteroepitaxial  area.  This  anticipates 
the  eventual  application,  where,  for  example,  a  group  of  III-V 
devices  with  a  limited  footprint,  is  fabricated  within  a  larger 
Si  CMOS  circuit.  To  achieve  this  limited  area,  we  are  also 
evaluating  NHE  structures  of  the  type  shown  in  Fig.  2(c).  Here, 
the  nanoscale  nuclei  are  grouped  and  the  number  of  nuclei 
within  each  group  may  be  as  small  as  one.  Each  nucleus  within 
the  group  is  nanosize,  providing  the  enhanced  strain  relief  of 
the  NHE  approach,  and  within  the  group  there  is  coalescence. 
However,  the  group-to-group  spacing  is  made  large  such  that 
at  the  end  of  growth  the  groups  remain  separate  (they  do  not 
coalesce).  Raman  measurements  of  strain  in  such  structures 
(presented  later)  indicate  that  there  is  significantly  less  thermal 
mismatch  strain  in  this  partially  coalesced  NHE  structure. 
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A.  Interferometric  Lithography  for  Nanoscale 
Substrate  Preparation 

A  clear  requirement  for  nanoheteroepitaxy  is  the  de¬ 
velopment  of  a  facile,  scalable  and  inexpensive  nanoscale 
lithographic  technique.  Nanoscale  features  are  required  over 
the  large  areas  associated  with  crystal  growth,  up  to  300-mm 
diameter  wafers  in  the  case  of  Si  and  SOL  There  is  a  broad 
parameter  space  to  investigate:  not  only  the  dimensions  of  the 
nm- scale  features,  their  density,  the  array  pattern  symmetry 
and  the  nanoelement  symmetry;  but  also  the  details  of  the 
nano-element  structure,  the  grouping  of  nanoelements,  the  un¬ 
derlying  layer  structure,  alternatives  for  passivation  at  the  edges 
of  the  nano-elements,  etc.  must  be  explored.  Coupled  with  the 
vast  parameter  space  inherent  in  epitaxial  growth  techniques, 
the  need  for  large  numbers  and  variety  of  samples  is  evident. 
This  argues  for  a  nanolithography  technique  that  both  covers 
large  areas  and  is  easy  to  adapt  to  new  patterns  and  feature 
sizes.  As  nanoheteroepitaxy  matures  and  impacts  devices,  the 
demands  of  manufacturing  will  impose  new  constraints  on  any 
nanolithography  technique. 

Interferometric  lithography  (IL)  uniquely  addresses  all  of 
these  requirements  [19].  In  this  section,  a  brief  introduction 
to  the  accessible  pattern  space  is  presented  along  with  a  com¬ 
parison  with  other  possible  nanolithography/nanopatterning 
approaches.  IL  is  a  maskless  process  that  uses  interference 
between  coherent  laser  beams  to  define  a  pattern.  An  in¬ 
terference  pattern  between  two  beams  provides  a  periodic 
one-dimensional  (1-D)  or  grating  pattern.  Additional  exposures 
with  the  wafer  rotated  relative  to  the  grating  direction  provide 
the  two-dimensional  (2-D)  patterns  often  required  for  NHE. 
Alternatively,  multiple  laser  beams  can  be  used  to  write  the 
pattern  in  a  single  exposure  [20]. 

In  order  to  understand  the  fundamental  limits  of  IL,  it  is 
simplest  to  begin  by  analyzing  1-D-grating  patterns.  The 
fundamental  relation  describing  any  interferometric  process 
(including  conventional  optical  imaging)  is  that  the  pattern 
period  p  is  given  by: 

p  —  X/2sm(0)  (1) 

where  A  is  the  optical  wavelength  and  20  is  the  angle  between 
the  two  incident  beams  with  the  normal  to  the  wafer  parallel  to 
the  bisector  between  the  incident  optical  wavevectors.  Large 
incident  angles  of  80°  to  85°  are  realizable  in  practice  so  the 
limiting  period  is  very  close  to  A/2.  Note  that  this  corresponds 
to  an  optical  system  with  an  AL4(=sin#)  of  0.96  to  0.98, 
much  closer  to  unity  than  is  possible  with  achievable  optics 
in  an  imaging  system.  Fig.  3  shows  a  180-nm  period  1-D 
grating  pattern  consisting  of  ^40-nm  wide  SiC>2  stripes  on 
a  GaAs  substrate  written  using  a  355-nm  frequency-tripled 
YAG  laser  source.  A  reactive-ion  etch  process  was  used  to 
transfer  the  pattern  from  the  photoresist  into  the  oxide  layer. 

For  all  of  the  experiments  reported  in  this  paper,  we  used 
wavelengths  of  either  364  nm  (Ar-ion  laser)  or  355  nm  (third 
harmonic  of  a  YAG  laser),  which  are  compatible  with  commer¬ 
cial  I-line  photoresists.  Shorter  wavelength  coherent  sources  are 
available,  in  particular  at  244  nm  (a  doubled  Ar  laser  matched  to 
commercial  248-nm  resists  for  KrF  laser  sources)  and  at  213  nm 
(fifth-harmonic  of  YAG  matched  to  193  nm  resists  being  devel¬ 
oped  for  ArF  lithography  laser  sources).  Use  of  these  sources 


Fig.  3.  Example  of  limits  of  IL:  1 80-nm  period  Si02  on  GaAs  grating  using  a 
355  nm  laser  source  corresponding  to  an  NA  of  0.986  (after  [5]). 

will  allow  a  corresponding  reduction  in  the  period  or  increase  in 
the  pattern  density.  Further  reductions  can  be  realized  by  using 
immersion  techniques,  which  reduce  the  achievable  period  by 
an  extra  factor  of  1/n,  where  n  ~  1.5  is  the  refractive  index  of 
the  immersion  fluid  to  [21,  Eq.  (1)].  Immersion  has  been  demon¬ 
strated  for  wavelengths  as  short  as  157  nm  [22].  While  there  are 
practical  issues  in  adapting  a  full  optical  system  for  immersion, 
for  IL  which  only  involves  a  limited  range  of  angles  of  inci¬ 
dence,  the  optics  is  quite  straightforward. 

Another  important  result  to  take  from  Fig.  3  is  that  while  the 
period  is  limited  by  the  optical  wavelength,  there  is  no  such  limit 
on  the  smallest  feature  dimensions  (often  called  the  critical  di¬ 
mension  or  CD  in  lithography).  In  the  case  of  Fig.  3,  the  CD 
is  the  oxide  linewidth  of  30-40  nm.  These  smaller  features  are 
possible  as  a  result  of  the  nonlinear  response  of  the  photoresist 
and  of  other  processing  steps  which  introduce  higher  harmonics 
of  the  fundamental  frequency  in  the  optical  exposure.  The  resist 
response  is  inherently  square  law,  responding  to  intensity  not 
to  electric  field.  In  addition,  modern  commercial  resists  have 
been  engineered  to  have  highly  nonlinear  development  char¬ 
acter,  very  close  to  a  thresholding  behavior.  The  transition  from 
unexposed  to  fully  exposed  occurs  over  a  narrow  dose  range. 
This  provides  process  latitude  for  the  semiconductor  industry, 
and  allows  us  to  achieve  small  features  with  almost  vertical  side- 
walls.  Additional  nonlinearities  are  available  in  the  processes 
such  as  reactive-ion  etching  used  to  transfer  the  photoresist  pat¬ 
terns  into  hard  layers  such  as  the  oxide  layer  shown  in  Fig.  3. 
The  semiconductor  industry  routinely  takes  advantage  of  these 
nonlinearities  in  its  manufacturing  processes. 

The  pattern-array  symmetry  can  be  controlled  by  adjusting 
the  IL  parameters  including  the  number  of  beams  and  the 
number  of  exposures.  Fig.  4(a)  and  (b)  show  examples  of  both 
square  and  hexagonal  pattern  symmetries  achieved  by  two  IL 
exposures  with  the  wafer  rotated  between  exposures  by  90°  for 
the  square  and  60°  for  the  hexagonal  symmetry.  These  were 
achieved  over  areas  of  ^4  cm2 . 

Controlling  nano-element  symmetry  requires  additional  pro¬ 
cessing  steps.  Fig.  4(c)  shows  an  example  of  ^100  x  100  nm 
square  holes  in  photoresist  that  was  printed  with  a  double  expo¬ 
sure,  double-develop  technique  [23].  Other  possibilities  involve 
transferring  partial  patterns  into  sacrificial  layers  such  as  oxide 
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Fig.  4.  Examples  of  2-D  pattern  arrays  at  a  period  of  360  nm  achieved  by  rotation  of  the  wafer  by:  (a)  90°  for  the  square  pattern  and  (b)  60°  for  the  hexagonal 
pattern.  Feature  sizes  are  much  less  than  the  optical  wavelength  due  to  nonlinearities  the  exposure  and  pattern  transfer  steps,  (c)  Square  array  of  100-nm  CD  square 
via  holes  printed  by  a  double  exposure/double  development  technique  (after  [24]). 


TABLE  I 

Aspects  of  Nanostructure  Patterning  Controllable  by  Interferometric  Lithography 


Nanoisland 

Parameter 

NHE  Impact 

IL  approach 

Accessible 

Range 

Pattern 

periodicity 

Film  coalescence 

Incident  angles  [p  =  ?J[2nsm(6)]}, 
nonlinear  IL  &  integration  of  opti¬ 
cal  and  interferometric  litho¬ 
graphies 

>  70  nm 

Size 

Island  growth,  strain 
partitioning,  active 
compliance 

Vary  exposure  flux,  use  photoresist 
and  other  process  nonlinearities. 

to  20  nm 

Pattern 

symmetry 

Film  coalescence 

Number  and  orientation  of  beams 

Rectangular, 
hexagonal, . . . 

Nano-element 

symmetry 

Island  growth,  de¬ 
fect  formation 

Multiple  exposures,  hard  masks, 
additional  nonlinearities. 

Circular, 
square,  hex¬ 
agonal.,  ... 

Localized 
growth  areas 

Thermal  expansion 
strain,  integration 
with  Si  circuitry 

Imaging  IL,  integration  of  conven¬ 
tional  optical  and  IL,  hard  masks. 

Fully  arbitrary 
to  ~  A/6 

or  nitride  films  between  IL  steps.  (These  techniques  were  not 
necessary  to  fabricate  the  samples  reported  here.) 

Isolated  areas  of  nanostructures  can  be  generated  by  com¬ 
bining  IL  with  conventional  optical  lithography  to  define  larger 
areas  or  by  using  Moire  techniques;  a  Moire  example  will  be 
illustrated  in  the  GaN  growth  described  later.  More  sophis¬ 
ticated  structures  are  possible  with  imaging  interferometric 
lithography  (IIL)  that  allows  definition  of  arbitrary  patterns  (as 
in  imaging)  with  a  resolution  approaching  that  of  interfero¬ 
metric  lithography  [24].  We  have  demonstrated  dense  features 
at  a  resolution  of  A/2.  The  theoretical  limit  is  ~A/3.  Nonlinear 
techniques  extend  this  range  to  A/6.  Table  I  summarizes  the 
capabilities  and  limitation  of  IL  for  the  definition  of  NHE 
substrates. 

As  NHE  matures  and  impacts  device  capabilities,  any  nano- 
lithographic  technique  will  have  to  be  adapted  to  the  volume 
and  cost  requirements  of  manufacturing.  Because  of  the  large 
area  capabilities  of  IL  and  the  similarity  to  traditional  optical 
lithography,  IL  is  scalable  to  manufacturing  [25].  Imprint  lithog¬ 
raphy  is  also  an  attractive  candidate  for  this  role  [26].  Other  ad¬ 
vanced  lithographic  tools  such  as  157-nm,  F2 -laser-based  step¬ 
pers,  extreme  ultraviolet  steppers,  and  e-beam  lithography  tools 
are  likely  to  be  too  expensive  for  this  application. 


IV.  NHE  OF  GaAs-ON-Si  AND  GaN-ON-Si 

Fig.  5  shows  the  calculated  total  strain  energy/unit  area  as 
a  function  of  epilayer  thickness  for  two  materials  systems,  (a) 
GaAs  on  (001)  Si  with  an  epitaxial  alignment  [100]  GaAs,  par¬ 
allel  to  [100] si-  Here  the  lattice  mismatch  in  the  epitaxial  plane 
is  4%.  (b)  GaN  on  silicon  where  the  (lll)si  direction  is  par¬ 
allel  to  the  [0001] GaN  direction.  In  this  case  the  lattice  mismatch 
in  the  basal  plane  is  22%.  Four  heteroepitaxy  approaches  are 
plotted  for  each  case.  The  superior  strain  relief  offered  by  NHE 
is  evident  as  NHE  yields  the  lowest  strain  energy  for  any  epi¬ 
layer  thickness  in  both  material  systems.  The  energy  required 
to  create  a  screw  dislocation  [27]  is  also  plotted.  In  the  case  of 
GaAs/Si  with  a  growth  island  diameter  of  40  nm,  the  NHE  strain 
energy  remains  below  the  dislocation  formation  energy  for  all 
epilayer  thicknesses,  suggesting  that  any  thickness  of  GaAs  can 
be  grown  without  dislocations  on  islands  of  <40-nm  diameter. 
The  other  growth  approaches  modeled  in  Fig.  5  are: 

•  planar,  conventional  growth  on  an  unpatterned,  thick  Si 
wafer; 

•  compliant,  growth  on  a  40-nm-thick  Si  wafer; 

•  nanoscale — patterned  growth  where  only  the  epilayer  (not 
the  substrate)  undergoes  3-D  strain. 
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Fig.  5.  Strain  energy  versus  epilayer  thickness  for  NHE  and  other  growth  approaches  (after  [10])  for  two  materials  systems:  (a)  GaAs-on-Si  and  (b)  GaN-on-Si. 
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Fig.  6.  (a)  Plan  view  SEM  micrograph  of  GaAs  nucleation  on  Si  islands  at  four  different  temperatures,  (b)  Arrenhius  plot  of  average  distance  between  GaAs 

islands  versus  inverse  temperature,  (c)  SEM  micrograph  showing  NHE  GaAs  nuclei  grown  at  605  °C. 


All  of  these  are  above  the  screw  dislocation  energy,  suggesting 
that  dislocations  are  inevitable  for  this  island  diameter. 

The  large  mismatch  in  the  GaN/Si  system  means  that  the 
strain  energy  in  the  epilayer  is  always  above  the  energy  required 
to  create  a  screw  dislocation,  and  we  do  not  expect  to  eliminate 
mismatch  defects  by  NHE  strain  management.  However,  the 
unique  substrate/epilayer  interface  of  the  NHE  approach  con¬ 
tains  extensive  free  surface  area  which  offers  opportunities  for 
defect  termination.  There  is  prior  evidence  that  the  density  of 
mismatch  defects  in  large  mesa  structures  can  be  reduced  when 
these  defects  are  allowed  to  diffuse  to  the  mesa  edge  [5],  [6]. 
For  GaN,  where  the  commonly  observed  defects  are  c-oriented 
edge  dislocations  [28]  which  have  a  1/3  [1120]  Burger’s  vector 
in  the  basal  plane,  it  is  expected  that  these  defects  will  glide 
to  the  edge  of  a  nanoscale  growth  nucleus.  In  an  NHE  sample, 
where  each  nucleus  is  disconnected  from  its  neighbor,  a  defect 
can  terminate  at  the  edge  of  the  nucleus.  However,  we  do  not  ex¬ 
pect  this  type  of  defect  termination  to  be  possible  during  growth 
on  a  porous-Si  substrate,  where  there  is  a  crystalline  connection 
through  the  substrate  skeleton  and  where  the  Burger’s  vector 
must  be  conserved  across  the  whole  layer. 


The  NHE  approach  is  quite  generic  and  is  not  limited  to  these 
two  materials  systems.  Amongst  other  heteroepitaxial  materials 
combinations  of  interest  are:  Sii-^Ge^/Si,  GaN/sapphire, 
GaN/SiC,  Gai-^In^As/InP,  and  B12As2/SiC.  NHE  theory 
predicts  that  lattice  mismatch  defect  elimination  should  be 
possible  for  mismatch  strains  up  approximately  4%.  However, 
as  we  show  later  in  the  case  of  GaN  on  Si,  even  with  a  mismatch 
strain  much  higher  than  this  4%  value,  the  unique  nanoscale 
features  of  the  NHE  interface  presents  additional  mechanisms 
for  defect  reduction. 

V.  The  Importance  of  Growth  Selectivity  in  NHE 

A  critical  step  in  NHE  is  the  initial  nucleation  on  the  pat¬ 
terned  substrate.  We  require  that  there  is  selective  nucleation  on 
each  substrate  island  as  shown  schematically  in  Fig.  1,  but  not 
on  the  surrounding  growth  mask.  Optimization  of  the  MOCVD 
growth  temperature  allows  this  requirement  to  be  satisfied  for 
both  GaAs/Si  and  GaN/Si. 

Fig.  6  gives  a  synopsis  of  the  observed  GaAs/Si  nucleation 
process.  The  upper  part  of  Fig.  6(a)  shows  the  typical  nucleation 
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Fig.  7.  Plan  view  SEM  micrographs  of  GaN-on-Si  using  the  “grouped”  hexagonal  array  NHE  sample  structure  that  was  shown  in  Fig.  2(c). 


behavior  at  growth  temperatures  of  433  °C,  519  °C,  and  605  °C. 
The  lower  part  of  this  figure  shows  the  behavior  at  734  °C.  The 
surface  mobility  of  the  growth  species  is  adequate  to  provide 
selectivity  between  the  Si  and  SiC>2,  so  that  growth  only  occurs 
on  the  Si  islands.  This  is  expected  from  previous  results  [29], 
which  indicate  that,  under  the  growth  conditions  used  here,  the 
migration  length  for  GaAs  on  SiC>2  masked  surfaces  will  be  in 
the  range  100-1000  fm l.  However,  the  surface  migration  length 
on  the  Si  islands  is  much  shorter,  and  in  Fig.  6(b),  we  plot  the 
average  distance  between  GaAs  growth  nuclei  on  the  Si  islands 
as  a  function  of  temperature.  At  low  temperature  (433  °C),  the 
GaAs  nucleus  separation  is  small  and  there  are  multiple  nuclei 
on  each  Si  island.  This  separation  increases  with  growth  tem¬ 
perature  until  at  605  °C  we  have  the  optimum  number  (one) 
of  nuclei  per  Si  island.  If  we  increase  the  temperature  further, 
then  as  the  lower  part  of  Fig.  6(a)  shows,  the  migration  length 
becomes  too  long  and  some  of  the  Si  islands  are  left  without 
GaAs  nuclei.  This  migration  appears  to  be  a  well  behaved,  tem¬ 
perature  activated  process  and  Fig.  6(b)  yields  an  activation  en¬ 
ergy  of  0.71  eV  for  the  migration  of  the  GaAs  growth  species 
on  the  Si  islands.  These  data  illustrate  the  critical  role  of  growth 
selectivity  in  the  NHE  process.  If  the  growth  temperature  (and 
selectivity)  are  too  low,  we  have  multiple  nuclei  on  each  growth 
island  which  can  lead  to  twinning  and  polycrystalline  growth 
[15].  The  nanopatterned  substrate  for  this  growth  was  produced 
using  the  Moire  techniques  outlined  earlier.  Four  two-beam  ex¬ 
posures,  each  with  identical  360-nm  pitch  were  used.  The  wafer 
rotation  was  0°,  60°,  62°,  and  122°  for  the  four  exposures. 
This  produced  clustered  arrays  of  nanoscale  (^120-nm  diam¬ 
eter)  holes  with  a  local  spacing  of  360  nm  and  a  cluster  spacing 
of  ^10  fim.  Fig.  7(a)  shows  GaN  nucleation  and  that  the  se¬ 
lectivity  offered  by  the  simple  oxide  growth  mask  is  adequate. 
This  figure  also  reveals  that  the  GaN  nuclei  exhibit  well  defined 
hexagonal  facets,  which  have  been  determined  previously  [30] 
to  be  of  the  prismatic  (01  In)  family,  which  are  more  stable  than 
the  orthogonal  (112n)  facets. 

In  both  material  systems  discussed  earlier,  the  nuclei  exhibit 
3-D  growth  behavior,  reflecting  either  a  Volmer- Weber  or  a 
Stranski-Krastanov  growth  mode,  which  is  expected  due  to  the 
large  lattice  mismatches  involved.  In  the  absence  of  a  growth 


mask,  the  size  and  separation  of  these  3-D  nuclei  would  be 
more  random  but  would  increase  naturally  with  increasing 
growth  temperature.  The  optimum  growth  temperature  (typified 
by  growth  at  605  °C  for  the  GaAs-on-Si  system)  therefore 
represents  a  resonance  condition,  where  the  natural  spacing  of 
the  nuclei  that  is  determined  by  migration  length,  is  equal  to 
the  period  of  the  pattern  on  the  NHE  substrate. 

VI.  Uncoalesced  Nanoscale  Nuclei 

Previously  reported  TEM  examination  of  GaAs  growth  on 
the  Si  islands  [15]  revealed  a  tendency  to  twinning  at  lower 
growth  temperature,  while  at  605 °C  the  GaAs  was  typically 
single  crystal,  but  has  stacking  fault  defects  on  the  (111) 
planes.  While  defects  have  not  yet  been  eliminated  by  NHE 
growth,  the  defect  structure  in  NHE  samples  of  GaAs-on-Si 
was  significantly  different  from  that  observed  in  regular  GaAs 
on  planar  Si  growth.  Specifically,  the  incidence  of  threading 
dislocations  has  been  largely  replaced  in  NHE  by  stacking  fault 
defects.  This  is  also  the  case  for  NHE  GaN-on-Si  growth  [31], 
where  again  the  concentration  of  threading  type  defects  was 
significantly  reduced  and  the  most  frequently  occurring  defects 
were  found  to  be  stacking  faults.  In  the  case  of  GaAs-on-Si,  the 
(111)  stacking  faults  are  on  inclined  planes  that  propagate  into 
the  epilayer.  In  the  case  of  GaN-on-Si,  the  stacking  defects  are 
in  the  basal  plane,  and  while  there  is  a  high  defect  density  at  the 
heterointerface,  these  defects  are  confined  to  the  interface  and 
do  not  propagate  into  the  epilayer.  Cross-sectional  transmission 
electron  microscopy  (XTEM)  for  a  GaN  nucleus  measuring 
0.32  fm l  across  is  shown  in  Fig.  8(a).  The  NHE  sample  in  this 
case  was  of  the  type  shown  in  Fig.  2(b)  with  an  approximately 
120-nm  diameter  hole  size.  At  this  hole  size,  NHE  theory 
suggests  that  there  will  be  little  3-D  deformation  or  “nanoscale 
advantage.”  Interestingly,  however,  we  observe  the  same  defect 
behavior  [see  Fig.  8(b)]  that  we  reported  earlier  [11]  for  much 
smaller  (70-nm  diameter)  GaN-on-Si  nuclei,  i.e.,  disorder  on 
the  Si  side  of  the  interface,  a  high  density  of  stacking  faults  in 
the  first  layers  of  GaN,  defect-free  GaN  growth  away  from  the 
interface,  and  a  significantly  reduced  incidence  of  c-oriented 
threading  dislocations. 
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Fig.  8.  XTEM  micrographs  showing  (a)  absence  of  threading  dislocations  in  0.32-jum  nucleus,  (b)  High-resolution  XTEM  micrograph  showing  stacking  faults 
at  heterointerface. 


VII.  Coalescence  of  NHE  Films 

Coalescence  behavior  is  very  different  in  the  GaAs-on-Si  and 
GaN-on-Si  materials  systems  and  we  attribute  this  difference  in 
part  to  the  different  faceting  behavior  of  the  nuclei,  that  can  be 
seen  by  a  comparison  of  Figs.  6(c)  and  7.  GaAs  nuclei  exhibit 
nonuniform  faceting  that  leads  to  local  variations  of  growth  rate 
and  uneven  coalescence.  In  contrast,  the  facets  of  the  GaN  nu¬ 
clei  are  extremely  uniform  and  this  permits  the  simultaneous 
coalescence  of  all  parts  of  the  film. 

A  second  and  related  cause  of  poor  coalescence  in  NHE 
GaAs  is  revealed  by  Fig.  9,  which  shows  SEM  views  of  orthog¬ 
onal  cleaves  from  a  coalesced  GaAs-on-GaAs  NHE  film.  (In 
this  experiment,  GaAs  was  grown  directly  on  a  nanopatterned 
NHE  GaAs  substrate  to  specifically  study  GaAs  coalescence 
in  the  absence  of  strain.)  The  darker  regions  are  marker  layers 
of  Al0.4Ga0.eAs  that  were  grown  approximately  every  300  nm, 
to  allow  visualization  of  the  evolution  of  the  film  morphology. 
The  nanopatterning  period  is  0.5  \i m  and  the  initial  pyramidal 


shape  of  the  GaAs  nuclei  can  be  clearly  seen  at  the  bottom  of 
each  figure.  For  the  cleave  direction  shown  in  Fig.  9(a),  one 
can  define  three  regions.  In  region  1  (up  to  the  second  marker 
layer),  the  GaAs  film  is  planarizing.  In  region  2  (second  marker 
layer  to  fifth  marker  layer),  the  film  becomes  nonplanar  with 
a  roughness  that  is  unrelated  to  the  pattern  period.  In  region 
3  (after  the  fifth  marker  layer),  the  film  begins  to  planarize 
again.  Fig.  9(b)  shows  the  orthogonal  cleave  for  the  same 
growth.  In  this  case,  the  growth  is  conformal  up  to  the  third 
marker  layer,  then  the  film  begins  to  planarize.  This  difference 
in  growth  habit  is,  we  believe,  a  second  major  cause  of  the 
poor  morphology  observed  for  the  coalesced  GaAs  films.  Even 
though  the  GaAs  surface  planarizes  rapidly  in  one  direction 
[Fig.  9(a)]  this  planarity  becomes  disturbed  by  the  nonplanar, 
conformal  growth  that  is  simultaneously  taking  place  in  the 
orthogonal  direction  [Fig.  9(b)].  The  conformal  growth  will 
develop  a  range  of  facets,  and  the  faster  growth  of  these  facets 
will  compete  for  growth  nutrient  causing  a  local  reduction  of 
growth  rate  in  the  orthogonal  direction.  Overall  planarization 
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Fig.  10.  NHE  GaN-on-Si  using  a  square  2-D  sample  structure,  (a)  Edge  region  where  growth  rate  is  reduced,  (b)  Center  of  sample  where  full  coalescence  has 
taken  place. 


Fig.  1 1 .  NHE  GaN-on-Si  using  a  hexagonal  “grouped”  sample  structure,  (a)  Edge  region  where  growth  rate  is  reduced,  (b)  Center  of  sample  where  full  coalescence 
has  taken  place. 


can,  therefore,  only  take  place  when  both  directions  planarize 
equivalently.  As  might  be  expected,  macroscopic  measure¬ 
ments  (XRD,  PL,  etc.,)  of  fully  coalesced  GaAs  films  of  this 
type  indicate  substandard  quality  by  comparison  with  bulk 
GaAs  and  more  optimization  of  NHE  coalescence  conditions 
is  clearly  necessary. 

Despite  the  large  mismatch  between  GaN  and  Si,  the  coa¬ 
lescence  of  GaN  appears  to  be  well  behaved  and  large  areas 
of  smooth  films  are  readily  produced.  This  is  a  result  of  the 
highly  uniform  size  and  shape  of  the  nuclei  as  shown  in  Fig.  7. 
Figs.  10  and  1 1  compare  the  coalescence  of  NHE  GaN-on-Si  for 
a  uniform  square  array  (Fig.  10)  and  a  grouped  hexagonal  array 
(Fig.  1 1).  Figs.  10(a)  and  1 1(a)  show  the  sample  edge  where  the 
growth  rate  is  lower  and  this  region  reveals  the  superior  (more 
uniform)  coalescence  obtained  with  the  hexagonal  array.  The 
sample  shown  in  Fig.  11(a)  used  the  “grouped”  NHE  structure 
shown  in  Fig.  2(c);  thus,  the  elongated  hexagons  at  the  left  of 
Fig.  11(a)  are  the  already  coalesced  groups  of  nuclei  [note  the 
scale  difference  between  Figs.  10(a)  and  11(a)].  Figs.  10(b)  and 
11(b)  show  the  corresponding  fully  coalesced  regions  of  these 
samples  and,  as  expected,  cracking  is  evident  as  a  result  of  the 
large  thermal  expansion  mismatch. 


VIII.  Characterization  of  Coalesced  NHE 
GaN-ON-Si  Films 

Atomic  force  microscopy  measurements  on  the  coalesced 
NHE  GaN  films  showed  rms  roughness  values  of  between 
5  and  6  A  for  all  samples,  including  the  regions  exhibiting 
cracks.  Cracking  is  evident  in  the  fully  coalesced  regions 
of  all  GaN-on-Si  samples  and  is  expected  from  the  thermal 
mismatch  between  GaN  and  Si,  which  puts  the  GaN  in  tension. 
Bright  field  cross-sectional  TEM  (Fig.  12)  on  fully  coalesced 
GaN-on-Si  NHE  films  (2-D  square  array  sample  structure) 
yields  a  defect  density  of  3  x  109  cm-2,  assuming  a  specimen 
thickness  of  750  A. 

Room  temperature  PL  measurements  reported  earlier  [32] 
indicated  that  coalesced  NHE  GaN-on-Si  films  have  a  sig¬ 
nificantly  higher  luminescence  efficiency  than  either  planar 
GaN-on-Si  films  or  planar  GaN-on-sapphire  films. 

Residual  strain  in  these  samples  was  measured  by  micro- 
Raman  scattering  measurements  made  in  a  standard  back- 
scattering  geometry  using  a  488-nm,  Ar-ion-laser  source,  a 
40  x  microscope  objective  (spot  size  ~  1  fi m),  a  1-m  double 
spectrometer  and  an  optical  multichannel  analyzer  (OMA). 
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Fig.  12.  Bright  field  cross  sectional  TEM  micrograph  on  coalesced  NHE 
GaN-on-Si  on  the  2-D  square  array  sample  structure. 
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<  ^sapphire  =  7.5  x  10  6/°C.  The  rms  variation  in 

wavenumber  detailed  in  the  a  column  of  Table  II  is  much 
larger  than  the  experimental  repeatability  and  represents  a  real 
variation  of  local  strain  in  the  sample.  This  variation  is  at  the 
measurement  limit  for  the  planar  samples  (0.12  cm-1)  and 
large  (~1  cm-1)  for  the  coalesced  NHE  samples.  Interestingly, 
the  partially  coalesced  NHE  hexagonal  sample  exhibits  a  small 
spread  in  wavenumbers  and  also  has  the  smallest  stress  of  only 
—0.09  GPa.  This  stress  is  less  than  20%  of  the  stress  measured 
in  the  other  NHE  samples  and  is  approximately  half  of  the  low 
stress  achieved  by  growing  GaN  onto  porous  GaN  sublayers 
[34].  This  suggests  that  the  “grouped”  NHE  sample  structure 
[Fig.  2(c)]  may  be  a  viable  route  to  higher  quality  GaN-on-Si. 
More  detailed  mapping  [35]  of  these  stress  variations,  in  both 
the  coalesced  and  partially  coalesced  cases,  is  underway  and 
should  yield  additional  useful  information  to  guide  future 
development  of  NHE. 

The  GaN  bandgap  peak  energy  measured  by  photolumines¬ 
cence  (PL)  can  also  be  used  to  determine  residual  stress  in 
GaN  films.  At  room  temperature,  the  band  edge  peak  shifts 
by  approximately  24  meV  per  GPa  of  biaxial  stress  [33].  The 
right-hand  column  of  Table  II  lists  the  GaN  room  temperature, 
photoluminescence  peak  energy  measured  for  these  GaN 
samples.  We  calculate  that  the  GaN-on-planar-Si  and  the 
GaN-on-Si  square  array  samples  have  a  tensile  stress  that  is 
approximately  —1  GPa  relative  to  the  stress  in  the  partially 
coalesced  NHE  GaN  film  (on  the  “grouped”  hexagonal  array). 
This  is  approximately  twice  the  value  of  stress  indicated  by  the 
Raman  measurement  but  the  agreement  is  not  unreasonable 
given  the  large  width  of  the  room  temperature  PL  peak. 


IX.  Discussion  and  Conclusion 


Fig.  13.  Raman  spectrum  showing  the  Si  substrate  peak  at  521  cm-1  and  the 
GaN  (E2)  peak  in  the  range  560  to  570  cm-1.  (Note:  The  traces  have  been 
displaced  vertically  for  clarity.) 

Spectral  shifts  were  calibrated  against  emission  lines  from  the 
Ar  plasma  with  a  repeatability  of  ~  one  diode  in  the  OMA 
array  or  0.12  cm-1.  Typical  micro-Raman  spectra  for  various 
sample  structures,  showing  a  Si  peak  at  521  cm-1  and  the 
GaN  E2  peak  in  the  vicinity  of  570  cm-1,  are  given  in  Fig.  13. 
Table  II  summarizes  Raman  data  from  these  measurements 
for  fully  coalesced  and  partially  coalesced  GaN  films.  [The 
partially  coalesced  films  are  on  the  “grouped”  NHE  sample 
structure  as  shown  at  the  left  of  Fig.  11(a),  where  there  is 
coalescence  within  the  group  of  nanonuclei  but  adjacent  groups 
have  not  yet  coalesced.]  The  shift  of  the  Raman  E2  peak  was 
translated  to  biaxial  stress  in  the  epitaxial  GaN  film  using 
the  approach  described  by  Reiger  et  al.  [33]. 

For  all  of  the  GaN  growths  on  silicon,  the  E2  Raman 
peak  wavenumber  is  less  than  the  bulk  (unstrained)  value 
of  568  cm-1,  indicating  that  the  GaN  is  in  tension.  For  the 
reference  sample  of  GaN  grown  on  sapphire,  the  Raman 
peak  wavenumber  is  greater  than  the  bulk  value  indicating  that 
the  GaN  film  is  in  compression.  These  results  are  consistent 
with  previous  data  [33]  and  with  the  in-plane  thermal  expansion 
coefficients:  aSi  =  2.6  x  l(r6/0C  <  aGaN  =  5.6  x  l(r6/0C 


There  are  both  similarities  and  differences  in  the  nucleation 
and  coalescence  phases  of  the  NHE  growth  in  the  two  material 
systems  we  have  investigated.  For  both  GaAs/Si  and  GaN/Si, 
stacking  faults  are  found  to  be  the  dominant  defect.  For  the 
GaAs/Si  case,  these  defects  nucleate  on  the  {111}  sidewalls 
of  the  growth  nuclei  and  are  problematic  because  they  propa¬ 
gate  up  into  the  epitaxial  layer.  This  type  of  sidewall  growth 
nucleation  may  be  eliminated  by  using  for  example,  the  NHE 
structure  shown  in  Fig.  2(b)  (holes  in  an  oxide  overlayer  atop 
a  blanket  Si  film,  similar  to  that  demonstrated  for  the  GaN 
growth).  However,  this  type  of  NHE  structure  has  an  inherently 
lower  compliance  than  the  SOI  structure  of  Fig.  2(a). 

In  GaN/Si  NHE  growth,  the  (111)  Si  facet  is  the  lowest 
energy  face  and  the  problem  of  nucleation  on  the  sidewalls 
does  not  occur  with  island  growth  [11]  (furthermore,  sidewall 
growth  is  impossible  for  the  Si02 -masked  blanket  Si  growth 
investigated  here).  However,  the  lattice  mismatch  in  this  mate¬ 
rial  system  is  too  large  to  be  accommodated  by  the  nanoscale 
growth  and  defect  formation  is  inevitable.  The  most  significant 
impact  of  the  NHE  growth  is  in  the  character  of  the  defects 
which  are  stacking  faults  confined  to  a  thin  region  just  at  the 
initiation  of  the  growth.  These  defects  appear  to  largely  replace 
the  threading  dislocations  that  propagate  throughout  the  film 
and  that  are  characteristic  of  planar  (large  area)  GaN/Si  growth. 
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TABLE  II 

Micro-Raman  E2  Measurements  and  Average  Stress  for  GaN  Films.  (Note:  A  Negative  Stress  Indicates 
That  GaN  Film  is  in  Tension  and  a  Positive  Stress  Indicates  Compression) 


Sample 

<e2> 

(cm1) 

Ok 

(crrr ) 

Average 
biaxial  stress 
(GPa) 

RT  PL 
peak 
(eV) 

GaN  on  planar  Si 

563.82 

0.12 

-0.52 

3.396 

NHE  GaN  on  square  2D  array 
(coalesced) 

563.53 

1.21 

-0.56 

3.396 

NHE  GaN  on  hexagonal  2D  array 
(coalesced) 

563.95 

0.84 

-0.51 

3.420 

NHE  GaN  on  hexagonal  2D  array 
(partially  coalesced) 

567.3 

0.25 

-0.09 

GaN  on  planar  sapphire 

569.18 

0.12 

+  0.15 

- 

The  kinetics  of  the  coalescence  are  also  dramatically  dif¬ 
ferent  for  the  two  material  systems.  Because  of  the  differences 
in  growth  rate  along  different  directions  of  the  GaAs  surface, 
any  uneven  growth  at  the  initial  separated  island  stage  is  not 
smoothed  over  in  the  coalescence  but  rather  is  amplified,  leading 
to  an  uneven  growth  habit.  Clearly,  much  further  investigation 
is  needed  before  useful  device  quality  material  is  achieved. 

Raman  measurements  on  coalesced  GaN/Si  films  show  a 
level  of  strain  consistent  with  the  thermal  expansion  coefficients 
of  the  materials  involved.  This  implies  that  the  lattice  mismatch 
is  largely  accommodated  at  growth  temperature  and  that  the 
residual  strain  is  due  mainly  to  differential  thermal  expansion 
arising  during  postgrowth  cooling.  In  the  absence  of  other  strain 
relief  mechanisms,  Raman  measurements  should  yield  the 
same  level  of  strain,  a  dimensionless  parameter,  for  large  and 
small  areas.  The  observation  of  a  significantly  reduced  strain 
in  the  partly  coalesced  NHE  GaN  films,  therefore,  suggests  an 
additional  mechanism  is  at  work  in  these  samples.  We  speculate 
that  in  these  smaller  area,  partially  coalesced  samples,  much 
of  the  thermal  expansion  mismatch  is  being  accommodated  by 
a  relative  motion  of  the  film  and  the  substrate  during  cooling 
as  proposed  by  Lo  et  al.  [12].  This  relative  motion  is  clearly 
an  extensive  parameter  that  must  increase  with  the  linear 
dimensions  of  the  film  and,  thus,  is  not  possible  in  the  large 
areas  of  the  fully  coalesced  samples.  We  are  currently  pursuing 
micro-Raman,  high-resolution  “mapping”  of  these  samples  to 
study  the  spatial  variation  of  strain  and  correlate  this  with  the 
underlying  NHE  sample  structure.  Clearly,  more  investigations 
of  this  strain  relaxation  as  a  function  of  the  size  of  the  NHE 
group  and  the  extent  of  the  overgrowth  will  be  critical  to 
understanding  this  phenomenon. 

Another  interesting  observation  is  the  apparent  loss  of  se¬ 
lectivity  just  before  coalescence  in  the  GaN/Si  system  as  seen 
in  Fig.  11(a).  On  the  right  side  of  the  figure,  the  film  is  fully 
coalesced.  On  the  left  side,  individual  hexagons  of  GaN  at  a 
scale  of  several  nanometers  are  evident.  Just  at  the  transition 
region,  there  is  a  large  number  of  small  nuclei  to  the  side 
of,  and  independent  of,  the  Si  nucleation  sites  that  are  buried 
under  the  large  hexagons.  Presumably,  this  would  result  in  a 
polycrystalline  material  since  these  nuclei  are  likely  randomly 
oriented.  Yet  the  resulting  film  is  single  crystal.  Do  these  nuclei 
reorient  as  the  film  comes  together?  Why  does  the  selectivity, 
which  is  very  good  away  from  this  interface,  break  down  in  this 


transition  region?  Clearly,  much  more  needs  to  be  understood 
about  the  coalescence  regime. 

For  both  material  systems,  the  unique  features  of  NHE  lead 
to  substantial  changes  in  the  defect  structure  over  traditional 
blanket  film  growth.  There  is  a  large  parameter  space  to  investi¬ 
gate  and  systematic  investigation  is  necessary  to  find  the  optimal 
combination  of  NHE  substrate  and  growth  conditions.  Interfer¬ 
ometric  lithography  offers  a  large  degree  of  flexibility  to  aid  in 
this  investigation  while,  at  the  same  time,  is  suited  for  scaling 
to  higher  levels  of  manufacturing  as  devices  and  circuits  based 
on  the  integration  that  NHE  offers  between  Si  and  other  mate¬ 
rial  systems  evolves.  While  the  potential  of  defect- free  lattice 
mismatched  materials  and  devices  has  yet  to  be  realized,  NHE 
continues  to  demonstrate  promise  toward  that  potential.  At  the 
same  time  NHE  is  providing  a  rich  array  of  new  epitaxial  growth 
phenomena  that  will  lead  to  significant  improvements  in  the  sci¬ 
ence  of  heterostructure  growth. 
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Abstract 

Dynamical  faceting  during  homoepitaxial  growth  of  GaAs  on  nanoscale-patterned  surfaces  by  molecular  beam 
epitaxy  is  examined.  Selective  deposition  on  open  GaAs(lOO)  surfaces  with  lateral  dimensions  ranging  from  130  to 
250  nm,  separated  by  15-80nm-wide  (25-nm-thick)  Si02  stripes  aligned  along  the  [0  I  1]  direction  results  in  facet 
formation  and  lateral  growth  over  the  Si02  mask.  At  the  early  stage  of  growth,  (311)  facets  appear  on  sidewalls  near 
the  boundary  between  an  open  GaAs  surface  and  Si02  mask,  these  are  replaced  by  (1  1  1)  facets  starting  from  Si02 
boundaries  as  growth  continues.  After  complete  replacement,  growth  proceeds  laterally  in  the  direction  perpendicular 
to  [0  I  1]  retaining  the  (1  1  1)  facets  until  coalescence  occurs  between  adjacent  triangular  cross-sectioned  GaAs  stripes. 
Nanoscale  fabrication  nonuniformity  results  in  dynamical  formation  and  retention  of  multiple  (3  11)  facets  even  for 
growth  thicknesses  much  greater  than  the  thickness  of  the  Si02  mask  stripes.  This  dynamical  faceting  is  interpreted  by 
minimization  of  total  surface  free  energy  based  on  equilibrium  crystal  shape,  in  qualitative  agreement  with  our 
experimental  results.  ©  2002  Elsevier  Science  B.V.  All  rights  reserved. 

PACS:  68.55.B 

Keywords:  Al.  Faceting;  A2.  Lateral  growth;  A3.  Molecular  beam  epitaxy;  A3.  Selective  epitaxy 


1.  Introduction 

Nanoscale-patterned  (nanopatterned)  growth  is 
attracting  considerable  interest  because  of  the 
possibility  of  a  deterministic  alignment  of  quan- 
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turn  dots  (QDs)  with  improved  size  uniformity  and 
of  migration-assisted  selective  growth  in  molecu¬ 
lar  beam  epitaxy  (MBE)  [1,2].  For  1 -dimensional 
(1-D)  patterning,  as  the  lateral  dimension  of  the 
open  substrate  surface,  d,  is  decreased  to  a  scale 
comparable  to  or  smaller  than  the  thickness,  t,  of 
an  epilayer,  faceting  will  determine  the  overall 
shape  of  a  selectively  grown  epilayer.  This  faceting 
provides  a  powerful  new  approach  to  the  forma¬ 
tion  of  nanopatterned  surfaces  that  avoids  many 


0022-0248/02/$- see  front  matter  ©  2002  Elsevier  Science  B.V.  All  rights  reserved. 
PII:  S0022-0248(02)00867-9 


334 


S.  C.  Lee  et  al.  /  Journal  of  Crystal  Growth  240  ( 2002 )  333-339 


of  the  damage  and  crystal  disruption  issues 
associated  with  subtractive  (etching)  preparation 
of  nanopatterned  substrates.  In  this  paper,  we 
illustrate  this  behavior  in  three  regimes  for 
homoepitaxial  growth  of  GaAs  by  MBE  on 
~  100-  to  200-nm-wide  GaAs(lOO)  open  faces 
bounded  by  thin  —  50-nm-wide  Si02  stripes 
oriented  along  a  [0  I  1]  direction:  (1)  growth  within 
the  original  open  GaAs  stripes  where  the  original 
facets  adjust  dynamically  as  the  open  area  is  filled 
for  t<d ;  (2)  lateral  growth  over  the  Si02  stripes 
for  t~d ;  and  (3)  a  return  to  the  original  faceting 
for  films  much  thicker  than  the  stripe  spacing,  t  > 
d.  According  to  the  basic  concept  of  equilibrium 
crystal  shape  (ECS)  theory,  faceting  occurs  to 
minimize  the  total  surface  free  energy,  E  [3]  (for 
review  of  Wulffs  construction,  see  Ref.  [4]). 
Faceting  is  very  important  for  nanopatterned 
growth  and  associated  nanoscale  lateral  growth 
(NLG)  investigated  in  this  work;  evolution  of  the 
shape  and  of  the  faceting  in  nanoscale  growth 
leads  to  lateral  growth  which  has  not  been 
observed  in  pm-scale-patterned  MBE  growth. 

We  investigate  homoepitaxial  growth  of  GaAs 
with  a  1-D  Si02  stripe  pattern  atop  a  GaAs  (10  0) 
substrate,  which  demonstrates  the  basic  idea  of 
ECS  without  any  complications  from  heteroepi- 
taxial  strain.  Surface  migration  of  incident  atoms 
is  one  of  major  physical  origins  of  faceting  in 
MBE.  Boundary  faceting  occurs  near  the  edge 
between  a  Si02  mask  and  an  open  substrate 
surface  [5].  Microscopically,  boundary  faceting 
has  been  explained  by  variation  of  surface-bond¬ 
ing  configuration,  leading  to  different  surface 
reconstructions  and  different  surface  free  energies. 
As  a  consequence  of  the  restricted  growth  area, 
faceting  must  evolve  dynamically  to  maintain  a 
minimum  surface  free  energy.  Thus,  generation 
and  disappearance  of  various  facets,  referred  as 
dynamical  faceting,  occurs  during  growth.  As  the 
growth  thickness  increases  beyond  the  thickness  of 
the  Si02  masking  film,  dynamical  faceting  will 
determine  lateral  or  vertical  growth  mode  and  the 
continued  evolution  of  the  facet  structure. 

In  this  work,  we  present  what  is  to  our  know¬ 
ledge  the  first  experimental  evidence  of  dynamical 
faceting  observed  on  a  1-D  Si02  stripe-patterned 
GaAs  substrate  with  homoepitaxial  MBE.  We 


examine  NLG  on  Si02-patterned  substrates  show¬ 
ing  the  evolution  of  surface  faceting  and  propose  a 
qualitative  interpretation  using  minimization  of 
total  free  energy  based  on  ECS. 


2.  Nanoscale  patterning  and  MBE  growth 

A  1-D  25-nm- thick  Si02  stripe  pattern  oriented 
along  the  [Oil]  direction  was  generated  on  a 
GaAs(l  0  0)  substrate  by  large-area  interferometric 
lithography  and  dry  etching.  Fig.  1(a)  a  shows  a 
45°-tilted  scanning  electron  microscope  (SEM) 
image  of  a  180-nm  period  Si02-patterned  substrate 
for  which  d  is  about  130nm  and  w,  the  Si02  stripe 
width,  is  about  50  nm.  The  Si02  stripes  have  an 
edge  roughness  of  about  5-10  nm.  The  edge 
definition  in  the  SEM  is  due  partly  to  this 
roughness  and  partly  to  the  SEM  resolution 
limit.  Interferometric  lithography  and  the  detailed 
patterning  procedure  have  been  presented 
elsewhere  [2]. 


Fig.  1.  (a)  A  45°-tilted  SEM  image  of  a  180-nm  period  Si02 
pattern  with  d  ~  130nm,  generated  on  a  GaAs  (100)  substrate 
by  IL  and  dry  etching,  (b)  A  cross-sectional  SEM  image  of  a 
1 80-nm  period  (d~  130  nm)  Si02-patterned  substrate  after 
GaAs  growth  of  t&  =  1 5  nm. 
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Fig.  2.  A  schematic  illustration  of  a  cross-sectional  view  of  an 
as-grown  sample  with  the  parameters  used  in  this  work.  Si02 
stripe  masks  in  every  sample  were  removed  by  HF  treatment 
after  growth. 

MBE  was  performed  at  610  +  5°C  with  growth 
rate  of  about  0.1  monolayers.  Under  these  condi¬ 
tions,  the  sticking  coefficient  of  a  Ga  atom  on  the 
Si02  surface  is  very  low  and  selective  growth  is 
realized  [1].  Fig.  2  is  a  schematic  illustration  of  a 
cross-sectional  view  of  selectively  grown  GaAs 
stripes  of  which  each  sidewall  consists  of  a  single 
facet.  Deposition  thickness,  t d,  on  a  Si02-pat- 
terned  substrate  was  determined  from  growth  time 
multiplied  by  growth  rate  and  means  the  thickness 
of  GaAs  deposited  on  an  unpatterned  substrate 
under  the  same  growth  condition,  as  indicated  by 
the  dashed  line  in  Fig.  2.  Fig.  1(b)  shows  a  SEM 
cross-section  of  a  different  sample  with  d~  130  nm 
after  selective  growth  of  GaAs  of  about 
td  =  15nm.  For  this  and  all  subsequent  SEMs, 
the  Si02  has  been  removed  with  an  HF  etch  after 
the  growth.  For  this  work,  d  was  varied  from  130 
to  250  nm  by  changing  the  period  and  the  etch 
parameters,  and  t d  was  varied  from  15  to  400  nm. 


3.  Results  and  discussion 

Fig.  3  shows  cross-sectional  SEM  images  taken 
from  a  single  growth  run  on  a  sample  with  regions 
of  different  periods  and  d’ s  between  Si02  stripes. 
In  Fig.  3,  td  =  46  nm.  As  expected,  several  differ¬ 
ent  facet  structures  were  generated  depending  on 
d.  All  of  the  cross  sections  show  a  (1  1  1)  facet.  For 
d  =  130  nm  [Fig.  3(a)],  the  (1  1  1)  facets  fill  the 
entire  space.  For  the  SEMs  in  Figs.  3(b)  and  (c), 


x  [oT  1  ]  100  nm 


Fig.  3.  Cross-sectional  SEM  images  for  d~130nm  (a),  170  nm 
(b)  and  340  nm  (c)  Si02-patterned  substrate  after  GaAs  growth 
of  =  46  nm. 

both  (1  1  1)  and  (3  1  1)  facets  are  seen,  the  lengths 
of  the  (3  11)  facets  increase  with  d. 

Phenomenologically,  the  faceting  observed  in 
Fig.  3  can  be  understood  as  follows.  At  the  initial 
stage  of  growth,  a  (3  1  1)  facet  begins  to  appear  on 
an  open  GaAs  surface  near  the  boundary  between 
Si02  mask  and  open  substrate  surface  as  seen  in 
Fig.  1(b).  Study  on  the  very  early  stage  of  facet 
formation  is  beyond  the  scope  of  this  work.  For 
td  =  1 5  nm,  a  facet  generated  on  sidewalls  near  the 
boundary  is  (3  1  l)-type.  The  maximum  deposition 
thickness,  td 311,  for  which  this  facet  is  allowed  is 
given  by  ^,311  =  t/2  =  d  tan0imn/4  where  6\mn  = 
03ii  ~25°,  as  indicated  in  Fig.  2.  For  d  =  130 nm, 
fd,3ii  is  about  15  nm  as  is  clear  from  Fig.  1(b).  If  d 
is  increased,  the  allowable  thickness  to  retain  a 
(3  11)  facet  also  increases  as  is  evident  from 
Figs.  3(b)  and  (c).  For  d  =  340  nm  of  Fig.  3(c), 
td,3\\  is  ~40nm,  the  pictured  case  of  ^d~46nm,  a 
(111)  facet  has  just  begun  to  form  at  the  edges  of 
the  open  GaAs  surface.  For  d  =  170  nm,  Fig.  3(b), 
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the  (111)  facet  boundary  is  closer  to  the  apex  of 
the  growth,  and  finally  for  d  =  130nm,  the  (311) 
facets  is  gone  entirely  and  the  (111)  facets 
completely  fill  the  open  area  (t&  =  46  nm  ~  ^,m  = 
<itan0m/4  for  <i~130nm  where  0m~56°).  Co¬ 
existence  of  (3  11)  and  (111)  facets  on  a  sub-pm- 
scale-patterned  GaAs(lOO)  substrate  has  been 
reported  by  Dilger  et  al.  [6]. 

Initial  faceting  of  Fig.  1(b)  is  consistent  with 
minimization  of  E  if  (3  1  1)  facets  are  more  stable 
or  have  lower  surface  free  energy  than  (111) 
facets.  Although  there  has  been  no  absolute 
comparison  of  surface  free  energies  between 
different  orientations  of  GaAs,  Scheffler  and  his 
coworkers  have  evaluated  the  surface  free  energy 
of  several  GaAs  surfaces  with  density-functional 
theory,  and  suggest  that  a  (311)  surface  has  a 
surface  free  energy  comparable  with  other  lower 
index  surfaces  [7,8].  According  to  their  calculation, 
a  (3  1  1)  facet  has  surface  free  energy  of  47  meV/A2 
compared  to  5 1-54  meV/A2  for  a  (1  1  1)  facet  in  an 
As-rich  environment,  consistent  with  our  experi¬ 
mental  findings.  Irrespective  of  its  surface  free 
energy,  the  initial  (10  0)  facet  must  be  replaced  by 
other  facets  as  growth  proceeds  in  a  confined 
geometry. 

We  explored  the  possibility  of  lateral  growth 
through  continued  growth  over  the  faceted  sur¬ 
face,  e.g.,  for  t&  >  ?d,in  =  d  tan  9 m/4  for  a  given 
d.  For  this  purpose,  we  employed  265-nm  period, 
25-nm-thick  Si02  patterns  providing  better  resolu¬ 
tion.  Fig.  4  presents  45°-tilted  [(a)  and  (b)]  and 
cross-sectional  [(c)]  SEM  images  after  deposition 
of  t&  =  200  nm  of  GaAs  which  is  more  than  twice 
td,m  ~75nm  for  d~  180-220nm.  Adjacent  trian¬ 
gular  cross-sectioned  GaAs  stripes  undergo  coa¬ 
lescence  as  the  growth  proceeds.  The  coalescence  is 
nonuniform  as  a  result  of  variations  in  d  and  w 
associated  with  process  limitations  at  this  sub- 100- 
nm  scale.  If  the  Si02  pattern  were  uniform, 
simultaneous  overall  coalescence  might  have  oc¬ 
curred  although  the  edge  definition  is  intrinsically 
limited  at  this  nm-scale  by  the  discrete  atomic 
nature  of  the  interfaces.  Coalescence  over  the  Si02 
mask  stripe  is  retarded  for  a  larger  w  since  the 
sticking  coefficient  of  Ga  on  the  Si02  surface  is 
negligible  under  our  growth  conditions.  Addition¬ 
ally,  the  extrapolated  surface  diffusion  length  of  a 
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Fig.  4.  45°-tilted  [(a)  and  (b)]  and  cross-sectional  [(c)]  SEM 
images  of  a  265-nm  period  Si02-patterned  substrate  after  GaAs 
growth  of  td  =  200  nm. 

Ga  atom  on  GaAs  (10  0)  surface  along  the  [Oil] 
direction  at  615°C  is  several  pm  [9].  Also,  it  has 
been  reported  that  the  surface  diffusion  length  on 
(111)  surface  is  greater  than  that  on  (1  0  0)  surface 
[10].  This  implies  that  once  initial  coalescence 
occurs  at  some  point  over  Si02  mask  which  is 
generally  around  a  small-w  region,  growth  partly 
proceeds  along  the  [Oil]  direction,  onto  the  plane 
of  the  cross  sections,  as  well  as  filling  in  the  V- 
groove  between  the  (1  1  1)  faces  (see  below).  This 
and  other  nonlinear  growth  dynamics  appear  to 
amplify  the  effects  of  the  differences  in  w  and 
might  account  for  the  strong  nonuniformities  in 
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Fig.  5.  Cross-sectional  SEM  images  of  a  265-nm  period  Si02-patterned  substrate  having  different  w  (~  15-80nm)  after  GaAs  growth 
of  td  —  200  nm. 


the  original  Si02  mask.  The  coalescence  shown  in 
Fig.  4  results  in  growth  of  single  crystalline  GaAs 
over  the  Si02  mask.  The  clear  faceting  of  the 
surface  along  with  the  absence  of  any  ring  patterns 
in  in  situ  reflection  high-energy  electron  diffraction 
shows  that  no  polycrystalline  phase  occurs  over 
the  entire  growth  plane. 

Magnified  views  of  NLG  and  coalescence 
regions  are  shown  in  Fig.  5  for  different  w  values 
ranging  from  ~  1 5  to  80  nm.  These  images 
correspond  to  different  regions  of  the  sample 
prepared  with  differing  etch  times  to  vary  w,  but 
with  a  single  growth  to  ensure  identical  growth 
conditions.  Coalescence  depends  both  on  w  and  on 
the  oxide  thickness.  In  Fig.  5(a),  coalescence  is 
evident  for  w~15nm,  whereas  a  thicker  deposi¬ 
tion  is  required  for  w~85nm  in  Fig.  5(c),  which 
clearly  exhibits  NLG.  In  this  figure,  growth 
proceeds  in  lateral  direction,  over  the  Si02  mask, 
and  retaining  (111)  sidewall  facets.  It  is  not  clear 
whether  some  particular  facets  were  generated 
along  the  sidewalls  of  Si02  mask  in  Fig.  5.  One  of 
most  probable  facets  is  (0  1  l)-type  perpendicular 


to  (10  0)  if  Si02  edge  has  vertical  sidewalls.  The 
filling  process  of  Fig.  4  is  then  continued  until  the 
height  of  the  (0  1  l)-type  facets  reaches  the  thick¬ 
ness  of  the  Si02  mask.  To  minimize  E,  growth  is 
continued  with  extension  of  (1  1  1)  facets  over  Si02 
mask  leading  to  lateral  growth. 

More  investigation  of  the  details  of  the  growth 
up  to  the  thickness  of  Si02  mask  is  clearly 
necessary.  In  particular,  the  mask  used  in  this 
work  is  only  20-30  nm  thick  and  does  not  have 
vertical  sidewalls.  Regardless  of  its  thickness,  it  is 
evident  that  the  presence  of  Si02  mask  results  in 
discontinuity  of  bonding  sites  available  for  epitax¬ 
ial  growth  and  causes  facet  formation  on  adjacent 
open  GaAs  surface.  However,  these  slanted  Si02 
sidewalls  may  allow  extension  of  the  (1  1  1)  facets 
over  Si02  mask.  In  selective  growth,  initiation  of 
over-growth  also  requires  a  certain  activation 
energy  which  may  depend  on  the  slanted  degree 
of  Si02  sidewalls.  Growth  without  lateral  expan¬ 
sion  after  filling  out  open  GaAs  (10  0)  surface  has 
been  reported  in  circular-patterned  selective 
growth  that  does  not  provide  any  particular  facet 
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Fig.  6.  45°-tilted  (a)  and  cross-section  (b)  SEM  images  of  a 
265-nm  period  Si02-patterned  substrate  after  GaAs  growth 
of  td  =  400  nm. 


in  sidewall  [2].  If  extension  of  existing  facets  over 
Si02  mask  is  energetically  more  favorable  than 
creation  of  the  next-highest-surface-free-energy 
facet,  epitaxial  growth  would  keep  its  minimum 
E  by  facet  extension.1  This  interpretation  can  be 
applied  to  the  replacement  of  (3  11)  by  (111)  of 
Fig.  2.  Lateral  growth  is  then  accompanied  by 
over-growth  filling  out  the  space  bounded  by  open 
GaAs  (10  0)  surface  and  slanted  sidewalls  of  Si02 
mask.  Experimentally,  we  observe  lateral  growth 
of  GaAs  over  the  Si02  mask  through  extension  of 
(1  1  l)-type  facets  leading  to  coalescence  of  adja¬ 
cent  growth  stripes.  The  field-emission  SEM  used 
in  this  work  does  not  have  sufficient  resolution  to 


1  Facet-extension  accompanies  generation  of  other  facets 
along  the  interface  between  deposited  GaAs  and  Si02  mask. 
Besides,  kink  energies  due  to  facet  intersections  must  be  also 
considered  for  rigorous  approach.  More  detailed  analysis  is 
presently  underway  and  will  be  reported  elsewhere. 


allow  investigation  of  the  fine  details  of  the  region 
along  the  Si02  sidewall. 

What  happens  as  growth  is  continued  further 
after  coalescence?  For  the  SEM  images  of  Fig.  6(b) 
taken  after  a  deposition  of  t<±  =  400  nm  on 
d  =  180-220 nm  Si02-patterned  GaAs  (1  0  0)  sub¬ 
strate,  all  of  the  Si02  stripes  are  buried  in 
deposited  GaAs.  The  top  surface  shown  in  Fig.  6 
is  not  flat,  but  mainly  consists  of  ~(3  1  1)  facets 
just  as  in  the  initial  stage  of  facet  formation  of 
Fig.  1(b).  Fig.  4  gives  some  clues  about  the  reasons 
for  this  (311)  terminated  surface  after  complete 
coalescence.  In  Fig.  4,  the  numbered  arrows  (1-5) 
indicate  the  sequential  stages  of  evolution  after 
coalescence.  Owing  to  the  fluctuation  in  w,  several 
different  evolution  stages  can  be  observed  from  a 
single  sample.  The  evolution  order  is  from  stage  1 
to  5  with  the  coalescence  point  shown  in  Fig.  4(b) 
(1)  as  the  starting  point.  Once  coalescence  occurs 
between  adjacent  (111)  type  facets,  the  triangular 
dip  in  the  cross  section  starts  to  fill  in.  We 
speculate  that  the  steep  junction  of  the  two 
(111)  faces  at  the  coalescence  point  will  form  a 
rounded  concave  profile  with  a  radius  of  high 
curvature  at  the  very  initial  stage.  A  rounded 
profile  with  high  curvature  may  be  energetically 
unstable  compared  with  a  flat  one  since  such  a 
surface  does  not  have  a  well-defined  surface 
reconstruction.  Then,  some  portion  of  subse¬ 
quently  arriving  Ga  atoms  which  have  a  surface 
migration  length  much  greater  than  the  lateral 
dimension  of  the  (111)  facets  move  to  this  area  to 
reduce  surface  free  energy  and  total  area,  and  as  a 
result  gradually  fill  out  this  triangular  dip.  The 
rounded  surface  of  stage  2  of  Figs.  4(a)  and  (c)  are 
examples  of  this  filling  process.  A  well-known 
example  similar  to  this  filling  is  growth  of  a 
quantum  wire  in  a  Y-groove.  As  seen  in  stage  4  of 
Figs.  4(a)  and  (b),  however,  this  filling  process 
eventually  results  in  new  (3  1  l)-type  facets  on  the 
top  surface.  This  also  could  be  due  to  the 
fluctuation  of  w  in  Si02  pattern,  although  as 
mentioned  above,  nonlinearities  in  the  growth 
process  seem  to  accentuate  the  roughness  com¬ 
pared  with  the  original  pattern.  Since  the  degree  of 
filling  between  adjacent  triangular  dips  is  different 
because  of  pattern  nonuniformity,  dips  will  fill  [e.g. 
form  flat  (10  0)  surfaces]  at  different  growth  times. 
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As  the  growth  is  continued  further,  these  area- 
limited  (1  0  0)  surfaces  will  transform  into  (3  1  1)- 
type  surfaces  just  as  observed  in  Fig.  1(b).  Unlike 
Fig.  1(b),  however,  the  transformed  (3  1  l)-type 
surfaces  of  Fig.  4  do  not  have  nearby  Si02  mask 
stripes  anymore.  Depending  on  the  status  of 
neighboring  ridges,  these  (3  1  l)-type  surfaces  can 
grow  larger  than  the  original  d  as  indicated  by 
stage  5  in  Fig.  4(c).  Thus,  (3  1  1)  facets  appear  first 
at  the  earliest  growth  stages,  are  replaced  by  (1  1  1) 
facets  as  the  growth  proceeds  until  coalescence, 
and  finally  reappear  on  the  top  surface  as  the 
growth  is  continued. 


4.  Summary  and  conclusion 

We  have  examined  dynamical  faceting  in 
nanoscale  homoepitaxial  lateral  growth  on  GaAs 
(10  0)  by  molecular  beam  epitaxy.  Si02  patterns 
prepared  by  interferometric  lithography  and  dry 
etching  were  used  to  define  open  GaAs  stripe 
surfaces  with  a  lateral  dimension  ranging  from  130 
to  250  nm  along  [0  I  1]  directions.  Selective  growth 
resulted  in  facet  formation  and  lateral  growth  over 
15-80nm-wide  and  25-nm-thick  Si02  mask  de¬ 
pending  on  the  amount  of  deposition.  At  the  early 
stage  of  growth,  (311)  facets  appear  and  are 
gradually  replaced  by  (111)  facets  as  growth  is 
continued  in  these  restricted  areas.  When  (111) 
facets  are  fully  developed  as  sidewalls  of  a  GaAs 
stripe,  the  growth  proceeds  in  lateral  direction 


perpendicular  to  [0  I  1]  until  coalescence  between 
adjacent  triangular  cross-sectioned  GaAs  stripes 
occurs.  Dynamical  faceting  was  interpreted  with 
minimization  of  total  surface  free  energy  based  on 
equilibrium  crystal  shape  theory  which  qualita¬ 
tively  agreed  with  our  experimental  results. 
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Growth  of  InAs  islands  on  a  GaAs(OOl)  substrate  patterned  with  ~  50-  to  200-nm  diameter  holes 
in  an  SiC>2  mask  overlayer  providing  selective  GaAs  nucleation  areas  is  reported.  The  nanoscale 
pattern  was  generated  in  the  Si02  film  by  large-area  interferometric  lithography  and  dry  etching. 
Two-dimensional,  285-nm  period,  arrays  of  InAs  islands  having  heights  of  10-  to  15-nm  with 
three  different  bottom  diameters  of  50-  to  100-nm,  -150  nm,  and  -200  nm,  were  selectively 
grown  on  Si02  patterned  substrates  by  molecular  beam  epitaxy.  Growth  conditions  were  chosen  to 
provide  a  very  low  sticking  coefficient  of  In  atoms  on  the  Si02  surface  excluding  any  volume  con¬ 
tribution  from  migration  of  In  atoms  incident  on  Si02  mask  region  to  nearby  open  GaAs  surface 
areas.  Formation  of  spherical- section  InAs  dots  with  diameters  of  about  50  nm  entirely  relying  on 
nanoscale-limited  area  growth  is  demonstrated.  As  the  diameter  of  hole  increases  beyond  150  nm, 
InAs  islands  deviate  from  a  spherical-section  and  self-assembled  QDs  confined  within  the  open 
GaAs  surface  appear.  A  relation  between  activation  of  self-assembly  and  the  nanoscale  growth  area 
is  proposed,  with  an  apparent  transition  from  layer-by  layer  growth  to  self-assembly  occuring  at 
hole  diameters  of  -  100-  to  150-nm. 


I.  Introduction 

Self-assembled  QDs  have  significant  potential  for  many 
device  applications.  In  many  cases,  additional  control  of  the 
size  and  positional  distribution  of  the  dots  over  that  possi¬ 
ble  so  far  with  self-assembled  growth  would  be  desirable. 
Recently,  there  have  been  several  attempts  to  control  the 
nucleation  of  self-assembled  quantum  dots  (QDs)  with  pat¬ 
terned  substrates1.  Most  of  these  have  concentrated  on 
growth  of  InAs  or  InxGai_xAs  self-assembled  QDs  on  a 
GaAs  or  a  InP  substrate  using  optical  or  electron-beam  li¬ 
thography  for  patterning  either  an  overlayer  mask  or  ^the 
substrate2'6  or  have  exploited  facet-dependent  migration.2. 

Recent  progress  in  lithographic  technologies  has  ex¬ 
panded  the  regime  of  accessible  mask/substrate  patterning 
into  the  nanoscale.  A  pattern  for  which  the  scale  approaches 
the  lateral  dimension  of  typical  self-assembled  QDs  strongly 
suggests  the  capability  to  grow  QDs  on  a  limited  area  of  a 
substrate  with  better  controllability  in  both  size  and  spatial 
distribution.  This  so-called  nanoscale  patterned  growth  basi¬ 
cally  requires  a  highly  uniform  pattern  and  a  selective 
growth  mode. 

We  have  previously  reported14  on  nanoscale,  homoepitaxial 
molecular  beam  epitaxy  (MBE)  selective  growth  of  GaAs 
involving  migration-enhancement  by  surface  diffusion  of 
incident  Ga  atoms  from  the  masked  areas  onto  the  growth 
areas.  Recently,  formation  of  InAs  islands  on  a  Ga2C>3-  or 
Si02  -patterned  substrates3,6  using  similar  In-atom  migration 
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processes  have  been  reported.  However,  in  these  experiments 
the  lateral  size  of  the  open  substrate  surface  is  far  greater 
than  the  lateral  dimension  of  typical  self-assembled  InAs 
QDs. 

InAs  island  formation  on  a  patterned  substrate  is  basi¬ 
cally  different  from  self-assembled  QD  formation  on  a  un¬ 
patterned  substrate.  First,  mask-patterned  growth  does  not 
provide  an  extensive  wetting  layer  for  initiation  of  the 
Stranski-Krastanov  (S-K)  growth  mode.  Self-assembly  is  a 
relaxation  process  for  reduction  of  strain  energy.  This  sug¬ 
gests  that  the  migration  of  an  In  atom  on  a  GaAs  surface 
becomes  a  directional  movement  for  strain  reduction  once 
nucleation  of  QDs  begins.  However,  migration  of  an  In 
atom  on  a  Si02  surface  is  irrelevant  to  strain  relaxation. 
This  random  movement  provides  a  chance  for  In  atoms  to 
be  accidentally  captured  by  nearby  InAs  island  being  grown 
on  a  open  substrate  surface,  or  ends  up  with  desorption  from 
the  Si02  surface  during  migration.  As  a  pattern  period  de¬ 
creases,  this  accidental  capturing  due  to  migration  becomes 
more  significant.  In  spite  of  deposition  of  very  thin  InAs 
layer  of  about  -  2  monolayers  (MLs),  accidental  capturing 
makes  it  possible  to  form  islands  or  QDs  in  reported  pat¬ 
terned  growths.3,6  We  refer  to  this  mode  of  InAs  island  (or 
QD)  formation  on  a  patterned  substrate  as  migration-assisted 
QD  formation. 

Recently,  the  role  of  a  wetting  layer  in  the  activation  of 
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S-K  growth  mode,  ’  the  control  of  shape  and  size  and 
the  carrier  transport19,20  between  QDs  on  unpatterned  sub¬ 
strates  have  been  investigated.  However,  the  impact  of  the 
lateral  area  of  the  wetting  layer  on  SK  mode  growth  has  not 
been  clearly  established.  This  is  another  subject  which  must 
be  investigated  in  nanoscale  patterned  growth.  In  this  work, 
we  examine  the  growth  mode  for  formation  of  InAs  QDs  on 


a  nanoscale  Si02-patterned  substrate  in  a  new  regime.  We 
choose  growth  conditions  where  the  sticking  coefficient  of 
an  In  atom  on  a  Si02  surface  is  essentially  zero  as  evidenced 
by  the  absence  of  any  detectable  growth  on  an  unpattemed 
Si02  film.  Then,  formation  of  InAs  islands  is  only  due  to 
the  incoming  flux  to  a  open  surface  of  a  GaAs  substrate 
bounded  by  Si02  mask.  This  implies  that  accidental  capture 
due  to  In  migration  along  the  Si02  surface  is  minimized 
since  essentially  all  of  the  incident  In  atoms  are  rapidly 
desorbed  from  the  Si02  surface.  Unselective  growth,  for 
which  the  surface  diffusion  length  along  the  mask  is  less 
than  the  QD  size,  using  lift-off  processes  also  results  in  a 
limited  area  deposition  and  does  not  allow  accidental  captur- 
e  of  itinerant  In  atoms.  In  a  broad  sense,  we  refer  to  these 
QD  formation  mechanisms  without  a  large-area  wetting 
layer  and  without  growth  contributions  from  atoms  incident 
on  the  masking  layer  in  the  nanoscale  regime  as  nanoscale 
limited  area  growth  (NLAG).  The  focus  of  this  work  is  to 
examine  QD  formation  by  MBE  in  nanoscale  limited  areas, 
the  ultimate  goal  of  NLAG  will  be  to  customize  the  QD 
shape,  size,  spacing,  density  and  area  coverage. 

We  examine  the  formation  of  InAs  islands  on  a 
nanoscale  limited  area  of  GaAs(OOl)  substrate.  A  285-nm 
period,  2-D  array  of  circular  holes  with  diameters  ranging 
from  $  100  nm  to  200  nm  was  generated  through  a  Si02 

film  on  GaAs.  This  Si02-pattemed  substrate  was  prepared 
by  large-area  interferometric  lithography  and  dry  etching. 
InAs  islands  were  grown  by  MBE.  Different  growth  modes, 
indicating  the  nanoscale  regime  for  which  self-assembly  (S- 
K  growth)  becomes  important,  were  found  across  this  range 
of  hole  diameters  in  the  Si02.  We  propose  a  relationship 
between  self-assembly  and  the  area  of  any  wetting  layer  in 
the  S-K  growth  mode. 

II.  Nanoscale  patterning 

Reproducible  nanoscale  patterning  is  essential  to  realiz¬ 
ing  NLAG.  For  pattern  formation,  we  used  large-area  IL 
which  is  very  promising  as  a  low-cost,  reliable,  and  scalable 
technology  for  deep  sub-100-nm  scales.  The  structure  of  the 
sample  for  processing  consists  of  a  negative-tone  photoresist 
(PR)  film,  an  antireflection  coating  (ARC),  a  Si02  film,  and 
a  GaAs(OOl)  substrate.  A  120-nm  thick  ARC  film,  with 
strong  absorption  at  the  IL  wavelength,  was  embedded  be¬ 
tween  the  PR  and  the  Si02  film  to  suppress  vertical  stand¬ 
ing  wave  effects.  A  80-nm  thick  Si02  film  deposited  by 
electron-beam  evaporation  was  used  as  the  mask  material.  A 
single  longitudinal  and  transverse  mode,  frequency  tripled 
YAG  laser  at  a  wavelength  of  355  nm  was  used  as  an  ultra¬ 
violet  light  source.  Using  IL,  a  285-nm  period  interference 
pattern  was  generated  on  the  PR  film.  Figure  1(a)  shows  a 
45°-tilted  scanning  electron  microscope  (SEM)  image  of  a 
developed  PR  pattern  (all  SEM  images  presented  herein  are 
45°-tilted).  As  seen  in  the  figure,  a  2-D  array  of  circular 
holes  of  diameter  -100  nm  is  generated  on  the  PR  film.  The 
thickness  of  the  PR  film  after  development  was  140-160 
nm.  The  detailed  process  for  2-D  array  of  hole  patterns  by 
IL  has  been  summarized  elsewhere.15  Figure  1(b)  is  a  SEM 
image  revealing  the  sample  film  stack. 

Transfer  of  the  PR  pattern  into  the  mask  material  with¬ 
out  degradation  is  a  very  important  step  for  NLAG.  Dry 
etching  with  CF4  plasma  was  used  for  pattern  transfer.  Fig- 


Figure  1  A  45°-tilted  SEM  image  of  (a)  a  2-D  array  of 
holes  generated  in  a  negative  PR  film  and  (b)  the  sample 
film  structure. 


ure  2  shows  SEM  images  of  a  2-D  array  of  holes  transferred 
onto  the  Si02  film  through  the  ARC  material.  Both  Si02, 
ARC,  and  PR  films  can  be  etched  by  CF4  plasma.  The 
ARC  has  an  etch  rate  that  is  about  50%  that  of  the  Si02 
film  while  it  has  almost  same  etch  rate  as  the  PR  film. 
These  three  materials  do  not  have  good  selectivities  in  CF4 
plasma  etching,  but  the  film  thicknesses  are  nonetheless 
sufficient  to  allow  good  pattern  transfer.  As  seen  in  Fig.  2 
(a),  the  transferred  pattern  is  quite  uniform.  Also,  note  that 
the  hole  diameter,  d ,  in  the  Si02  film  of  Fig.  2  (b)  is  less 
than  the  -100-nm  diameter  of  the  starting  PR  film  of  Fig. 
1 .  This  is  due  to  the  formation  of  slanted  sidewalls  for  the 
holes  in  the  Si02  film  during  the  etch  process.  The  range  of 
hole  diameters  in  Fig.  2  (b)  is  about  50-  to  80-nm.  To  our 
knowledge,  this  represents  the  smallest  area  substrate  open¬ 
ing  ever  explored  for  2-D  patterned  growth.  These  sub- 100- 
nm  diameter  holes  provide  incorporation  routes  for  In  atoms 
to  the  substrate  surface  for  epitaxial  growth.  By  varying  the 
etching  time,  different  d} s  ranging  up  to  200  nm  could  be 
produced.  For  this  work,  three  different  hole  diameters  were 
prepared:  50-  to  80-nm,  -150  nm,  and  -200  nm. 
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Figure  2  (a)  A  45°-tilted  SEM  image  of  a  2-D  array  of 
holes  transferred  through  the  SiC>2  film  by  dry  etching 
and  (b)  a  higher  magnification  SEM  image. 
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All  of  the  InAs  QDs  and  islands  presented  in  this  work  in¬ 
cluding  the  InAs  layer  grown  on  an  unpatterned  substrate 
were  obtained  from  a  single  growth  run  on  a  single  substrate 
which  was  treated  with  different  dry  etch  times  using  a  me¬ 
chanical  mask  to  obtain  the  various  hole  diameters.  Thus, 
all  of  the  results  were  obtained  under  exactly  the  same 
growth  conditions  with  the  same  In  flux. 

III.  MBE  growth 

To  remove  any  surface  contamination  and  assure  good 
crystal  quality  at  the  growth  surface,  about  5-nm  of  GaAs 
was  removed  from  the  open  surface  of  the  substrate  by  wet 
etching  using  a  highly  diluted  etchant  (H2SO4  :  H2O2  = 
1:4)  having  etch  rate  of  2.0  -  2.5  nm/s  for  2  seconds  imme¬ 
diately  before  loading  the  sample  into  the  MBE  chamber.  A 
small  part  of  the  Si02  pattern  on  the  substrate  was  com¬ 
pletely  removed  so  that  surface  oxide  removal  from  the  sub¬ 
strate  before  InAs  deposition  could  be  confirmed  through 
the  RHEED  pattern.  The  surface  oxide  removal  step  was 
performed  before  the  start  of  growth  at  620°C  for  more  than 
10  minutes. 

Approximately  10  nm  of  InAs  was  then  deposited  by 
MBE  on  the  Si02-patterned  substrate  shown  in  Fig. 2  with 
the  flux  ratio  of  As  to  In  set  to  1 0  in  beam  equivalent  pres¬ 
sure.  This  thickness  is  about  twice  the  typical  height  of  self- 
assembled  InAs  QDs  on  an  unpattemed  GaAs  substrate.  The 
growth  temperature  was  maintained  at  535°C,  where  desorp¬ 
tion  of  In  atoms  from  GaAs(OOl)  already  begins.21  The  re¬ 
ported  incorporation  fraction  of  an  In  atom  at  547°C  is 
about  0.94  for  an  Ino.15Gao.85 As  surface  which  is  negligi¬ 
ble,22  but  could  be  higher  than  this  value  in  case  of  InAs 
surface.23  High  quality  InAs  QDs  have  been  generated  under 
these  growth  conditions.1  The  growth  temperature  was 
monitored  by  optical  pyrometry.  The  flux  of  In  atoms  was 
controlled  to  achieve  a  growth  rate  of  0.1  ML/s  which  was 
separately  calibrated  by  reflection  high  energy  electron  dif¬ 
fraction  (RHEED)  oscillations.  By  examining  growth  on 
unpattemed  Si02  films,  we  found  that  the  sticking  coeffi¬ 
cient  of  an  In  atom  on  an  unpattemed  Si02  film  is  signifi¬ 
cantly  reduced  around  510°C  and  becomes  unobservable  at 
535°C  under  these  flux  conditions.  Therefore,  these  growth 
conditions  enable  growth  of  InAs  QDs  without  any  contri¬ 
butions  from  migration-assisted  QD  formation. 

IV.  Results  and  discussion 

Figure  3  shows  SEM  images  of  the  as-grown  sample  (a) 
and  of  a  section  of  the  same  sample  after  an  HF-etch  to  re¬ 
move  the  Si02  mask  (b)  for  the  smallest  diameter  hole  ar¬ 
ray.  Comparing  Fig.  2  and  Fig.  3(a),  no  growth  is  observed 
on  the  Si02  film.  During  growth,  no  ring  patterns  appeared 
in  the  in  situ  RHEED,  implying  that  poly  crystalline  InAs 
was  not  formed  on  the  Si02  surface.  These  results  demon¬ 
strate  that  selective  growth  occurred  in  this  heteroepitaxy.  In 
this  selective  growth  regime,  dipping  the  sample  in  diluted- 
HF  completely  removes  the  Si02  mask  as  seen  in  Fig.  3(b). 

The  bottom  diameter  of  the  InAs  island  shown  in  Fig.  3 
is  in  the  range  of  50-  to  100-nm  close  to  those  of  the  holes 


Figure  3  45° -tilted  SEM  images  of  a  2-D  array  of  InAs 
islands  deposited  on  the  Si02 -patterned  substrate  (a)  as- 
grown  state  and  (b)  after  HF  treatment  to  strip  the  SiC>2 . 

of  Fig.  2(b)  originally  patterned  on  the  Si02  mask.  Al¬ 
though  the  Si02  pattern  in  Fig.  2  looks  uniform,  the  InAs 
islands  of  Fig.  3  have  a  noticeable  fluctuation  in  size  and 
shape.  Generally,  small-diameter  holes,  nearer  the  limits  of 
our  fabrication  techniques,  have  larger  size  fluctuations.  As 
shown  in  Fig.  2(b),  this  fluctuation  is  partly  due  to  irregu¬ 
larities  in  the  hole  pattern.  The  general  shape  of  holes  in 
Fig.  2(b)  is  close  to  a  circle  but  with  a  rough  periphery. 
Since  deposited  InAs  was  only  about  10  nm  in  thickness, 
no  faceting  was  observed  on  the  surface  of  InAs  islands.  As 
a  result,  the  overall  shape  of  the  InAs  islands  seems  to  de¬ 
pend  on  the  shape  of  holes  allowing  epitaxial  growth 
nucleation.  In  Fig.  3,  InAs  islands  having  d  <  80  nm  have  a 
shape  similar  to  a  spherical  section.  The  detailed  island 
shape  will  be  discussed  below. 

Figure  4  shows  a  SEM  image  of  a  single  as-grown  InAs 
dot  for  which  the  lateral  dimension  is  -60  nm.  It  is  known 
that  the  average  lateral  dimension  of  InAs  QDs  increases 
with  growth  temperature  and  becomes  greater  than  30  nm  at 
535°C.1  Although  there  remain  issues  as  to  the  precise 
shape  and  size  of  self-assembled  QDs,24"26  the  dot  in  Fig.  4 
is  within  about  a  factor  of  two  in  linear  dimension  com¬ 
pared  with  typical  self-assembled  QDs  but,  it  should  be 
emphasized,  any  wetting  layer  does  not  extend  beyond  the 
lateral  dimensions  of  the  dot.  The  InAs  dot  of  Fig.  4  is  thus 
equivalent  to  a  QD  grown  in  an  artificial  Volmer-Weber  (V- 
W)  growth  mode.  Without  S-K  mode,  therefore,  growth  of 
InAs  QDs  having  diameter  of  60  nm  only  relying  on  NFAG 
is  demonstrated. 
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Figure  4  A  45°-tilted  SEM  image  of  an  as-grown  InAs 
dot  of  d  ~  50  nm. 

Recently,  there  have  been  two  reports  of  InAs  and  In- 
GaAs  QD  formation  without  a  wetting  layer.  Mano  et  al. 
reported  InGaAs  QDs  on  a  GaAs  substrate  using  alternative 
deposition  of  In  and  Ga  droplets  by  low  temperature 
MBE.27  Tsatsul’nikov  et  al.  reported  V-W  mode  growth  of 
InAs-(Al,Ga)As  quantum  dots  with  an  ultrathin  AlAs  cap¬ 
ping  layer.28  In  both  cases,  unpattemed  substrates  were  used 
but  no  wetting  layers  were  formed27  or  remained28  after 
growth  of  a  capping  layer.  One  of  the  common  results  is  a 
change  of  QD  shape.  The  QD  height  was  in  the  range  of 
about  8-12  nm  which  is  greater  than  the  4-  to  6-nm  typical 
of  self-assembled  QDs  on  an  unpattemed  substrate1  while 
the  lateral  dimensions  are  in  different  ranges  (30  nm  in  Ref. 
27  and  7-  to  9-nm  in  Ref.  28).  They  confirmed  the  absence 
of  dislocations  on  their  QDs  with  photoluminescence  and 
transmission  electron  microscopy.  These  results  imply  that 
the  absence  of  wetting  layers  allows  a  different  shape  of 
QDs,  taller  than  self-assembled  QDs,  on  an  unpatterned 
substrate.  Although  the  physical  origins  of  this  QD  shaping 
have  not  been  explicitly  clarified,  these  reports  suggest  that 
the  presence  of  wetting  layers  is  very  important  in  QD  for¬ 
mation  by  S-K  mode,  as  expected. 

In  consideration  of  volume  comparison  between  the  InAs 
dot  of  Fig.  4  and  typical  self-assembled  QDs  with  and 
without  a  wetting  layer,  the  InAs  dot  might  be  strain  re¬ 
laxed  because  its  lateral  size  is  about  twice  that  of  self- 
assembled  QDs.  It  has  been  reported  that  self-assembled 
InAs  QDs  relaxed  by  an  excess  supply  of  In  atoms  still  keep 
a  similar  shape  to  unrelaxed  QDs  except  for  their  larger  size. 
Even  if  it  is  relaxed,  minimization  of  surface  free  energy 
would  lead  to  formation  of  a  spherical  shape  for  an  InAs 
island  as  observed  in  Fig.  3  and  4.29’30 

As  previously  mentioned,  different  etching  times  for 
pattern  transfer  resulted  in  a  variation  of  hole  diameter. 
Figure  5  shows  SEM  images  obtained  from  InAs  islands 
having  d  <  100  nm  [(a)  and  (b)],  ~  150  nm  [(c)],  ~  200  nm 
[(d),  (e)  and  (f)],  and  an  InAs  layer  grown  on  an  unpattemed 
substrate  region  [(g)].  Figures  6  and  7  are  atomic  force  mi¬ 
croscope  (AFM)  images  of  InAs  islands  and  their  corre¬ 
sponding  profiles  which  have  similar  diameters  to  those  of 
Fig.  5,  respectively.  Figure  7  represents  the  profiles  ob¬ 
tained  along  the  direction  of  the  line  (A  to  B)  indicated  in 
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Figure  5  45°-tilted  SEM  images  of  InAs  islands  depos¬ 
ited  on  holes  with  a  diameter  of  50-  to  100-nm  [(a)  and 
(b)],  -  150  nm  [(c)],  -  200  nm  [(d),  (e)  and  (f)],  and  an 
InAs  layer  grown  on  an  unpatterned  substrate  region 
[(g)]. 


Fig.  7(f).  Dotted  lines  in  Fig.  7  are  the  reference  height  cor¬ 
responding  to  the  GaAs  substrate  surface. 

As  seen  in  Figs.  5  and  6,  the  InAs  islands  have  a  circu¬ 
lar  shape  in  45°-tilted  and  top  view  when  d  is  less  than  or 
comparable  to  150  nm.  Fig  7  (a-c)  shows  the  shape  and 
height  of  these  islands  which  have  a  spherical- section  shape 
with  a  height,  h ,  of  about  10-  to  15-nm.  This  height  is 
close  to  that  reported  in  Ref.  27  and  28.  The  spherical- 
section  shape  can  be  understood  since  In  atoms  incident  near 
the  boundary  between  GaAs  and  Si02  surfaces  move  to  an 
inner  side  of  a  hole  to  find  a  more  stable  adsorption  site. 

Although  the  physical  dimension  of  these  dot-shaped 
InAs  islands  or  ‘big  dots’  is  not  highly  uniform,  their 
heights  are  2-  to  3 -times  larger  than  those  of  self-assembled 
InAs  QDs  on  GaAs(001).  Also,  the  height  of  an  InAs  big 
dot  is  at  most  50%  greater  than  the  thickness  expected  from 
the  In  flux  and  the  deposition  time.  Assuming  an  ellipsoi¬ 
dal  section  shape,  however,  the  extra  volume  increment  is 
almost  zero  compared  with  the  volume  of  a  cylindrical- 
shaped  island  with  the  same  bottom  area  and  the  nominal 
growth  height.  These  results  imply  that  there  is  no  addi¬ 
tional  volume  contribution  due  to  In  atoms  migrating  from 
nearby  Si02  regions  to  the  open  GaAs  surface,  counter  to 
the  situation  in  migration-assisted  QD  formation.  The  size- 
independent  height  shown  in  Fig.  7  is  further  evidence  of 
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Figure  6  AFM  images  of  InAs  islands  deposited  on 
holes  with  a  diameter  of  50-  to  100-nm  [(a)  and  (b)],  ~ 

150  nm  [(c)],  and  ~  200  nm  [(d),  (e)  and  (f)],  similar  to 
Fig.  5. 

the  absence  of  contributions  to  the  growth  from  In-atoms 
migrating  from  the  surrounding  Si02  material.  The  very  low 
sticking  coefficient  of  an  In  atom  on  a  Si02  surface  at  the 
given  growth  conditions,  implying  extremely  short  surface 
lifetime,  therefore,  excludes  migration-assisted  QD  forma¬ 
tion,  as  predicted. 

Around  each  InAs  island,  a  circular  step  along  with  sev¬ 
eral  small  dips  indicated  by  a  white  dashed  circle  and  black 
arrows  in  Fig.  6(a),  are  observed  in  all  AFM  images  of  Fig. 
6.  Although  such  steps  and  dips  were  already  observed  in 
the  SEM  results  (Fig.  3),  they  appear  much  more  clearly  in 
the  AFM  images.  The  depth  of  these  dips  is  in  the  range  of 
5-  to  9-nm.  While  the  circular  steps  resulted  from  the  wet 
etching  of  the  substrate  which  was  performed  before  loading 
into  the  MBE  chamber,  their  depth  was  not  precisely  meas¬ 
ured  but  is  less  than  that  of  the  small  dips  around  them. 
The  dips  [the  numbered  black  spots  in  Fig.  6(a)]  were  gen¬ 
erated  either  near  the  periphery  of  an  InAs  island  (1)  or 
about  40  -  60  nm  away  from  it  (2  and  3)  and  are  independ¬ 
ent  of  the  size  of  the  InAs  islands.  Once  a  dip  is  generated 
on  the  periphery,  no  additional  dips  form  in  the  same  direc¬ 
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Figure  7  AFM  profiles  of  InAs  islands  of  Fig.  6  ob¬ 
tained  along  the  direction  of  the  line  (AB)  indicated  in 
Fig.  7(f).  Dotted  lines  are  the  reference  height  corre¬ 
sponding  to  the  GaAs  substrate  surface. 

tion  from  the  center  of  an  island.  They  are  likely  due  to 
nonuniform  wet  etching.  A  GaAs  surface  interfaced  with  a 
Si02  film  can  have  an  etch  rate  faster  than  the  bulk  as  a  re¬ 
sult  of  stress  at  the  interface  and  capillary  effect,  and  can 
form  a  circular  step  with  a  diameter  greater  than  the  hole 
size  in  the  Si02.  Moreover,  some  wet  etchant  trapped  near 
the  boundary  between  the  Si02  film  and  the  GaAs  surface 
might  not  be  removed  rapidly  in  the  deionized  water  rinse 
used  to  quench  the  etch.  Such  etchant  trapping  could  pro¬ 
duce  the  dips  shown  in  Fig.  6.  Thus,  reliable  surface  treat¬ 
ment  for  nanoscale  open  substrate  surfaces  is  another  impor¬ 
tant  issue  to  be  dealt  with  in  NLAG. 

In  Fig.  5  and  6,  the  shape  of  the  InAs  islands  deforms 
from  a  spherical-section  when  d  exceeds  -150  nm.  For  ex¬ 
ample,  the  InAs  island  of  Fig.  6(d)  of  which  the  top  view  is 
close  to  a  circle  has  a  deformed  spherical-section  profile,  as 
seen  in  Fig.  7(d).  If  d  is  increased  further,  as  shown  in 
Fig. 5(e)  and  (f)  and  Fig.  6(e)  and  (f),  the  deposited  InAs 
does  not  retain  a  circular  shape  but  rather  forms  a  random¬ 
shaped  island  within  each  hole.  Fig.  7(e)  and  (f)  reveal 
strongly  distorted  profiles.  However,  their  shape  is  still 
different  from  that  of  an  InAs  layer  deposited  on  an  unpat¬ 
terned  region  which  exhibits  surface  undulations  for  the 
reduction  of  strain  energy  as  shown  in  Fig  5(g).  As  d  in¬ 
creases  beyond  150  nm,  a  self-assembling  mechanism  seems 
to  be  activated.  The  ring-type  InAs  island  of  Fig.  5(f)  is 
similar  to  a  coalesced  form  of  several  self-assembled  QDs. 
Since  the  deposited  InAs  was  10-nm  thick,  formation  of 
self-assembled  QDs  and  their  coalescence  is  very  feasible  in 
wide  holes.  From  Fig.  5,  6,  and  7,  it  can  be  deduced  that 
the  area  allowed  for  nucleation  of  InAs  on  GaAs(100)  sur¬ 
face  strongly  effects  the  growth  mode,  especially  the  island 
shape  and  morphology.  This  variation  is  mainly  due  to  lat¬ 
tice-mismatch-induced  stress  effects  which  increase  with 
deposition  area. 


As  indicated  by  arrows  in  Fig.  8  (a),  very  small-sized 
InAs  dots  were  occasionally  formed  alongside  the  larger 
InAs  islands.  Approximately  15%  to  20%  of  the  InAs  is¬ 
lands  with  d  >  150  nm  exhibit  such  small  dots.  These 

small  dots  were  not  observed  when  d  <  1 00  nm.  The  diame¬ 
ter  of  these  small  dots  is  about  30  -  40  nm  which  is 
comparable  to  the  lateral  dimension  of  typical1  self- 
assembled  InAs  QDs  grown  at  535°C.  In  Fig.  8(b),  the 
height  of  the  small  dot  is  about  4  nm  which  is  also  very 
close  to  that  of  typical  self-assembled  InAs  QDs  while  other 
large-sized  dots  in  Fig.  8(b)  are  more  than  8-nm  high.  The 
large-sized  dots  in  Fig.  8(b)  are  consistent  with  the  10-nm 
deposition.  Therefore,  we  argue  that  the  small  dots  are  evi¬ 
dence  of  S-K  growth  mode  self-assembly  for  open  hole  areas 
with  diameters  d  >  150  nm  on  a  GaAs(001)  surface  bounded 
by  an  SiC>2  mask. 

As  mentioned  above,  the  shape  of  the  InAs  islands  for  d 
~  200  nm  in  Fig  5(f)  could  be  due  to  coalescence  between 
self-assembled  QDs  during  growth.  More  than  20%  of  the 
InAs  islands  grown  in  holes  with  d  >  150  nm  then  might 

have  self-assembled  QDs  at  least  in  coalesced  form.  Forma¬ 
tion  of  self-assembled  QDs  on  those  holes  can  be  under¬ 
stood  by  large  open  areas  which,  while  still  limited,  are 
much  greater  than  the  size  of  self-assembled  QDs.  Strain 
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Figure  8  (a)  AFM  image  revealing  formation  of  self- 
assembled  QDs  and  large-sized  dots,  and  (b)  an  AFM 
profile  of  a  self-assembled  QD. 


energy  depends  on  area  as  well  as  thickness.31,32  At  the  early 
stages  of  growth,  a  few  monolayer-thick  strained  InAs  film 
deposited  on  such  a  relatively  large  area  could  have  suffi¬ 
cient  strain  energy  to  activate  the  S-K  growth  mode.  In  Fig. 
8(b),  more  than  one  InAs  dot  appears  within  a  single  growth 
area.  If  multiple  self-assembled  InAs  QDs  can  be  generated 
on  a  limited  area  for  d  >  150  nm,  this  also  indirectly  sup¬ 
ports  the  possibility  of  the  coalescence  between  self- 
assembled  QDs  for  larger  growth  areas.  Again,  it  must  be 
emphasized  that  the  self-assembled  InAs  QDs  shown  in  Fig. 
8  were  not  observed  for  d  <  100  nm  (Fig.  3).  Coalescence 

between  dots  on  such  small  holes  is  less  probable  because 
of  the  spherical-section  shape  and  the  comparable  height  of 
InAs  islands  to  those  of  QDs  without  a  wetting  layer.27,28  It 
is  evident  that  only  a  single  dot  can  be  generated  within  a 
single  hole  as  the  hole  diameter  approaches  the  lateral  di¬ 
mension  of  a  self-assembled  QD.  Because  of  insufficient 
strain  energy  to  activate  self-assembling  due  to  a  small, 
finite-area  wetting  layer,  a  single  dot- shaped  layer-by-layer 
growth  may  proceed  on  the  holes  of  d  <  100  nm  for  a  10- 
nm-thick  InAs  deposition. 

These  results  imply  a  criterion  on  the  critical  diameter  of 
holes,  dc ,  for  generation  of  self-assembled  QDs.  In  this 
work,  100  nm  <  dc<  150  nm  at  535°C  at  a  growth  rate  of 
0.1  ML/s.  This  dc  which  is  about  5-  to  10-times  the  lateral 
dimension  of  self-assembled  InAs  QDs  (20-  to  30-nm???), 
provides  a  boundary  between  layer-by-layer  growth  and  S-K 
mode  growth  on  a  large-area  wetting  layer.  In  other  words, 
dc  is  the  minimum  diameter  for  activation  of  S-K  growth 
mode  self-assembly.  The  reported  density  of  self-assembled 
InAs  QD  grown  on  an  unpattemed  GaAs  substrate  at  535°C 

is  about  1.5x1 0^0  cm'^.1’30  Thus,  the  average  area  allocated 
to  a  single  QD  is  about  80x80  nm^,  comparable  to  dc.  Fur¬ 
ther  study  and  theoretical  modeling  will  be  necessary  to 
understand  the  role  of  the  wetting  layer  and  to  understand 
the  meaning  of  dc  in  S-K  growth  mode  QD  formation.  Ex¬ 
perimentally,  these  results  suggest  an  important  relation 
between  activation  of  self-assembly  and  area  of  the  wetting 
layer  in  S-K  mode. 

The  hole  density  for  a  285-nm  period  2-D  array  pattern 
is  1.23x1 0^  cm“2,  which  is  lower  than  the  dot  density  of 
typical  InAs  QDs  randomly  generated  on  unpattemed 
GaAs(001)  by  about  an  order  of  magnitude.  However,  sub- 
100-nm  IL  is  currently  being  developed,33  and  nanoscale 
lithography  technologies  ultimately  will  be  able  to  provide 
deep  sub-100-nm  patterns  with  hole  densities  higher  than 
the  dot  density  of  an  unpatterned  substrate. 

Our  smallest  d  is  50  nm  and  most  of  InAs  islands  in 
this  size  range  have  a  spherical- section  shape.  There  are  sev¬ 
eral  possible  ways  to  control  the  shape  and  size  of  QDs  in 
NLAG.  For  example,  QD  shape  different  from  pyramidal 
one  such  as  a  cylindrical  QD  could  be  realized  if  a  hole  di¬ 
ameter  less  than  the  lateral  dimension  of  self-assembled 
QDs  is  achieved  by  nanoscale  patterning.  Since  height  of 
such  QDs  can  be  exactly  controlled  by  deposition  time, 
customizing  QD  shape  and  size  in  three-dimension  will  be 
possible  with  NLAG. 


Relying  on  S-K  mode  for  QD  formation,  there  might  be 
an  upper  limit  in  dot  density  desirable  for  device  applica¬ 
tion.  This  is  because  the  S-K  mode  results  from  a  strain 
relaxation  process,  and  dot  density,  correlated  with  the  dot 
size  and  shape,  is  mainly  governed  by  lattice-mismatch  and 
growth  conditions.  If  we  use  migration-assisted  QD  forma¬ 
tion,  the  dot  density  can  be  increased  by  increasing  the  hole 
density.  But,  the  dot  size  is  influenced  by  accidental  captur¬ 
ing  of  itinerant  In  atoms  moving  along  the  Si02  surface. 
Generally,  this  accidental  capturing  is  proportional  to  the 
area  of  unit  period  of  the  mask,  and  as  a  result  is  critically 
affected  by  pattern  uniformity.  Therefore,  there  is  a  tradeoff 
between  dot  density  and  size  uniformity  especially  in  height 
depending  on  the  uniformity  of  the  lithographic  and  pattern 
transfer  processes. 

Dot  density  and  size  uniformity  are  directly  related  with 
gain  of  QD  lasers.  Enhancement  of  dot  density  while 
retaining  quantum-size  effects  and  narrowing  the  size  distri¬ 
bution  is  one  of  the  major  goals  in  QD  growth.  Deep  sub- 
100  nm  NLAG  which  enables  independent  control  of  dot 
density  and  size  in  wide  range  could  be  a  potential  approach 
towards  this  goal. 

IV.  Summary 

Nanoscale  limited  area  growth  (NLAG)  of  InAs  islands 
on  a  Si02-patterned  GaAs(OOl)  substrate  is  reported.  Two- 
dimensional  arrays  of  285-nm-period  circular  holes  with 
three  different  bottom  diameters  of  50-  to  100-nm,  -150 
nm,  and  -200  nm  were  generated  on  a  Si02  film  by  large- 
area  IL  and  dry  etching.  At  a  growth  temperature  of  535°C 
and  a  growth  rate  of  0.1  ML/s,  a  10-nm  thickness  of  InAs 
was  selectively  deposited  on  the  GaAs  surface  of  the  Si02- 
masked  substrate  by  MBE.  Owing  to  the  selectivity 
resulting  from  the  very  low  zero  sticking  coefficient  of  In 
atoms  on  the  Si02  surface,  two  different  regimes  of  InAs 
QDs  were  generated  on  open  GaAs  surfaces  bounded  by  the 
Si02  mask  without  migration-assisted  QD  formation.  One 
regime  gives  spherical-section  InAs  dots  with  diameters  of 
about  50-  to  100-nm  and  a  height  of  10-  to  15-nm  entirely 
relying  on  NLAG,  and  the  second  regime  is  self-assembled 
QDs  similar  to  those  on  an  unpattemed  substrate  formed  on 
holes  of  diameter  greater  than  -150  nm.  The  former  corre¬ 
sponds  to  QDs  without  a  wetting  layer,  whereas  a  nanoscale 
finite-area  wetting  layer  plays  a  significant  role  in  the  latter 
regime.  This  leads  to  InAs  islands  severely  impacted  by 
coalescence  between  self-assembled  QDs  that  was  not  ob¬ 
served  for  holes  of  diameter  less  than  -100  nm.  The  lateral 
size  of  the  wetting  layer  strongly  affects  QD  formation  in  S- 
K  mode  growth.  From  the  hole-diameter  dependent  QD 
formation,  a  criterion  on  the  size  of  holes  for  activation  of 
self-assembling  in  S-K  mode  was  proposed  and  is  100-  to 
150-nm  which  is  5-  to  10-times  greater  than  the  lateral  size 
of  typical  self-assembled  QDs  on  an  unpatterned  GaAs  sub¬ 
strate  under  similar  growth  conditions. 
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Germanium  quantum  structures  may  give  rise  to  ad¬ 
vanced  applications  in  quantum  bit  (i.e.,  qubit)  computing1,2 
and  optoelectronics.3-6  Despite  the  promising  applications, 
integrating  Ge  nanostructures  for  device  manufacturing  faces 
numerous  challenges  such  as  (1)  defect-free  epitaxial  growth, 
(2)  selective  growth  at  precise  locations,  (3)  uniform  size  and 
spatial  distribution,  (4)  vertical  alignment  of  dots  in  multiple 
stacks,  and  (5)  controlled  doping.  Among  these  challenges, 
we  intend  to  focus  on  the  selective  growth  of  defect- free  Ge. 
Many  methods  exist  to  direct  the  self-assembly  of  Ge 
nanostructures7-11  and  to  grow  high  quality  epitaxial  films.12 
We  take  advantage  of  the  nanotemplate  approach  where  Ge 
quantum  structures  grow  selectively  in  a  one-dimensional  or 
two-dimensional  (2D)  array7-11,13  through  the  template  win¬ 
dows.  The  Ge  nanostructures  coalesce  upon  reaching  the  top 
of  the  template  and  form  a  high  quality  Ge  epitaxial  lateral 
overgrowth  (ELO)  layer.12  The  relaxed,  defect-free  epitaxial 
Ge  layer  on  Si  can  also  serve  as  a  buffer  layer  on  which 
III-V  devices  can  be  fabricated.14 

Chemical  vapor  deposition  and  gas  source  molecular 
beam  epitaxy  (MBE)  with  Cl  or  H  chemistry  are  successfully 
used  to  achieve  selective  growth  of  Ge  on  patterned  Si 
substrates.12,13,15,16  However,  these  growth  techniques  re¬ 
quire  extraordinary  precaution  to  contain  commonly  used  gas 
precursors  such  as  GeH4 .  Germane  is  pyrophoric  and  toxic 
to  humans,  the  exposure  threshold  limit  value  of  time- 
weighted  average  is  only  0.2  parts  per  million.17  In  this 
work,  we  demonstrate  that  a  2D  array  of  Ge  nanostructures 
can  be  selectively  grown  on  Si  by  solid  source  MBE,  leading 
to  a  threading-dislocation-free  Ge  ELO  layer. 

To  create  the  template,  a  300-nm-thick  layer  of  thermal 
Si02  is  initially  grown  on  an  undoped  Si(100)  substrate  by 
dry  oxidation.  Interferometric  lithography18-24  by  the  355 
nm  Ar  laser  line  and  plasma  etching25-28  are  used  to  pattern 
a  2D  array  of  vias  on  the  Si02  film,  exposing  the  underlying 
Si  substrate.  The  average  diameter  and  depth  of  the  Si02  vias 
are  200  and  300  nm,  respectively.  Figure  1(a)  depicts  the 


^Electronic  mail:  meister@unm.edu 


patterned  vias  along  with  the  Ge  seeds  grown  in  a  later  step. 
The  patterned  samples  are  treated  in  Piranha  solution  for  20 
min  to  remove  carbon  contamination  and  to  chemically  oxi¬ 
dize  the  exposed  Si.  The  Piranha  solution  consists  of  four 
parts  of  H2S04  (2  M)  per  one  part  of  H202  (30  wt  %)  by 
volume.  No  HF  treatment  is  performed  on  the  patterned  Si02 
in  order  to  preserve  the  template  against  chemical  etching. 
After  a  deionized  H20  rinse  and  N2  blow-dry,  the  patterned 
sample  is  loaded  into  a  MBE  chamber  through  a  high- 
vacuum  transfer  loadlock.  The  base  pressure  of  the  ultrahigh 
vacuum  (UHV)  chamber  is  4X  1CT10  Torr.  Prior  to  Ge  ex¬ 
posure,  the  loaded  sample  is  heated  to  900  °C  in  UHV  for  15 
min  to  remove  surface  contaminants  introduced  during 
sample  transfer.  The  thermal  treatment  is  also  meant  to  par¬ 
tially  remove  the  chemical  oxide  formed  by  Piranha 
treatment.29,30  The  sample  is  then  cooled  down  to  and  main¬ 
tained  at  650  °C  for  Ge  exposure.  A  Knudsen  effusion  cell  is 
used  to  expose  the  sample  to  Ge.  The  effusion  cell  is  oper¬ 
ated  at  1 120  °C,  and  the  Ge  growth  rate  on  Si  is  0.7  ML/min. 
During  growth,  the  chamber  pressure  remains  below  1 
X  10-9  Torr.  As  a  control  experiment,  Ge  is  also  deposited 
on  a  blank  Si(100)  sample  under  identical  conditions.  The 
blank  sample  is  prepared  by  dissolving  the  thermally  grown 
Si02  in  HF  (11  wt  %  in  H20)  for  10  min.  The  blank  Si 
sample  is  then  treated  in  Piranha  and  loaded  into  the  UHV 
MBE  chamber.  The  control  sample  subsequently  undergoes 


FIG.  t.  SEM  images:  showing  (a)  selectively  seeded  Ge  islands  on  Si  in  a 
2D  array  through  the  vias  of  Si02  nanotemplate,  (b)  Ge  seeds  when  the  Si02 
template  is  removed  in  HF,  and  (c)  Ge  ELO  layer  partially  coalesced  on  top 
of  the  Si02  template. 
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preparation  steps  identical  to  those  of  the  templated  sample. 
After  Ge  exposure,  the  samples  are  characterized  by  scan¬ 
ning  electron  microscopy  (SEM),  cross  sectional  transmis¬ 
sion  electron  microscopy  (XTEM),  and  energy  dispersive 
spectroscopy  (EDS).  The  samples  for  XTEM  are  prepared  by 
mechanical  polishing  and  Ar  ion  milling. 

Figure  1  shows  the  SEM  images  that  capture  the  surface 
morphology  of  epitaxially  grown  Ge  islands  and  the  ELO 
layer.  In  Fig.  1(a),  Ge  islands  grow  selectively  on  the  ex¬ 
posed  Si  through  the  vias  of  the  Si02  nanotemplate.  No  vis¬ 
ible  Ge  nucleation  occurs  on  the  Si02  surface.  The  edges  of 
the  vias  after  plasma  etching  do  not  remain  perfectly  circular, 
and  this  roughness  is  transferred  to  the  edges  of  the  Ge  struc¬ 
tures  [Fig.  1(b)].  Faceted  Ge  islands  are  revealed  in  Fig.  1(b) 
after  the  Si02  template  is  etched  in  HF  (11  wt  %  in  H20). 
Continued  growth  leads  to  the  coalescence  of  Ge  islands, 
forming  an  ELO  layer  shown  in  Fig.  1(c).  The  selective 
growth  of  Ge  on  Si  over  Si02  stems  from  the  thermal  insta¬ 
bility  GeO  and  SiO  at  the  growth  temperature,  despite  the 
use  of  Ge  solid  source  and  the  absence  of  selectivity¬ 
controlling  agents  such  as  Cl  or  H.13  Germanium  is  known  to 
scavenge  O  from  Si02  and  to  form  GeO,  which  sublimates  at 
625  °C.31  Surnev32  also  reports  that  GeO  and  SiO,  produced 
during  the  reaction  between  Ge  and  Si02 ,  start  to  sublimate 
at  410  and  670  °C,  respectively.  Wado  et  al.  also  suggest  that 
the  desorption  of  Ge  and  GeO  from  the  Si02  surface  at  el¬ 
evated  temperatures  enhances  the  selectivity.33  In  future  pub¬ 
lications,  we  will  directly  address  the  selectivity  issue  by 
probing  Ge  adspecies  on  partially  oxidized  Si,  using  scan¬ 
ning  tunneling  microscopy. 

The  interfacial  defect  morphology  of  the  Ge  layer  grown 
on  the  blank  Si  substrate  and  the  Ge  ELO  layer  on  the  pat¬ 
terned  Si  substrate  are  compared  by  XTEM.  Figure  2(a) 
shows  a  TEM  image  of  a  network  of  dislocation  segments34 
that  exist  mostly  within  300  nm  from  the  Ge-Si  interface 
when  Ge  is  grown  on  a  blank  Si  substrate.  Despite  the  iden¬ 
tical  preparation  steps  used  for  the  blank  Si  sample,  except 
for  the  Si02  removal  in  HF,  the  Ge-Si  interface  of  the  blank 
Si  sample  roughens  during  Ge  growth.  Note  that  the  images 
in  Fig.  2  capture  only  the  Ge-Si  interface  since  the  thickness 
of  the  Ge  layer  is  2  jmm.  The  Ge  layer  far  exceeds  the  field  of 
view  in  Fig.  2.  In  comparison,  the  Ge  ELO  layer  is  free  of 
dislocation  segments,  but  twins  are  generated  from  the 
Si02-Ge  interface  mostly  on  top  of  the  Si02  template.  Fig¬ 
ure  2(b)  shows  the  twins  propagating  along  the  {111}  planes 
to  the  surface.  These  twins  may  originate  by  two  mecha¬ 
nisms.  The  roughness  of  the  Si02-Ge  interface  causes  lat¬ 
tice  misplacement  where  the  stacking  sequence  along  the 
[111]  directions  is  disrupted  to  follow  the  inverse  order.35 
The  merging  fronts  of  coalescing  Ge  islands  may  also  lead  to 
the  formation  of  these  twins.12,35 

Figure  3(a)  is  a  magnified  view  of  Region  outlined  by  a 
square  in  Fig.  2(b).  No  threading  dislocations  are  observed  in 
the  ELO  layer  and  within  the  windows.  The  thin  layers  of 
contrast  ( ~  20  nm)  near  the  template  walls  are  due  to  the 
circular  vias  whose  front  and  back  are  partially  ion-milled 
for  imaging,  leaving  two  arced  segments.  These  segments 
along  with  the  depth  averaging  in  TEM  give  rise  to  the  con¬ 
trast.  The  z -contrast  image  in  Fig.  3(b)  corroborates  this  in¬ 
terpretation  by  showing  similar  layers.  The  z -contrast  image 
Downloaded  30  Dec  2003  to  129.24.216.199.  Redistribution  subject 


FIG.  2.  XTEM  images  of  (a)  Ge  layer  grown  on  blank  Si(100)  with  a 
network  of  dislocations  at  the  Ge-Si  interface  and  (b)  Ge  layer  grown  on 
templated  Si(100)  with  no  visible  threading  dislocations. 


also  provides  an  elemental  map  of  Si,  Ge,  and  O.  Oxygen  is 
the  lightest  element  appearing  dark  whereas  Ge  is  the  heavi¬ 
est  element  appearing  light.  The  light  contrast  is  also  ob¬ 
served  at  the  bottom  of  each  Ge  seed,  indicating  that  Ge 
diffuses  into  Si  during  Ge  growth.  The  diffusion  layer  is 
approximately  25  nm  thick. 

The  bottom  of  the  Ge  seed  is  further  magnified  to  inter¬ 
rogate  the  Ge-Si  interface  within  the  template  windows  and 
to  determine  whether  the  complex  contrast  near  the  bottom 
of  Ge  seed  in  Fig.  3(a)  is  due  to  tangled  dislocation  net¬ 
works.  The  high-resolution  TEM  image  in  Fig.  4  shows  a 
3-nm-thick  amorphous  layer  at  the  Ge-Si  interface,  and  EDS 
detects  a  highly  elevated  level  of  oxygen  near  the  interface. 
No  carbon  signal  is  detected  near  the  interface.  These  obser¬ 
vations  suggest  that  the  amorphous  region  is  an  oxidation 
layer  incompletely  removed  during  the  thermal  degassing 
step  at  900  °C.  Despite  the  oxidation  layer,  Ge  grows  epi¬ 
taxially  on  the  Si  substrate.  Similar  epitaxial  growth  is  ob¬ 
served  by  other  authors.36,37  Germanium  is  believed  to  scav¬ 
enge  O  from  Si02.  The  byproducts  sublime  as  GeO  and 
SiO,32  creating  areas  of  bare  Si.37  The  size  of  the  oxide-free 
openings  is  approximately  10  nm,  and  such  region  is  labeled 
as  II  in  Fig.  4  for  clarity.  Germanium  crystal  inclusions  grow 
in  these  bare  Si  areas37  and  then  grow  laterally  to  form  an 
epitaxial  layer.  No  dislocation  segments  are  seen  in  epitaxial 
Ge  above  the  oxide-free  region,  whereas  stacking  faults  ema¬ 
nate  from  the  oxide  region  and  propagate  along  {111}  planes. 
Similar  stacking  faults  are  commonly  observed  in  Si  ho¬ 
moepitaxy  at  the  Si02-Si  interface.38-40  During  the  oxida¬ 
tion  process,  the  volume  increase  in  the  oxidation  layer 
causes  compressive  stress,  which  in  turn  leads  to  the  super¬ 
saturation  of  Si  in  the  interstices  of  the  epitaxial  layer  and 


Ge  Seed 


Si02 

£ 

gj  Ge  Diffuision  Layer 


FIG.  3.  Magnified  view  of  Region  I  in  Fig.  2(b):  (a)  XTEM  image  of  Ge 
seeds  and  ELO  layer  showing  twin  defects  and  (b)  z -contrast  image  showing 
25-nm-thick  Ge  diffusion  layer. 
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FIG.  4.  High-resolution  XTEM  image  to  show  the  details  of  the  Ge-Si 
interface  where  stacking  faults  emanate  from  the  partially  oxidized  inter¬ 
face. 

eventually  to  the  nucleation  of  stacking  faults.39  In  the  case 
of  Ge  heteroepitaxy,  the  stress  caused  by  Ge  oxidation  at  the 
interface  may  similarly  cause  the  generation  of  stacking 
faults.  These  stacking  faults  in  Ge  terminate  within  70  nm 
from  the  interface  unless  they  run  into  the  Si02  template 
sidewall  and  terminate  even  closer  to  the  interface.  The  ter¬ 
mination  of  stacking  faults  transitions  into  a  defect-free  Ge 
region.  The  termination  of  defects  by  the  Si02  sidewall, 
known  as  necking,  may  be  also  responsible  for  the  absence 
of  threading  dislocations  in  the  Ge  seeds.12 

The  absence  of  threading  dislocations  is  also  attributed 
to  Ge-Si  intermixing  below  the  Ge-Si  interface.  The  lattice 
constant  in  the  intermixing  region  shown  in  Fig.  3  is  5.57  A 
based  on  its  electron  diffraction  pattern.  This  lattice  constant 
is  greater  than  5.43  A  of  Si  below  the  diffusion  layer  but  less 
than  5.66  A  of  Ge  above  the  diffusion  layer.  That  is,  the 
intermixing  of  Ge  and  Si  creates  a  graded  Ge  diffusion  layer. 
The  gradation  helps  to  form  a  low-defect-density  Ge  epitax¬ 
ial  layer  because  the  lattice  mismatch  is  less  than  4.2% 
within  the  graded  area.  Thus,  no  atomically  abrupt  interface 
exists  between  Ge  and  Si. 

We  speculate  that  the  ELO  layer  is  also  fully  relaxed. 
The  speculation  is  based  on  the  theory  by  Luryi  and  Suhir41 
that  the  strain  energy  density  decays  exponentially  from  the 
heterojunction  and  that  the  characteristic  length  of  decay  is 
comparable  to  the  dimension  of  the  heterojunction.  For  the 
200-nm-wide  Ge-Si  interface,  the  strain  density  would  de¬ 
crease  to  its  minimum  within  200  nm  of  the  interface,  and 
the  strain  energy  density  may  never  exceed  the  threshold  to 
create  dislocations. 

We  have  selectively  grown  Ge  nanostructures  in  a  2D 
array,  using  solid  source  MBE.  The  selectivity  relies  on  ob¬ 
served  thermal  instability  of  GeO  and  SiO  that  desorb  from 
the  Si02  template  surface.  The  germanium  structures  coa¬ 
lesce  to  form  an  ELO  layer  over  an  extended  period  of  Ge 
exposure.  The  germanium  seeds  as  well  as  the  ELO  layer  are 
free  of  threading  dislocations.  The  only  defects  observed  are 
stacking  faults  at  the  Ge-Si  interface  and  twins  in  the  ELO 
layer. 
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This  article  describes  defect  reduction  mechanisms  that  are  active  during  the  growth  of  GaN  by 
nanoheteroepitaxy  on  (0001)  6H  SiC.  Nanoheteroepitaxial  (NHE)  and  planar  GaN  epitaxial  films 
were  grown  and  compared  using  transmission  electron  microscopy,  photoluminescence,  x-ray 
diffraction,  and  time  resolved  photoluminescence.  It  was  found  that  in  addition  to  the  previously 
reported  defect  reduction  mechanism  that  results  from  the  high  compliance  of  nanoscale  nuclei, 
other  independent  defect  reduction  mechanisms  are  also  active  during  NHE  including:  (i)  filtering 
of  substrate  defects,  (ii)  improved  coalescence  at  the  nanoscale,  and  (iii)  defect  termination  at  local 
free  surfaces.  Also,  it  was  found  that  the  biaxial  strain  in  the  GaN  film  could  be  significantly  reduced 
by  using  a  “grouped”  NHE  pattern  geometry.  Time  resolved  photoluminescence  measurements  on 
NHE  GaN  samples  with  this  geometry  showed  a  more  than  tenfold  increase  in  carrier  lifetime 
compared  to  GaN  grown  on  planar  SiC.  ©  2004  American  Institute  of  Physics. 
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INTRODUCTION 

GaN  and  the  related  III-N  materials  are  typically  grown 
on  lattice  mismatched  substrates  such  as  silicon,  sapphire, 
and  silicon  carbide.  A  consequence  of  this  mismatch  is  that 
III-N  epitaxial  layers  contain  a  high  density  of  threading 
dislocations  that  have  been  linked  to  the  failure  of  lasers1  and 
the  breakdown  of  pn  junctions.2  Furthermore,  it  is  antici¬ 
pated  that  threading  dislocations  will  aggravate  composi¬ 
tional  stability  problems  associated  with  In  segregation  from 
InGaN  alloys.3  To  reduce  this  defect  concentration  several 
approaches  exploiting  lateral  overgrowth  on  a  patterned  sub¬ 
strate  have  been  demonstrated.  These  approaches  include  lat¬ 
eral  epitaxial  overgrowth,4,5  pendeo  epitaxy,6  and  cantilever 
epitaxy,7  and  significantly  reduced  dislocation  densities  in 
the  range  106-107  cm-2  have  been  achieved.  Theory8,9  and 
experiment10  indicate  that  a  further  reduction  in  defect  den¬ 
sity  is  possible  if  the  lateral  overgrowth  approach  is  extended 
to  the  nanoscale.  This  paradigm  is  embodied  in  the  nanohet¬ 
eroepitaxy  approach9  where  strain  energy  is  reduced  by  le¬ 
veraging  the  high,  three  dimensional  (3D)  compliance  of 
nanoscale  features.11  Conservative  calculations  of  the  strain 
energy9  suggest  that  the  3.2%  lattice  mismatch  between  GaN 
and  SiC  can  be  accommodated  without  defect  generation 
when  the  nanoheteroepitaxial  (NHE)  pattern  features  are  re¬ 
duced  to  —40  nm  in  diameter. 

This  article  describes  additional  defect  reduction  mecha¬ 
nisms  that  are  operative  during  the  NHE  process  even  when 
the  NHE  pattern  size  is  larger  than  the  40  nm  target  value 
that  theory9  suggests  is  required  for  the  elimination  of  strain- 


Author  to  whom  correspondence  should  be  addressed;  electronic  mail: 
shersee  @  chtm.unm.edu 


related  defects:  (1)  filtering  of  substrate  defects  as  a  result  of 
the  small  initial  growth  area;  (2)  improved  nanoscale  coales¬ 
cence;  and  (3)  the  termination  of  defects  at  local  free  sur¬ 
faces.  It  is  found  that  these  additional  mechanisms  contribute 
significantly  to  quality  improvement  in  NHE  GaN  films.  The 
thermal  mismatch  stress  between  the  substrate  and  the  epi¬ 
taxial  layers  can  also  be  controlled  by  employing  a  multiple- 
scale  nanopatterning  geometry  leading  to  localized  growth 
areas.  In-plane  biaxial  stress  values  as  low  as  0.18  GPa  were 
achieved  within  arrays  of  localized  areas  of  —10  gm  linear 
dimensions,  large  enough  for  many  device  applications.  The 
defect  and  stress  reduction  mechanisms  reported  here  are 
also  expected  to  occur  during  the  NHE  of  other  mismatched 
semiconductor  materials,  such  as  GaN  on  Si,  which  is  of 
critical  importance  for  solid-state  lighting  applications. 


EXPERIMENTAL  SETUP 

The  design  of  the  NHE  sample  structure  has  been  dis¬ 
cussed  previously.9,10  The  experiments  described  here  used  a 
simple  structure,  consisting  of  holes  etched  into  a  35  nm 
thick,  low-pressure  chemical  vapor  deposition  Si3N4  layer 
that  was  deposited  onto  a  SiC  substrate.  Metalorganic  chemi¬ 
cal  vapor  deposition  (MOCVD)  growth  of  GaN  on  this  struc¬ 
ture  is  selective  (growth  only  on  the  exposed  SiC)  and  cre¬ 
ates  an  array  of  compliant,  nanoscale  islands.  Two  nanoscale 
hole  geometries:  a  continuous,  large-area  array  pattern  geom¬ 
etry  [Fig.  1(a)],  and  a  local-area,  grouped  array  pattern  [Fig. 
1(b)]  were  fabricated  by  interferometric  lithography.12  In 
both  cases,  the  pattern  period  was  500  nm  and  the  hole  di¬ 
ameter  was  approximately  100  nm.  In  the  grouped  array  pat¬ 
tern  [Fig.  1(b)]  the  center-to-center  spacing  of  the  groups 
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FIG.  1.  Nanoscale  pattern  geometries  used  for  NHE 
growths:  (a)  patterned  array  geometry  and  (b) 
“grouped”  patterned  array  geometry. 


was  7.5  /zm  and  the  span  of  each  group  was  3.5  /zm.  These 
dimensions  were  selected  to  allow  coalescence  within  the 
group  but  not  in  the  space  between  the  groups.  These  moire 
patterns  were  created  by  multiple  exposures  with  slightly 
different  periods  and  the  holes  near  the  edges  of  each  group 
are  approximately  30%  smaller  than  the  holes  in  the  center 
of  the  pattern.  While  a  hole  diameter  of  —100  nm  is  too  large 
to  offer  sufficient  reduction  in  strain  energy  to  eliminate 
stress-induced  defects  via  the  3D  compliance  NHE  defect 
reduction  mechanism,9  a  significant  improvement  in  GaN 
material  quality  is  nevertheless  achieved  as  detailed  below. 

The  GaN  epitaxial  layers  were  grown  by  MOCVD  in  a 
radio-frequency  heated,  horizontal  reactor  at  100  Torr  using 
trimethy gallium  and  ammonia  sources.  Growth  was  initiated 
using  a  15  A  GaN  buffer  grown  at  650  °C.  The  growth  tem¬ 
perature  was  then  increased  to  1080  °C  to  grow  the  bulk  of 
the  GaN  epilayer.  During  buffer  layer  growth  the  average 
growth  rate  (measured  on  the  planar  SiC  reference  substrate) 
was  kept  low  at  1.4  nm/min  to  ensure  selectivity,  which  is 
critical  to  the  NHE  process.  This  growth  rate  was  increased 
during  the  higher  temperature  growth  stage  to  10  nm/min.  In 
each  experiment  we  grew  GaN  on  the  two  arrays  shown  in 
Fig.  1  and  on  a  planar  (unpatterned)  SiC  reference  wafer. 

RESULTS 

Figure  2  shows  low  resolution  plan- view  scanning  elec¬ 
tron  microscope  (SEM)  images  of  1.8  /zm  thick  GaN  films 
grown  side-by-side  on  a  planar  SiC  substrate  [Fig.  2(a)]  and 
on  a  continuously  patterned  NHE  array  [Fig.  2(b),  substrate 
patterned  as  in  Fig.  1(a)].  At  this  thickness  the  GaN  film  on 
the  patterned  array  is  fully  coalesced.  The  GaN  film  grown 
on  the  planar  substrate  exhibits  a  “hole”  defect  density  of 
9.lXl06cm-2.  These  “hole”  defects  are  believed  to  be 


threading  dislocations  or  nanopipe/micropipe  defects  that 
originate  in  the  SiC  substrate  and  thread  into  the  GaN  epi¬ 
taxial  layer.  In  the  NHE  GaN  sample  the  density  of  “hole” 
defects  is  reduced  to  ~6X  104  cm-2  (this  defect  density  was 
estimated  from  measurement  of  larger  areas  of  the  sample). 
This  reduction  in  defect  density  in  the  NHE  sample  illus¬ 
trates  one  of  the  additional  defect  reduction  mechanisms  that 
are  active  in  NHE.  Because  much  of  the  SiC  substrate  is 
covered  by  the  Si3N4  growth  mask,  most  substrate  threading 
defects  (including  threading  dislocations,  nanopipes,  and  mi¬ 
cropipes)  will  terminate  at  the  Si3N4  mask  and  will  not 
propagate  into  the  GaN  epitaxial  layer.  Both  GaN  layers  also 
show  cracks  that  originate  from  the  thermal  expansion  mis¬ 
match  on  cooling  from  the  growth  temperature.  As  shown 
below,  the  grouped  nanoscale  pattern  can  eliminate  these 
cracks. 

Figure  3  shows  the  results  of  NHE  GaN  growth  on  the 
“grouped”  array  pattern  geometry  sample  shown  in  Fig. 
1(b).  Each  group  of  nanoscale  nuclei  has  coalesced  to  form  a 
perfectly  faceted  pyramid  of  GaN  [Fig.  3(a)]  that  is  approxi¬ 
mately  6.5  /zm  across  at  the  base  (measured  at  the  pyramid 
base  between  the  {01  In}  facets).  The  underlying  “grouped” 
nucleation  pattern  can  be  seen  in  Fig.  3(a),  where  at  the  edge 
of  the  sample  one  of  the  pyramids  became  detached.  It 
should  be  noted  that  while  similar  GaN  facetted  pyramids 
have  been  reported  previously,13,14  the  pyramids  reported 
here  were  nucleated  from  approximately  80  individual  nu¬ 
clei,  which  then  coalesced  to  form  the  single  crystal  GaN 
pyramid.  This  excellent  coalescence  illustrates  one  of  the  key 
differences  between  NHE  and  the  conventional  lateral  epi¬ 
taxial  overgrowth  (LEO)  approach.  When  coalescence  occurs 
in  the  NHE  process  the  epilayer  is  thin  and  compliant  and 
high  quality  coalescence  is  obtained.11  In  the  case  of  LEO, 
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FIG.  2.  (a)  SEM  image  of  GaN  on  planar  SiC  showing 
“hole”  defects  and  cracks  and  (b)  SEM  image  of  fully 
coalesced  NHE  GaN  on  patterned  SiC  showing  reduced 
density  of  “hole”  defects. 
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FIG.  3.  (a)  GaN  pyramids  on  group -patterned  SiC.  (b)  Bright-field  XTEM 
crystal,  (c)  XTEM  image  of  pyramid  side  showing  dislocation  bending. 

by  the  time  coalescence  occurs  the  epilayer  is  typically 
“tilted”  and  thick  (between  1  and  10  jmm)  and  defects  are 
generally  observed  at  the  coalescence  boundary.  Note  that 
neither  the  “hole”  defects  [shown  in  Fig.  2(a)]  nor  cracking 
due  to  thermal  mismatch  strain  [shown  in  Figs.  2(a)  and 
2(b)]  were  observed  in  these  GaN  pyramids. 

Transmission  electron  microscopy  (TEM)  analysis  of  a 
GaN  pyramid  feature  is  shown  in  Figs.  3(b)  and  3(c).  The 
crystallographic  orientations  and  Miller  indices  of  the  facets 
in  Fig.  3(b)  are  assigned  based  on  measurement  of  the  facet 
angle  with  respect  to  the  [0001]  direction  (61°±1°)  and  are 
consistent  with  previous  work15  that  has  shown  such  facets 
to  be  of  the  prismatic  {01  In}  family.  During  TEM  sample 
preparation  an  example  of  cleavage  on  a  {1100}  facet  was 
also  observed.  The  electron  diffraction  pattern  from  the  up¬ 
per  part  of  this  GaN  pyramid  [inset  in  Fig.  3(b)]  confirms 
that  it  is  single  crystal.  Figure  3(c)  is  a  TEM  image  of  the 
pyramid  adjacent  to  one  of  the  {1101}  facets.  This  image 
confirms  that  the  bending  behavior  of  threading  dislocations 
previously  reported  for  GaN  on  Si14  is  also  active  during  the 
growth  of  GaN  on  SiC.  It  is  assumed  that  threading  disloca¬ 
tions  can  minimize  their  energy  by  bending  and  terminating 
at  a  {1 101}  surface.  Three  such  dislocations  are  indicated  by 
arrows  in  Fig.  3(c).  In  NHE,  where  growth  begins  as  a  dense 
array  of  nanoscale  nuclei,  there  are  many  such  sidewall  sur¬ 
faces  available  for  this  bending/termination  process  to  occur. 
This  bending  mechanism  is  also  responsible  for  the  result 
that  in  NHE  GaN  nuclei  threading  dislocations  appear  to  be 
replaced  almost  completely  by  in-plane  type  defects.  This 
change  in  defect  kinetics  is  another  major  aspect  of  nanohet- 
eroepitaxial  growth  that  has  also  been  observed  in  the  NHE 
growth  of  GaN  on  Si10,14  as  well  as  in  the  growth  of  GaN  on 
SiC. 

The  difference  in  thermal  expansion  coefficients  (the 
thermal  mismatch)  between  the  component  materials  in  a 
heterostructure  can  be  a  source  of  significant  biaxial  stress 


(c) 

image  of  pyramid.  (Inset)  electron  diffraction  pattern  showing  pyramid  is  single 


that  in  thick  samples  causes  cracking  as  seen  in  Fig.  2.  In 
these  GaN/SiC  structures  micro-Raman  spectroscopy  was 
used  to  measure  the  residual  biaxial  stress  in  the  GaN  films. 
The  Raman  scattering  measurements  were  made  in  a  stan¬ 
dard  backscattering  Z(XX)Z  geometry.  For  the  grouped  (py¬ 
ramidal)  samples,  the  nonplanar  geometry  effectively  elimi¬ 
nates  any  orientational  or  polarization  selectivity.  The  Raman 
measurement  used  a  488  nm  Ar  ion  laser  source,  a  40  X 
microscope  objective  (spot  size  ~1  /xm),  aim  double  spec¬ 
trometer  and  an  optical  multichannel  analyzer  (OMA).  Spec¬ 
tral  shifts  were  calibrated  against  emission  lines  from  the  Ar 
plasma  with  a  repeatability  of  approximately  one  diode  in  the 
OMA  array  or  0.12  cm-1.  In  general,  for  an  elastic  material 
where  the  elastic  constants  are  known,  converting  the  pho¬ 
non  frequency  shift  to  a  biaxial  stress  value  is  straightfor¬ 
ward.  The  phonon  frequency  shift  is  first  converted  to  strain 
in  the  “c”  lattice  parameter  then  Hookes’  law  is  used  to 
calculate  the  biaxial  stress  in  the  basal  plane.16  In  this  Raman 
scattering  geometry  the  GaN  E 2  peak  is  a  convenient  indi¬ 
cator  of  strain,  however,  previous  work16-22  has  attributed  a 
wide  range  of  conversion  factors  relating  the  biaxial  strain  in 
the  GaN  epitaxial  layer  to  the  E 2  Raman  peak  shift.  [The 
conversion  factor  k  (cm-1  GPa-1)  is  in  the  range:  2.1  <k 
<7.6,  where  A a>  =  ka,  Xco  is  the  phonon  frequency  shift 
(cm-1)  and  cris  the  biaxial  stress.]  This  wide  range  in  k  value 
is  due  largely  to  use  of  different  values  for  the  GaN  elastic 
constants  (Young’s  modulus  and  Poisson’s  ratio);  moreover, 
some  systematic  inaccuracy  is  unavoidable  as  Hookes’  law  is 
being  applied  to  a  partially  relaxed  (and  therefore  nonelastic) 
GaN  film.  Despite  these  reservations,  most  authors  typically 
measure  a  stress  induced  E 2  phonon  frequency  reduction 
(relative  to  the  unstrained  E 2  peak  frequency  of  568  cm-1) 
of  approximately  4  cm-1  in  GaN/SiC  structures. 

The  biaxial  stresses  shown  in  Table  I  were  calculated 
using  a  conversion  factor  of  k  =  5.1  cm- 1  GPa- 1 .  Each  stress 
value  is  an  average  value  calculated  from  at  least  five  mea- 


TABLE  I.  Biaxial  stress  measured  in  the  GaN/SiC  structures  by  Raman  spectroscopy. 


Sample  structure 

(El) 

(cm-1) 

Shift  from 
bulk  (cm-1) 

Biaxial  stress 
(GPa) 

GaN/SiC:  planar 

564.91 

-3.09 

-0.61 

GaN/SiC:  coalesced  uniform  hexagonal  array 

564.25 

-3.74 

-0.73 

GaN/SiC:  coalesced  grouped  hexagonal  array 

567.1 

-0.9 

-0.18 
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FIG.  4.  Optical  micrographs  at  20  X  magnification 
showing  reduction  of  crack  density  in  planar,  2  gm 
thick,  GaN  films  on  SiC,  with  reduced  post-growth 
cooling  rate:  (a)  postgrowth  cooling  rate  at  100  °C/min 
and  (b)  postgrowth  cooling  rate  at  20  °C/min. 


surements  of  the  Raman  spectrum  on  multiple  samples.  For 
all  of  the  GaN  growths  on  SiC,  the  E 2  Raman  peak  wave 
number  is  less  than  the  bulk  (unstrained)  value  of  568  cm-1, 
indicating  that  the  GaN  is  in  tension  as  expected  from  the 
thermal  expansion  coefficients.  For  the  GaN/SiC  planar 
growth  sample  and  the  coalesced  NHE  uniform  array  sample 
we  measure  biaxial  stresses  of  approximately  —0.61  and 
—0.73  GPa,  respectively.  The  NHE  “grouped”  geometry 
sample  consistently  had  a  much  smaller  biaxial  stress  of  only 
0.18  GPa. 

A  simple  model  can  be  used  to  explain  these  stress  val¬ 
ues.  We  assume  that  at  the  GaN  growth  temperature  of 
—  1040  °C,  defect  creation  will  reduce  the  mismatch  strain  to 
approximately  zero.  When  a  planar  GaN/SiC  sample  is 
cooled  from  the  GaN  growth  temperature  to  room  tempera¬ 
ture  we  assume  that  this  defect  structure  remains  largely  un¬ 
changed  (the  defect  structure  is  “frozen  in”)  and  the  stress 
measured  at  room  temperature  will  then  be  due  to  the  ther¬ 
mal  mismatch.  In  this  case  the  strain  at  room  temperature  is 
given  by  6=Ar(a:sub-«epi),  where  AT  is  the  temperature 
difference  and  a  is  the  thermal  expansion  coefficient.  The 
biaxial  stress  value  is  obtained  using  the  approach  of 
Reiger16  from  Eq.  (1) 

eE 

<r= — ,  (1) 

v 

where  a  is  the  biaxial  stress  (GPa),  E  is  Young’s  modulus 
(GPa),  and  v  is  Poisson’s  ratio.  Using  a  growth  temperature 
of  — 1000  °C,  thermal  expansion  coefficients  of:  aSic=4.2 
X  10-6/°C  and  aGaN=5.6X  10-6/°C,  a  Young’s  modulus  of 
200  GPa,  and  a  Poisson’s  ratio  of  0.38,  this  calculation  yields 
a  biaxial  stress  value  of  —0.74  GPa.  This  value  is  compa¬ 
rable  to  the  biaxial  stress  values  measured  for  the  planar 
growth  and  coalesced  NHE  films  in  Table  I. 

The  reduced  biaxial  stress  measured  in  the  GaN  pyramid 
structures  (Fig.  3)  has  been  previously  observed,23,24  espe¬ 
cially  near  the  top  of  the  pyramid,  and  it  is  of  interest  to 
understand  why  this  stress  reduction  occurs.  The  theory  of 
nanoheteroepitaxy9  shows  that  the  stress  and  strain  fall  off 
exponentially  in  a  3D  semiconductor  object,  with  a  decay 
length  that  is  comparable  to  the  diameter  of  that  object.  It  is 
important  to  understand  that  nanoscale  size  is  not  specifically 


required  to  achieve  this  exponential  stress/strain  decay,  rather 
it  is  an  approximately  unity  aspect  ratio  of  the  object  (height 
to  width)  that  must  be  achieved.  The  GaN  pyramid,  where 
the  height  is  comparable  to  the  width  of  the  object,  satisfies 
this  aspect  ratio  requirement  and  a  significant  reduction 
stress  and  strain  is  thus  expected.  While  this  mechanism 
could  account  for  most  of  the  observed  reduction  in  residual 
biaxial  stress  in  the  GaN  pyramids,  an  additional  stress  re¬ 
duction  mechanism  is  also  believed  to  be  active. 

After  growth,  as  the  GaN/SiC  sample  is  cooled  from 
high  temperature,  the  thermal  mismatch  stress  will  change 
and  there  will  be  some  rearrangement  of  the  defect  structure 
in  response  to  this  change.  This  defect  rearrangement  will  be 
facilitated  by  the  presence  of  a  surface  which  can  act  as  a 
source  of  vacancies  and  can  allow  defect  termination  or  gen¬ 
eration.  In  the  grouped  sample  (Fig.  3)  defects  can  also  in¬ 
teract  with  the  sidewall  surfaces  of  the  pyramid  as  shown  in 
Fig.  3(c).  We  suggest  that  as  the  grouped  sample  is  cooled 
from  growth  temperature,  the  defect  structure  in  this  sample 
will  therefore  be  better  able  to  respond  to  the  changing  ther¬ 
mal  mismatch  stress  in  the  GaN/SiC  structure.  Support  for 
this  hypothesis  was  recently  obtained  by  comparing  the 
stress  in  fully-coalesced,  2  jmm  thick,  planar  GaN  epilayers 
grown  on  SiC,  which  were  cooled  at  different  rates  following 
growth.  Figure  4  shows  optical  micrographs  of  these  planar 
GaN  films  at  20  X  magnification.  The  crack  density  is  clearly 
reduced  when  the  postgrowth  cooling  rate  is  reduced  from  its 
normal  value  of  100°C/min  [Fig.  4(a)]  to  20°C/min  [Fig. 
4(b)].  This  reduction  in  crack  density  indicates  a  reduced 
residual  stress  in  the  slower  cooled  film  and  indicates  that 
there  has  been  some  change  of  defect  structure  (and  stress)  in 
the  slower  cooled  film,  in  support  of  the  above  hypothesis. 
An  interdependence  of  defect  density  and  sample  lateral  size 
has  also  been  observed  in  other  materials  such  as 
InGaAs/GaAs.25 

Time  resolved  photoluminescence  measurements  with 
subpicosecond  resolution  were  also  performed  on  these 
samples  using  a  luminescence  downconversion  technique. 
PL  excitation  pulses  of  —100  fs  duration  tunable  from  375 
nm  to  225  nm  were  obtained  by  frequency  doubling  of  the 
visible  signal  pulses  from  a  250  kHz  Ti:  sapphire 
regenerative-amplifier-pumped  optical  parametric  amplifier. 
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FIG.  5.  Time  resolved  photoluminescence  shows  a  significantly  longer  car¬ 
rier  lifetime  for  the  NHE  “grouped”  GaN  sample. 

Figure  5  shows  the  decay  of  PL  intensity  for  the  three  sample 
types  after  a  345  nm,  530  /zW  pump  pulse.  The  decay  is 
fitted  by  two  exponentials  to  yield  a  fast  (tf)  and  a  slow  (ts) 
component  of  carrier  lifetime.  The  superior  carrier  lifetime 
of  the  NHE  “grouped”  pattern  is  clearly  demonstrated  with 
an  ~4X  increase  in  the  “fast”  carrier  lifetime  component 
and  more  than  a  10 X  increase  in  the  “slow”  carrier  lifetime 
component  for  this  sample.  Similar  improvements  in  the  fast 
and  slow  components  of  carrier  lifetime  were  also  obtained 
in  the  grouped  NHE  sample  for  345  nm,  1.56  mW  pump 
pulses.  The  room  temperature  photoluminescence  intensity 
of  the  “grouped”  sample  was  approximately  100  times  more 
intense  than  that  of  the  planar  GaN  on  SiC  sample.  Part  of 
this  efficiency  improvement  is  undoubtedly  due  to  the  im¬ 
proved  coupling  of  radiation  both  into  and  out  of  the  pyra¬ 
midal  samples  as  compared  with  a  planar  geometry,  where 
the  high  refractive  index  of  the  GaN  prevents  escape  of  much 
of  the  photoluminescence  radiation;  however,  some  of  the 
increase  is  likely  due  to  reduced  nonradiative  processes  as 
reflected  in  the  lifetime  measurements. 

CONCLUSIONS 

We  have  analyzed  NHE  GaN  grown  on  SiC  by  MOCVD 
and  observed  that  the  quality  of  NHE  GaN  is  significantly 
improved  compared  to  planar  growth  even  when  the  nanos¬ 
cale  nuclei  in  these  samples  were  larger  than  the  target  size 
of  40  nm  and  were  not  expected  to  show  defect  elimination 
via  the  3D  NHE  compliance  strain-relief  mechanism.9  It  is 
found  that  the  NHE  growth  mask  is  effective  at  terminating 
substrate  defects.  Also  the  close  proximity  of  the  growth  sur¬ 
face  to  unbound  sidewalls  in  the  NHE  sample  structure  le¬ 
verages  the  bending  of  threading  dislocations  and  their  ter¬ 
mination  at  sidewalls.  The  “grouped”  NHE  structures 
showed  a  dramatic  improvement  in  both  carrier  lifetime  and 
room  temperature  photoluminescence  intensity,  and  a  signifi¬ 
cantly  reduced  biaxial  stress  of  —0.18  GPa. 

While  the  GaN  material  quality  in  this  grouped  array 
structure  is  significantly  improved  and  this  geometry  may  be 


acceptable  for  some  emitter  array  applications,  the  challenge 
remains  to  combine  a  low  defect  density  with  reduced  stress 
(no  cracking)  in  a  fully  coalesced,  planar  film. 
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Abstract 

Growth  of  GaN  on  a  nanoscale  periodic  faceted  Si  substrate  by  metal  organic 
vapor  phase  epitaxy  is  reported.  The  surface  of  the  Si(001)  substrate  is 
patterned  with  a  one-dimensional,  355-nm  period  array  of  grooves.  Each 
groove  consisted  of  two  facing  Si  { 1 1 1 }  stripe  facets  separated  by  upper  and 
lower  Si(001)  surfaces.  GaN  deposition  on  this  groove  array  shows  a  strong 
selectivity  with  epitaxial  growth  of  the  hexagonal  phase  on  the  Si  { 1 1 1 }  facets 
being  dominant.  The  initial  stage  of  GaN  growth  appears  to  be  amorphous. 
This  is  followed  by  GaN  containing  a  high-density  of  stacking  faults  to 
accommodate  the  large  lattice  mismatch  strain  but  the  defect  density  decreases 
as  growth  proceeds.  Lateral  overgrowth  from  the  Si{  1 1 1 }  facets  toward 
neighboring  Si(001)  facets  leads  to  coalescence  between  GaN  from  opposing 
Si{  1 1 1 }  facets.  These  GaN  regions  have  misaligned  c-axes  and  the  crystal 
structure  becomes  unstable  as  they  merge,  resulting  in  a  phase  transition  from 
hexagonal  to  cubic  at  the  coalescence  region.  Experimental  results  of  GaN 
grown  on  a  nanoscale  faceted  Si  surface  including:  nucleation,  crystal 
structure,  and  lateral  growth  depending  on  Si  orientation  are  presented. 


1.  Introduction 

Gallium  nitride  (GaN)  and  related  materials  have  attracted  much  attention  since  the 
achievement  of  blue  laser  diodes  on  a  sapphire  substrate  (Nakamura  1996).  Because  of 
their  large  bandgap,  there  is  also  substantial  interest  for  advanced  communication  systems 
that  require  high  power  and  high  speed.  While  sapphire  has  crystal  symmetry  identical  to 
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that  of  (0001)  GaN,  its  large  lattice  mismatch  and  low  thermal  conductivity  are  less 
favorable.  Silicon  (Si)  has  a  strong  potential  as  a  substrate  for  GaN  as  a  result  of  the 
availability  of  low-cost,  large-area  substrates  with  extremely  high  quality.  It  also  has  a 
large  lattice  mismatch  but  has  a  thermal  conductivity  comparable  with  GaN  that  is 
considerably  larger  than  that  of  sapphire  (Powell  2002).  However,  GaN  on  Si(001)  is 
known  to  have  phase  instability  due  to  polytypism  which  leads  to  a  mixture  of  cubic  and 
hexagonal  phases  and  degrades  the  crystallinity  and  surface  morphology.  For  device 
applications,  phase  control  or  phase  separation  into  wide-area  single  phase  GaN  on  Si(001) 
is  required. 

In  this  work,  we  investigate  the  use  of  nanoscale  patterning  of  a  Si  substrate  to 
control  the  phase  instability.  It  is  known  that  GaN  exhibits  a  hexagonal  phase  on  Si(lll) 
and  can  have  cubic  phase  on  Si(001)  (Lei  1991,  Godlewski  1996,  Kawaguchi  1998,  Kung 
1999,  Zamir  2002).  The  basic  idea  is  to  prepare  both  nanoscale-wide  (111)  and  (001) 
facets  on  a  single  Si(001)  substrate.  On  this  multi-faceted  surface,  both  GaN  phases  are 
available  at  the  same  time  but  in  different  areas  of  the  Si  substrate.  Then,  the  nucleation 
mechanisms  and  crystal  structure  of  GaN  which  depend  on  Si  facet  orientation  can  be 
directly  compared.  We  report  two  important  experimental  results:  one  is  nucleation 
selectivity  of  GaN  depending  on  Si  orientation  and  associated  lateral  growth,  and  the  other 
is  phase  transition  of  GaN  from  hexagonal  to  cubic  phase  on  a  nanoscale  faceted 
(nanofaceted)  Si  surface.  Nanoscale  faceting  (nano faceting)  on  a  Si  substrate  and  metal- 
organic  vapor-phase  epitaxy  (MOVPE)  of  GaN  will  be  described.  Experimental  results 
obtained  from  GaN  grown  on  this  nanofaceted  Si  surface  including  orientation-dependent 
nucleation,  crystal  structure,  lateral  growth,  and  formation  of  cubic  phase  GaN,  will  be 
presented. 


2.  Experiment 

With  large-area  interferometric  lithography  and  anisotropic  wet  etching,  a  nanoscale  one 
dimensional  (ID)  periodic  grooves  which  were  bounded  by  (100)  and  {111}  facets  was 
fabricated  on  Si(100)  substrate  (Brueck  1998,  Lee  and  Brueck  2003).  Figure  1(a)  shows 
45°-tilted  (top)  and  cross  sectional  (bottom)  scanning  electron  microscopy  (SEM)  images 
of  a  355-nm  period  nanofaceted  Si  substrate.  As  seen  in  Fig.  1(a),  each  period  has  both  top 
and  bottom  (100)  facets  and  {111}  sidewalls;  the  overall  shape  is  a  flat-bottom  groove. 
The  widths  of  the  top  and  the  bottom  (100)  facets  are  approximately  100  and  120  nm, 
respectively. 

On  this  nanofaceted  Si  surface,  GaN  was  deposited  by  MOVPE  using  (CH3)3Ga  and 
NH3.  First,  a  ~1.5-nm  thick  GaN  buffer  was  grown  at  650°C.  On  top  of  this  buffer,  an 
~600-nm-thick  GaN  epilayer  was  grown  at  1050°C.  Figure  1(b)  shows  a  cross  sectional 
SEM  image  of  the  as-grown  sample.  This  sample  was  examined  with  transmission  electron 
microscopy  (TEM),  atomic  force  microscopy  (AFM),  and  photoluminescence  (PL). 


3.  Results  and  discussion 


2 


Figure  2(a)  is  a  cross  sectional  TEM  (XTEM)  image  of  the  GaN  grown  on  the  nanofaceted 
Si  surface.  Figures  1(d)  and  1(e)  are  an  AFM  and  a  top  view  SEM  image  revealing  surface 
morphology  of  the  as-grown  GaN,  respectively.  Some  surface  roughening  due  to  an 
irregular  corrugation  running  across  the  groove  direction  was  observed  in  Figs.  1(d)  and 
1(e),  and  also  in  Fig.  2(a).  However,  this  is  significantly  flat  and  smooth  compared  with 
the  GaN  grown  on  an  unfaceted  large-area  Si(001)  substrate  under  the  same  growth 
conditions  [Fig.  1(f)]  and  with  the  reported  results  (Lei  1991,  Godlewski  1999,  Zhang 
1999).  This  improvement  of  surface  morphology  is  a  major  difference  between  GaN 
epilayers  grown  on  nanofaceted  and  unfaceted  Si(001). 

In  Fig.  1(c),  which  is  the  magnified  image  of  the  region  marked  by  the  white  box  in 
Fig.  1(b),  a  void  is  observed  on  the  bottom  Si(001)  facet.  The  shape  and  location  of  the 
voids  are  more  clearly  revealed  in  the  XTEM  images  shown  in  Fig.  2.  Voids  are  formed  on 
both  the  bottom  and  the  top  Si(001)  facets.  In  Fig.  2(b),  the  overall  shapes  of  these  voids 

are  approximately  triangular  with  GaN{1101}  sidewalls  on  the  bottom  Si(001)  and  a 
GaN{0001}  on  the  top  Si(001)  facet. 

Figure  3(a),  corresponding  to  the  area  in  box  a  in  Fig.  2(b),  is  a  high  resolution 
XTEM  image  taken  near  the  comer  of  the  groove  where  the  bottom  Si(001)  and  Si(lll) 
intersect.  The  amorphous-like  GaN  which  could  be  due  to  the  stress  from  the  large  lattice 
mismatch  appears  at  the  beginning  of  deposition  but  evolves  to  a  crystalline  structure  with 


Figure  1.  (a)  A  45°-tilted  (top)  and  a  cross  sectional  (bottom)  SEM  image  of  a 
355-nm  period  ID  array  of  flat-bottom  grooves  fabricated  on  a  Si(001) 
substrate,  (b)  A  cross  sectional  SEM  image  of  a  600-nm  thick  GaN  film 
grown  on  a  nanofaceted  Si  surface  shown  in  (a),  (c)  A  magnified  SEM  image 
of  the  white  dashed  box  in  (b).  (d)  An  AFM  image  of  the  as-grown  sample, 
(e)  A  top  view  SEM  image  of  GaN  shown  in  (b).  (f)  A  top  view  SEM  image 
of  GaN  grown  on  a  wide-area  unfaceted  Si(001)  under  the  same  growth 
conditions  as  those  of  (b). 


a  large  number  of  stacking  faults  after  a  few  nm  of  growth.  Stacking  faults  are  generated 
partially  to  release  the  excess  strain  energy  (Zubia  2000).  Figure  3(b),  corresponds  to  the 
area  in  box  b  of  Fig.  2(b),  and  shows  a  defect- free  hexagonal  phase  GaN  region  formed 
above  the  stacking  faults.  The  c-axes  in  the  hexagonal  GaN  phases  are  perpendicular  to  the 
Si  { 1 1 1 }  facets  as  indicated  by  the  arrows  in  Fig.  2(b).  Figure  3(a)  shows  that  after  GaN 
nucleation  on  the  Si  { 1 1 1 }  facets,  there  is  lateral  growth  towards  the  adjacent  Si  (001) 
facets.  Figure  3(c)  corresponds  to  the  area  in  box  c  of  Fig.  2(b)  and  shows  a  high  resolution 
XTEM  image  taken  near  the  void  at  the  top  Si(001)  stripe  facet.  As  seen  in  this  figure,  the 
two  c-axes  of  the  coalesced  GaN  are  misaligned  since  they  are  originated  from  opposing 
Si(lll)  faces  and  make  an  angle  of  -70°.  Figure  3(d)  is  the  high-resolution  image  of  the 
area  in  box  d  of  Fig.  2(a).  The  cubic  phase  GaN  in  Fig.  3(d)  was  consistently  formed  above 
the  coalescence  boundary  of  the  highly  defected  hexagonal  phase  regions. 

Figure  4  is  a  room- temperature  PL  spectrum  obtained  from  the  as-grown  sample. 
The  PL  exhibits  a  single  peak  at  3.389  eV  which  is  on  a  background  of  scattered  laser  light. 
This  is  close  to  the  reported  PL  peak  energy  of  the  transition  of  a  donor-bound  exciton  in 
hexagonal  phase  GaN  on  Si(lll)  (Zamir  2002,  Yodo  2002).  This  also  implies  that  the 
major  crystal  structure  of  the  as-grown  GaN  on  a  nanofaceted  Si  surface  is  hexagonal  and 
supports  the  TEM  results. 

Based  on  the  experimental  results,  the  growth  evolution  outlined  schematically  in 
Fig.  5  is  proposed.  As  suggested  in  Fig.  5(a),  nucleation  selectively  occurs  on  Si  { 1 1 1 }  with 
a  hexagonal  GaN  phase  forming  at  this  initial  growth  stage.  From  the  shape  of  the  voids,  it 
can  be  noticed  that  the  GaN  selectively  grown  on  Si{  1 1 1 }  has  a  trapezoidal  cross  section 
with  {1101}  as  the  sidewall  orientation  at  the  beginning  of  growth.  This  orientation 
selectivity  results  in  negligible  deposition  on  the  Si(001)  facets,  but  lateral  growth  from 
Si{  1 1 1 }  toward  these  facets  begins  to  fill  in  these  facets.  The  arrowheads  in  Fig.  5(b) 
indicate  the  directions  of  lateral  growth  conjectured  from  the  TEM  image  of  Fig.  2.  From 


■* - 355  nm 


Figure  2.  (a)  An  XTEM  image  of  the  as-grown  sample,  (b)  A  magnified 
XTEM  image  corresponding  to  the  white  dashed  box  in  (a). 
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the  facet  geometry,  the  directions  available  for  lateral  growth  on  the  top  Si(001)  are  [0001] 
and  [1101]  while  that  on  the  bottom  Si(001)  is  [1 1  01]  only.  Thus  the  sidewall  orientation 
of  the  void  on  the  top  Si(001)  is  determined  by  the  lateral  growth  along  the  additional 
[0001], 

In  Figs.  2,  the  voids  are  not  at  the  centers  of  the  Si(001)  facets.  The  gas  flow  in  our 
horizontal  reactor  which  directs  gases  from  left  to  right  on  these  figures  may  result  in 
uneven  exposure  of  the  opposing  Si  { 1 1 1 }  facets  to  the  source  gases.  Since  the  bottom  void 
forms  early  in  the  growth  and  effectively  halts  further  growth  in  this  region  by  sealing  out 
any  gases,  the  growth  rates  depending  on  facet  orientation  can  be  properly  compared  at  the 
bottom  side.  In  Fig.  2(b),  growth  proceeded  from  the  Si(l  1 1)  to  the  solid  white  line  profile 
along  [0001]  and  from  the  dashed  white  line  in  a  [1 1  01] .  By  applying  the  same  rule  to  the 
other  side  of  the  groove  and  averaging  the  results  of  the  two  cases,  the  growth  rate  in 
[1101]  is  estimated  as  at  least  20-25%  of  that  along  [0001],  If  migration  of  Ga  atoms  from 

{1101}  to  (0001)  on  the  GaN  surface  is  appreciable  in  nanoscale  dimensions,  the  actual 
growth  rate  along  [0001]  can  be  partly  due  to  the  narrower  width  of  the  upper  side  of  the 
trapezoidal  cross  section  and  could  be  significantly  greater  than  the  nominal  overall  growth 
rate  of  10  nm/min.  Owing  to  the  considerable  lateral  growth  rate  in  [1 1  01],  a  hexagonal 
single  phase  GaN  epilayer  can  be  achieved  over  the  wide  area  (probably  pm-scale  area)  on 
Si(001)  from  a  seed  layer  on  a  Si{  1 1 1 }  facet  if  the  lateral  dimension  of  the  top  and  bottom 
Si(001)  facets  were  to  be  extended  (e.  g.  a  larger  period  structure). 

Continual  growth  with  fast  growth  rate  along  [0001]  induces  the  coalescence 
between  GaN  epilayers  selectively  grown  on  facing  Si{  1 1 1 }  and,  as  a  result,  generates  a 
void  below  the  coalescence  points  of  Fig.  5(b)  where  growth  is  stopped  at  the  inside. 


Figure  3.  High  resolution  XTEM  images  alphabetically  labeled  (a  to  d)  in 
the  white  boxes  of  Fig.  2.  See  the  text  for  details. 
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Although  initial  deposition  begins  on  localized  Si  { 1 1 1 }  facets,  the  growth  surface  is 
eventually  changed  to  GaN  after  coalescence,  as  illustrated  in  Figs.  5(b)  and  5(c). 
Continual  deposition  on  this  GaN  surface  activates  a  planarization  process  leading  to  the 
flat  surface  shown  in  Figs.  1(d)  and  1(e). 

In  Fig.  3(c),  coalescence  between  adjacent  hexagonal  GaN  nuclei  induces  a 
structural  instability  resulting  from  the  physical  contact  of  these  two  hexagonal  phase  GaN 
epilayers  with  misaligned  c-axes.  As  seen  in  Figs.  3(b)  to  3(d),  the  hexagonal  phase 
changes  to  a  cubic  phase  above  a  void  where  coalescence  proceeds.  A  growth  region  with  a 
high  density  of  stacking  faults  is  observed  again  just  as  the  two  opposing  hexagonal  islands 
meet.  The  generation  of  stacking  faults  could  be  understood  as  phase  fluctuation  between 
hexagonal  and  cubic  phases,  partly  due  to  a  high  strain  in  the  region  of  coalescence  that 
will  increase  total  crystal  energy  by  structural  instability. 

The  appearance  of  the  cubic  phase  GaN  is  an  important  result  of  this  work  which 
suggests  a  possibility  of  phase  separation  or  phase  control  on  Si(001)  by  nano  faceting. 
Formation  of  cubic  phase  GaN  which  is  similar  to  Fig.  3(d)  has  been  reproduced  on  a 
periodic  V-grooves  fabricated  in  a  Si(001)  surface  and  will  be  reported  elsewhere.  In  this 


Wavelength  (nm) 


Figure  4.  Room-temperature  PL  of  the 
as-grown  sample. 


GaN 


(a) 


(b) 

Cubic  phase  GaN 


(c) 


Figure  5.  An  illustration  for 
orientation-dependent  nucleation 
and  lateral  growth  of  GaN  on  a 
nanofaceted  Si  surface. 
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work,  we  briefly  introduce  the  formation  of  cubic  phase  GaN  on  a  Si(100)  substrate  by 
nanofaceting.  Further  research  is  required  to  understand  the  relation  between  facet 
geometry/physical  dimension  and  the  relative  size  of  the  cubic  phase  regions  on  a 
nano  faceted  Si  surface.  Planarization  of  the  front  surface  of  GaN  which  is  ID  periodic 
composite  of  hexagonal  and  cubic  phases  also  needs  more  investigation. 


4.  Summary  and  conclusions 

Growth  of  GaN  on  a  nanoscale  periodic  faceted  Si(001)  substrate  by  MOVPE  has  been 
investigated.  On  a  355-nm  period  ID  array  of  flat-bottom  grooves  consisting  of  a  bottom 
Si(001),  a  top  Si(001),  and  Si{  1 1 1}  sidewall  facets  in  each  period,  strong  selectivity  of  the 
nucleation  and  incorporation  of  GaN  on  Si(lll)  with  hexagonal  phase  has  been  observed. 
Orientation-dependent  nucleation  results  in  lateral  growth  from  a  Si  { 1 1 1 }  facet  toward 
neighboring  bottom  and  top  Si(001)  facets  with  formation  of  voids,  and  eventually 
coalescence.  With  continued  growth,  the  entire  GaN  film  planarizes  to  a  smooth  surface 
but  has  an  unstable  crystal  structure  resulting  from  the  periodically  misaligned  c-axes.  This 
structural  instability  causes  phase  transition  from  hexagonal  to  cubic  phase  on  a 
nanofaceted  Si  surface.  Our  experimental  results  therefore  suggests  that  nanofaceting  can 
be  used  for  GaN  phase  control  or  phase  separation  on  a  local  or  a  wide  area  of  a  Si(001) 
substrate. 
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This  letter  reports  the  growth  of  spatially  separated  hexagonal  and  cubic  phases  of  GaN  on  a 
patterned  Si(001)  substrate  by  metalorganic  vapor-phase  epitaxy.  The  substrate  surface  was 
patterned  with  grooves  having  a  355  nm  period.  Each  groove  consisted  of  two  opposed  Si{l  11} 
facets  that  were  separated  by  Si(001)  surfaces.  Epitaxial  growth  of  GaN  on  this  substrate  began 
selectively  on  the  Si{l  1 1}  facets  and  yielded  the  GaN  hexagonal  phase.  With  further  growth,  the  two 
hexagonal  phase  regions  separately  grown  on  the  opposed  Si{l  1 1}  facets  coalesced,  with  strongly 
misaligned  c  axes  (—110°).  The  GaN  grown  after  coalescence  was  subsequently  confirmed,  by 
transmission  electron  microscopy  and  photoluminescence,  to  be  of  cubic  phase.  ©  2004  American 
Institute  of  Physics.  [DOI:  10.1063/1.1687456] 


Gallium  nitride  (GaN)  and  related  compounds  exhibit  a 
wide  range  of  emission  wavelengths,  and  are  also  attractive 
for  high-power  and  high-speed  devices.  The  lack  of  a  lattice- 
matched  substrate  has  led  to  extensive  efforts  in  the  devel¬ 
opment  of  substrates  and  growth  strategies  for  these  materi¬ 
als.  Silicon  substrates  are  inexpensive  and  readily  available 
in  large  areas  with  very  high  crystalline  quality. 

Generally,  GaN  grows  with  a  hexagonal  phase  on 
Si(lll)  and  can  have  a  cubic  phase  on  Si(001).1-7  The  for¬ 
mation  of  the  cubic  phase  is  known  to  depend  on  growth 
conditions  and  on  the  crystal  structure  of  the  substrate  and 
the  buffer  layer;  the  material  quality  is  often  severely  de¬ 
graded  by  mixture  with  or  transition  to  the  hexagonal  phase, 
resulting  in  a  poly  type  structure.3-7  Despite  its  expected  ad¬ 
vantages  in  electron  mobility,  /7-type  doping,  electrical  con¬ 
tact,  and  cleaving  for  laser  facets,7,8  high-quality  cubic  phase 
GaN  has  so  far  proved  elusive. 

For  high-quality  cubic  phase  GaN  on  Si(001),  the  strat¬ 
egy  followed  in  this  work  is  to  take  advantage  of  the  com¬ 
pliance  associated  with  nanoscale  growth  as  shown  in  Fig. 
I.9  A  nanoscale  groove  [Fig.  1(a)]  is  etched  into  a  Si(001) 
surface,  revealing  two  opposed  Si(lll)  faces  separated  by  a 
bottom  Si(001)  facet.  Deposition  of  GaN  on  this  patterned 
surface  results  in  selective  growth  of  GaN  only  on  the 
Si(lll),  leading  to  two  hexagonal  GaN  crystals  in  the 
groove.10  Continued  growth  leads  to  the  coalescence  be¬ 
tween  these  two  GaN  epilayers,  as  illustrated  in  Fig.  1(b). 
Since  GaN  has  hexagonal  phase  on  Si(lll)  with  its  c  axis 
normal  to  it,  the  two  top  (0001)  surfaces  of  Fig.  1(b)  form  a 
V  groove  at  coalescence,  but  with  misaligned  c  axes.  Fol¬ 
lowing  the  coalescence,  incoming  flux  interacts  with  this  sec¬ 
ondary  V  groove  and  forms  the  most  stable  crystal  structure 
available  under  the  growth  conditions.  The  investigation  of 
the  crystal  structure  of  GaN  continuously  grown  on  this  ar- 
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tificially  manipulated  surface  was  the  basic  motivation  for 
this  work. 

A  355-nm-period,  nanofaceted  Si  surface,  exhibiting 
{111}  and  (001)  facets,  was  fabricated  on  a  Si(001)  substrate 
by  large-area,  i- line  interferometric  lithography  and  aniso¬ 
tropic  wet  etching.10  Figure  2(a)  shows  a  cross-sectional 
scanning  electron  microscopy  (SEM)  image  of  the  nanofac¬ 
eted  Si  surface,  similar  to  Fig.  1(a).  For  convenience,  we  call 
a  single  period  of  Fig.  2(a)  a  flat-bottom  groove.  The  widths 
of  the  top  and  the  bottom  (001)  facets  are  approximately 
100-150  nm  and  70-100  nm,  respectively,  and  the  groove 
depth  is  —100  nm.  GaN  epilayers  were  grown  by  metalor¬ 
ganic  vapor-phase  epitaxy.  A  —  1.5-nm-thick  GaN  buffer  was 
grown  at  650  °C  and  a  —  75-nm- thick  GaN  epilayer  was  sub¬ 
sequently  deposited  at  1050  °C.  Nominal  growth  rates  were 
1.5  nm/min  for  the  buffer  and  10  nm/min  for  the  GaN  epil¬ 
ayer. 

Figure  2(b)  shows  a  cross-sectional  transmission  elec¬ 
tron  microscopy  (XTEM)  image  of  a  single  groove  of  an 
as-grown  sample.  In  Fig.  2(b),  the  groove  is  filled  with  only 
a  75-nm- thick  deposition.  Triangular  voids  located  on  the  top 
and  bottom  Si(001)  facets  were  observed.  These  voids  imply 
that  there  was  negligible  deposition  on  Si(001).10  As  indi¬ 
cated  by  the  arrows  in  Fig.  1(b),  the  initial  growth  proceeds 
selectively  in  the  direction  normal  to  the  Si{l  11}  planes.  The 
cross  section  of  the  GaN  at  this  point  was  a  trapezoid 
bounded  by  {1 101}  sidewalls  and  a  {0001}  top  facet  that  was 
parallel  to  the  underlying  Si{l  1 1}  facet  [Fig.  2(c)].  This  cross 
section  was  maintained  until  coalescence  between  opposing 
GaN  nuclei  occurred  at  point  X  [Fig.  2(b)].  The  dashed  lines 
indicate  the  profile  of  the  secondary  V  groove  at  the  moment 
of  coalescence.  After  coalescence,  the  supplied  gas  flow  no 
longer  accessed  the  space  below  point  X,  and  the  growth 
inside  the  lower  void  stopped.  Continued  growth  eventually 
resulted  in  a  second  coalescence  above  the  upper  Si{001} 
facet,  again  with  the  formation  of  a  void.  In  this  early  stage 
of  growth,  the  crystal  structure  over  this  second  coalescence 
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FIG.  1.  (a)  Concept  drawing  of  the  faceted  Si  surface  prepared  for  this 
work,  (b)  Cross  section  of  the  faceted  Si  surface  shown  in  (a)  along  with 
selective  growth  of  GaN  on  the  Si(lll)  faces,  leading  to  the  formation  of  the 
secondary  Y  groove. 


is  not  yet  well  defined,  and  we  concentrate  on  the  material 
properties  in  the  vicinity  of  point  X. 

Figure  2(c)  illustrates  the  GaN  phase  map  resulting  from 
analysis  of  the  sample  cross  section  shown  in  Fig.  2(b).  Fig¬ 
ure  3  shows  high-resolution  XTEM  images  of  selected  areas 
of  the  GaN  growth  labeled  (A,B)  in  Fig.  2(b).  In  Fig.  3(a), 
taken  from  region  A,  stacking  faults  are  largely  distributed 
near  the  GaN/Si  interface,  but  the  crystal  structure  stabilizes 
to  a  hexagonal  phase  that  is  maintained  until  coalescence,  as 
shown  subsequently  [Fig.  3(c)].  Stacking  faults  (SFs)  are  of¬ 
ten  observed  in  GaN/Si  heterostructures,  especially  when  the 
substrate  is  patterned  at  nanoscale.11  Since  a  SF  in  the  cubic 
phase  corresponds  to  the  presence  of  the  hexagonal  phase 
(and  vice  versa),  a  region  of  SFs  can  be  regarded  as  a  phase 
fluctuation  or  an  aperiodic  variation  of  the  crystal  structure 
between  hexagonal  and  cubic  phases. 

Figure  3(b)  shows  a  high-resolution  XTEM  image  from 
a  region  near  B  of  Fig.  2(b)  that  corresponds  to  the  region 
grown  after  coalescence  of  the  secondary  V  groove.  The 
GaN,  again,  is  highly  defected  with  stacking  faults  in  the 
coalescence  region  between  the  two  hexagonal  crystals,  but 
the  resulting  crystal  structure  now  is  cubic.  Selective  area 
electron  diffraction  (SAD)  focused  on  the  GaN  region  [the 
inset  of  Fig.  3(b)]  provides  evidence  of  the  existence  of  the 
cubic  phase.  The  cubic  phase  diffraction  spots,  indicated  by 
the  black  arrows  labeled  “C,”  are  shown  together  with  those 
from  the  two  overlapped  hexagonal  phases  (the  white  arrows 
labeled  “HI”  and  “H2”).  Note  that  the  crystal  axes  of  the 
cubic  phase  GaN  are  parallel  to  those  of  the  Si  substrate. 
Thus,  the  planar  surface  above  region  B  is  parallel  to  the 

Si{001}  surface.  The  resulting  cubic  phase  region,  as  illus- 
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FIG.  2.  (a)  Cross-sectional  SEM  image  of  a  355-nm-period  one-dimensional 
nanofaceted  surface  fabricated  on  a  Si(001)  substrate,  (b)  XTEM  image  of  a 
single  groove  of  a  75-nm-thick  as-grown  GaN  sample.  The  point  X  corre¬ 
sponds  to  the  initial  coalescence  point  and  formation  of  the  secondary  V 
groove.  The  surface  of  the  GaN  exhibits  facets  that  are  7°  off  from  the 
Si(001).  (c)  Phase  map  of  the  cross  section  shown  in  (b),  showing  regions  of 
hexagonal  and  cubic  crystals  separated  by  highly  defected  SF  regions. 

trated  in  the  phase  map  of  Fig.  2(c),  has  an  isosceles  triangle 
cross-sectional  shape  that  mostly  fills  the  initial  grooved  sur¬ 
face,  and  is  therefore  spatially  separated  from  the  hexagonal 
phase  regions  on  the  nanofaceted  Si  surface. 

Figure  3(c)  shows  details  of  coalescence  revealed  in 
high-resolution  XTEM.  The  coalescence  boundary  between 
the  two  hexagonal  phases  is  clearly  resolved.  The  white  ar¬ 
rows  in  Fig.  3(c)  indicate  the  c-axis  directions  of  the  two 
hexagonal  GaN  epilayers  grown  from  the  Si{l  1 1}  facets.  The 
c  axes  of  the  misaligned  hexagonal  nuclei  are  tilted  by 
—  110°  with  respect  to  each  other;  this  situation  is  analogous 
to  a  twin  structure.  The  boundary  region  shown  in  Fig.  3(c) 
is  energetically  very  unstable.  The  black  arrows  indicate  sev¬ 
eral  lattice  points  that  are  significantly  displaced  along  this 
coalescence  boundary.  As  shown  in  Figs.  2(c),  3(b),  and  3(c), 
this  highly  strained  coalescence  boundary  between  hexago¬ 
nal  nuclei  is  gradually  overtaken  by  phase  fluctuation  (gen¬ 
eration  of  SFs),  and  finally  by  the  formation  of  the  cubic 
GaN  phase.  Experimentally,  it  is  evident  from  Figs.  3(b)  and 
3(c)  that  changing  the  crystal  structure  to  cubic  phase  is  en¬ 
ergetically  more  favorable  than  keeping  misaligned  hexago- 
AIP  license  or  copyright,  see  http://apl.aip.org/apl/copyright.jsp 
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FIG.  3.  High-resolution  XTEM  images  taken  from  the  region  (a)  A,  (b)  B, 
and  (c)  the  region  near  the  X,  as  indicated  in  Fig.  2(b).  The  inset  of  (b)  is  a 
SAD  pattern  obtained  from  the  region  above  coalescence  point  X  of  Fig. 
2(b).  In  the  inset,  HI  and  H2  are  the  diffraction  spots  from  the  two  mis¬ 
aligned  hexagonal  phases,  while  C  marks  the  cubic  phase.  In  (c),  the  arrows 
indicate  the  c  axes  of  the  two  coalesced,  but  misaligned,  hexagonal  phases. 


nal  phases.  A  relatively  small  energy  difference  between  the 
two  phases  of  GaN  can  be  one  of  the  physical  origins  driving 
phase  transition  to  lower  the  total  crystal  energy.12 

Finally,  Fig.  4  shows  a  photoluminescence  (PL)  spec¬ 
trum  obtained  from  the  as-grown  sample  at  10  K.  In  Fig.  4, 
two  dominant  PL  peaks  are  observed  at  3.439  and  3.276  eV. 


Energy  (eV) 

FIG.  4.  PL  of  the  as-grown  sample. 


While  3.439  eV  is  between  donor  bound  exciton  recombina¬ 
tion  (D°X)  and  band-impurity  transition  in  hexagonal  phase 
GaN  on  a  Si(lll),  3.276  eV  is  very  close  to  D°X  recombi¬ 
nation  of  cubic  phase  GaN  on  GaAs(OOl)  or  Si(001).5,7 
Therefore,  the  PL  of  Fig.  4  provides  further  evidence  of  the 
coexistence  of  hexagonal  and  cubic  phases  in  a  single  GaN 
epilayer  and  supports  the  XTEM  observation  of  Figs.  2(b) 
and  3. 

In  conclusion,  spatial  separation  between  hexagonal  and 
cubic  phases  of  GaN  has  been  observed  on  a  nanoscale  sec¬ 
ondary  V  groove  that  results  from  selective  growth  on  a 
nanopattemed  Si(001)/Si(lll)  flat-bottomed  groove  on  a 
Si(001)  substrate.  This  technique  can  be  applied  to  the 
growth  of  a  particular  phase  GaN  on  either  a  local  or  a  wide 
area  of  a  Si(001)  substrate,  and  therefore  to  the  study  of 
properties  of  cubic  phase  GaN. 
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The  equilibrium  crystal  shape  (ECS)  of  GaAs  homoepitaxially  grown  on  a  nanoscale  Si02 -patterned 
(001)  plane  by  molecular  beam  epitaxy  is  investigated.  A  GaAs  epilayer  selectively  grown  on  a 
nanoscale  area  bounded  by  a  circular  Si02  mask  undergoes  faceting,  resulting  in  a  pyramidal  shape 
with  {110}  sidewalls.  Growth  is  slowed  or  terminated  with  the  generation  of  these  {110}  facets  even 
with  a  continuing  supply  of  Ga  atoms.  This  implies  that  the  pyramidal  shape  is  energetically  very 
stable.  Based  on  experimental  results  and  the  Wulff  construction,  a  {110}-type  sidewall  pyramid  is 
proposed  as  an  ECS  of  GaAs  on  (001)  in  nanoscale  patterned  growth.  ©  2004  American  Institute 
of  Physics.  [DOI:  10.1063/1.1757657] 


I.  INTRODUCTION 

A  crystal  has  a  certain  shape  in  equilibrium.  In  crystal 
growth,  the  shape  of  a  growing  crystal  is  governed  by  equi¬ 
librium  crystal  shape  (ECS)  under  a  given  set  of  growth  con¬ 
ditions.  In  other  words,  the  shape  of  a  growing  crystal 
evolves  to  the  ECS  for  the  given  growth  conditions.  There 
has  been  a  theoretical  approach  to  ECS  since  Wulff  proposed 
a  construction  rule  based  on  minimization  of  the  total  surface 
free  energy  more  than  a  hundred  years  ago.1  Extensive  ex¬ 
perimental  research  work  has  been  reported  from  the 
1980’s.2 

The  technology  of  crystal  growth  has  progressed  re¬ 
markably  over  the  last  three  decades.  Advanced  growth  tech¬ 
niques  such  as  molecular  beam  epitaxy  (MBE),  providing 
nearly  atomic-layer  thickness  control,  are  widely  available. 
In  traditional  MBE,  the  substrate  is  generally  centimeter 
scale  while  the  thickness  of  the  epilayer  is  typically  in  the 
micrometer  scale.  As  a  result  of  the  extremely  large  ratio  of 
the  lateral  dimension  to  the  thickness  and  of  the  very  low 
growth  rate  [— 1  monolayer  (ML)/sec],  the  growing  epilayer 
maintains  a  quasi-two-dimensional  (2D)  growth  mode  and  its 
crystal  shape  is  trivially  determined  by  the  shape  of  the  sub¬ 
strate  surface.  For  example,  it  is  always  a  thin  disk  if  the 
substrate  has  a  circular  shape.  If  the  radius  of  the  substrate 
opening  is  reduced  from  centimeter  scale  to  micrometer  or 
even  to  nanometer  scale  so  that  the  lateral  dimensions  of  the 
epilayer  become  comparable  to  or  less  than  its  thickness,  the 
ECS  will  be  changed  dramatically. 

To  understand  the  impact  of  the  variation  of  the  lateral 
dimension  of  a  growth  area  on  the  overall  shape  of  the  epil¬ 
ayer,  we  can  imagine  selective  growth  of  GaAs  on  a 
Si02-pattemed  substrate.  If  a  circular  hole  with  diameter 
much  larger  than  the  thickness  of  the  epilayer  to  be  deposited 
is  fabricated  on  a  Si02  mask,  growth  selectively  proceeding 
within  this  hole  area  typically  results  in  a  disk  shape  with  a 
trapezoidal  cross  section  as  shown  in  Fig.  1.  If  the  hole  di¬ 
ameter  becomes  comparable  to  or  smaller  than  the  thickness 
of  the  epilayer  to  be  deposited  on  the  hole,  as  seen  in  Fig.  1, 
the  overall  shape  of  the  epilayer  will  be  strongly  affected  by 
the  sidewall  formed  near  the  boundary  of  the  Si02  mask  and 
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the  epilayer.  Depending  on  the  deposition  thickness,  faceting 
occurs  on  the  sidewalls  and  the  resulting  epilayer  shape  is 
governed  by  ECS. 

The  condition  that  the  deposition  thickness  be  greater 
than  the  lateral  dimension  of  an  epitaxial  area  is  easily  sat¬ 
isfied  in  nanoscale  patterned  growth  without  requiring  un¬ 
usually  thick  depositions.  In  selective  patterned  growth  using 
a  mask  to  delineate  individual  nanoscale  growth  areas,  an 
epitaxial  area  with  a  lateral  size  less  than  the  surface  diffu¬ 
sion  length  of  a  Ga  atom  at  typical  MBE  growth  conditions 
allows  most  of  Ga  atoms  incident  on  a  substrate  surface  to 
migrate  over  the  whole  nanoscale  surface  area  to  find  a  fa¬ 
vorable  bonding  configuration  encouraging  the  formation  of 
the  energetically  stable  shape.3 

Evaluation  of  the  ECS  is  important  in  determining  the 
surface-free-energy  hierarchy  among  crystal  orientations  and 
also  in  understanding  the  electronic  properties  of  nanostruc¬ 
tures  such  as  quantum  dots  that  are  affected  by  geometrical 
shape.  MBE  is  generally  known  as  a  nonselective  deposition 
technique,  but  we  have  demonstrated  selective  growth  for 
GaAs  on  a  Si02 -masked  GaAs  substrate  at  high  growth  tem¬ 
perature  (^615  °C)  and  low  deposition  flux  (^0.1  ML/s).4  In 
this  work,  the  ECS  of  GaAs  using  selective  growth  on  a 
nanoscale  Si02 -masked  substrate. 

II.  NANOSCALE  PATTERNING  AND  MBE  GROWTH 

Figure  2(a)  shows  a  45°-tilted  scanning  electron  micros¬ 
copy  (SEM)  image  of  a  355-nm  period  2D  array  of  circular 
holes  fabricated  in  a  — 50-nm-thick  Si02  mask  on  a 
GaAs (001)  substrate  by  interferometric  lithography  and  dry 
etching.  Three  ranges  of  hole  diameter  d  were  prepared  on  a 
single  substrate:  80-130  nm,  160-200  nm,  and  230-260 
nm.  Figure  2(a)  corresponds  to  the  first  range  for  which  the 
smallest  hole  diameters  are  less  than  100  nm. 

On  this  patterned  substrate,  we  deposited  an  amount  of 
GaAs  corresponding  to  a  100-nm-thick  layer  (on  an  unpat¬ 
terned  wide  area)  at  the  growth  temperature  of  630  °C  with 
deposition  flux  of  —0.1  ML/s  under  As-rich  conditions 
(As2/Ga  beam  equivalent  pressure  —35)  where  selective 
growth  has  been  reported.5  The  range  of  hole  diameters 
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Epilayer  thickness 


FIG.  1 .  Cross  section  (top)  and  tilted  (bottom)  schematic  views  of  selective 
growth  of  GaAs  on  a  Si02 -patterned  substrate. 


spans  this  deposition  amount.  Figure  2(b)  presents  a  45  °- 
tilted  SEM  image  of  the  GaAs  epilayer  selectively  grown  on 
the  Si02 -patterned  substrate  shown  in  Fig.  2(a). 

Because  of  the  variations  in  hole  diameter  and  roughness 
resulting  from  the  lithography,  the  GaAs  islands  show  vari¬ 
ous  shapes.  Categorization  into  a  few  groups  is  discussed  in 
Sec.  III.  The  insets  of  both  figures  show  a  single  circular  hole 
and  a  nanoscale  faceted  GaAs  epilayer  selectively  grown  in 
such  a  hole  (GaAs  epi-island),  respectively. 

III.  RESULTS  AND  DISCUSSION 

Figure  3(a)  shows  a  top- view  SEM  image  of  an  as- 
grown  sample  for  80  to  130  nm.  Various  GaAs  island 
shapes  are  observed:  circular,  triangular,  square,  rectangular, 
and  pentagonal.  The  top- view  images  in  Fig.  3(b)  show  some 
of  them  for  mask  holes  of  d~130nm.  As  indicated  by  the 
dashed  circle  in  each  image,  which  represents  the  original 
substrate  surface  opening  in  the  Si02  mask,  some  GaAs  epi- 
islands  exhibit  lateral  growth  over  the  Si02  mask,  primarily 


FIG.  2.  (a)  A  45°-tilted  SEM  image  of  a  355-nm  period,  2D  array  of  circular 
holes  fabricated  in  a  ~50-nm-thick  Si02  mask  on  a  GaAs(OOl)  substrate, 
(b)  A  45°-tilted  SEM  image  of  a  GaAs  epilayer  selectively  grown  on  the 
Si02 -patterned  substrate  shown  in  (a).  The  insets  of  (a)  and  (b)  show  a 
single  hole  and  an  island-type  GaAs  epilayer  selectively  grown  in  a  circular 
hole,  respectively. 


FIG.  3.  (a)  A  top-view  SEM  image  of  an  as-grown  sample  of  d 
~80-130nm.  White  arrows  indicate  the  GaAs  islands  belonging  to  the 
group  of  islands  with  minimal  lateral  growth,  (b)  A  top-view  SEM  image  of 
GaAs  islands  of  d~130nm,  having  triangular  (left),  square  (center),  and 
rectangular  shapes  (right),  (c)  A  top-view  SEM  image  of  circular  shape 
GaAs  islands  of  d~  100  (left),  180  (center),  and  260  nm  (right). 


along  [100]  and  [010]  in  resulting  directions.  The  three- 
dimensional  (3D)  shape  of  an  epi-island  similar  to  that  in  the 
center  of  Fig.  3(b)  has  been  discussed  in  our  previous  work.4 
The  3D  shape  of  the  GaAs  epi-islands  shown  in  Fig.  3(b)  is 
complex  and  all  of  the  facet  orientations  are  difficult  to  mea¬ 
sure.  On  the  other  hand,  as  shown  in  Fig.  3(c),  growth  of 
some  epi-islands  retains  the  roughly  circular  shape  of  the 
initial  mask  opening  without  noticeable  lateral  growth  and 
their  simple  3D  shape  enables  characterization  of  their  facet 
orientations.  As  indicated  by  the  white  arrows,  these  circular 
shape  islands  are  relatively  common  with  — 10%— 15%  of  the 
GaAs  islands  belonging  to  this  class.  Also,  as  seen  in  Fig. 
3(c),  this  shape,  which  exhibits  at  least  twofold  symmetry,  is 
observed  over  the  entire  range  of  hole  diameters  investi¬ 
gated.  In  this  work,  we  focus  on  the  islands  with  minimal 
lateral  overgrowth  to  investigate  the  ECS  of  a  minimal- 
volume  GaAs  epi-island  selectively  grown  in  a  nanoscale 
area.  In  Fig.  3(c),  GaAs  epi-islands  with  other  shapes  are 
distributed  around  the  circularly  shaped  ones  we  are  discuss¬ 
ing.  Additional  discussion  of  these  GaAs  epi-islands  will  be 
presented  below. 

Figure  4  shows  high-magnification  top- view  and  45  °- 
tilted  (top)  and  side  view  SEM  images  (bottom)  of  a  GaAs 
island  for  180  nm  taken  from  different  directions.  For  this 
figure,  the  Si02  mask  was  removed  by  dipping  the  as-grown 
sample  in  diluted  HF  to  allow  unobstructed  views  of  the 


Downloaded  22  Jul  2004  to  64.106.37.66.  Redistribution  subject  to  AIP  license  or  copyright,  see  http://jap.aip.org/jap/copyright.jsp 


1216  J.  Appl.  Phys.,  Vol.  96,  No.  2,  15  July  2004 


100  nm 


FIG.  4.  High-magnification  SEM  images  of  a  GaAs  epi-island  for  d 
~180nm  taken  from  several  directions  with  different  view  angles.  The 
island  is  similar  to  that  shown  in  Fig.  3(c)  center.  Top  row  are  45°-tilted 
images;  bottom  row  are  side  view  images.  Note  the  view  direction  indicated 
in  each  part. 

GaAs  island.  The  island  shown  in  Fig.  4  belongs  to  the  group 
of  the  islands  with  minimal  lateral  overgrowth  [it  is  similar 
to  the  epi-island  in  the  center  image  of  Fig.  3(c)].  Islands 
smaller  than  that  of  Fig.  4  were  too  small  to  obtain  highly 
resolved  surface  images  for  clear  identification  of  facet  ori¬ 
entation  with  our  field  emission  SEM.  As  seen  in  Fig.  4(a), 
the  top  view  shows  a  nearly  circular  outline,  but  the  side 
view  along  [110]  in  Fig.  4(b)  shows  a  clear  triangular- 
faceted  shape  with  a  —35°  angle  of  the  sidewall  measured 
from  the  base  [or  (001)]  plane.  Additionally,  the  GaAs  island 
in  Fig.  4(c)  reveals  clear  facets  on  its  surface  in  a  45° -tilted 
image  taken  along  [010].  As  seen  in  Fig.  4(d),  its  side-view 
shape  along  the  same  direction  is  also  triangular  but  with  a 
—45°  angle  of  the  sidewall  facet  measured  from  the  base. 
The  slight  differences  in  contrast  on  the  island  surface  of  Fig. 
4(a)  correspond  to  the  different  facets. 

From  the  four  images  of  Fig.  4,  the  3D  shape  of  the 
group  of  GaAs  islands  we  are  focusing  on  is  pyramidal.  The 
angle  measurement  along  various  directions  reveals  that  the 
orientation  of  the  sidewall  facets  on  a  GaAs  island  is  {110}.  A 
schematic  illustration  of  the  {110}-type  sidewall  pyramid  is 
shown  in  Fig.  5.  Given  this  shape,  we  can  calculate  the  vol¬ 
ume  of  the  GaAs  island  and  the  deposition  required  for  its 


(101) 


a 

a/d  =  <T(iii)A/<T(iii)B 


FIG.  5.  Schematic  diagrams  of  a  {11 0}-type  sidewall  pyramid  (top)  and  a 
{11  l}-type  sidewall  pyramid  elongated  along  [110]  (bottom). 
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formation.  The  maximal  and  minimal  volumes  were  esti¬ 
mated  by  equating  d  alternatively  to  a  diagonal  across  the 
pyramid  base  and  to  a  side  of  the  square  pyramid  base,  re¬ 
spectively.  Dividing  these  volumes  by  the  base  area  (a  circle 
of  d~180nm),  a  range  of  the  effective  deposition  thick¬ 
nesses  of  14  to  38  nm  was  obtained.  This  rough  approxima¬ 
tion  reflects  the  actual  shape  of  the  GaAs  island  where  clear 
faceting  occurs  near  the  top  of  the  pyramid  while  the  base 
still  seems  to  be  close  to  circular.6  Importantly,  this  range  of 
effective  deposition  thicknesses  is  considerably  smaller  than 
the  supplied  amount  corresponding  to  a  100-nm- thick  layer. 
This  discrepancy  provides  a  very  important  experimental  re¬ 
sult  on  the  incorporation  behavior  of  the  incident  Ga  flux  in 
nanoscale-pattemed  growth. 

In  conventional  MBE  (unpatterned,  or  patterned  on  mi¬ 
crometer  or  larger  scales)  where  the  deposition  thickness  is 
much  smaller  than  the  lateral  size  of  the  epitaxial  area,  the 
deposition  amount  on  any  unit  area  is  simply  proportional  to 
the  growth  time  and  should  be  the  same  as  the  amount  sup¬ 
plied  if  the  incorporation  rate  is  unity.  At  our  growth  condi¬ 
tion,  an  incorporation  rate  close  to  unity  was  confirmed  on  a 
wide-area  unpatterned  GaAs  surface.  The  larger-volume 
GaAs  islands  of  Fig.  3(a)  are  also  indirect  evidence  of  this 
higher  incorporation  rate;  other  research  groups  also  have 
reported  unity  sticking  coefficients  of  Ga  on  GaAs  (001) 
around  630  °C.7  However,  as  demonstrated  above,  the  effec¬ 
tive  deposition  thickness  is  significantly  smaller  than  the 
supplied  amount  in  nanoscale  patterned  growth  for  the  class 
of  circular  epi-islands  without  significant  lateral  overgrowth 
on  which  we  are  focusing.  This  implies  that  growth  on  a 
nanoscale  hole  area  is  slowed  or  terminated  once  a  stable 
facet  such  as  {110}  is  generated  on  the  surface  of  the  epil- 
ayer.  In  other  words,  the  formation  of  an  energetically  very 
stable  facet  provides  a  thermodynamic  barrier  for  additional 
incorporation,  which  would  require  transformation  from  the 
stable  facet  to  another  facet  (and  another  geometric  shape). 
Thus,  the  slowing  or  termination  of  growth  with  the  forma¬ 
tion  of  {110}-type  sidewalls  on  a  GaAs  island  suggests  that 
the  pyramid  observed  in  Fig.  4  is  a  very  stable  geometric 
shape  under  our  growth  conditions.  Unincorporated  Ga  at¬ 
oms  may  evaporate  from  an  island  if  their  sticking  behavior 
on  {110}  is  different  from  that  on  (001),  or  they  may  migrate 
to  and  evaporate  from  an  adjacent  Si02  mask  since  the  Ga 
sticking  coefficient  on  the  Si02  surface  is  negligible  under 
selective  growth  conditions. 

For  a  given  circular  base,  both  orthorhombic  and  pyra¬ 
midal  shape  GaAs  islands  can  be  grown  with  low-index  ori¬ 
entation  faceting.  However,  the  orthorhombic  shape  is  not 
available  since  the  slanted  sidewall  generated  at  the  initial 
stage  of  growth  as  a  result  of  the  mask  thickness  does  not 
allow  vertical  sidewalls.5,8  Then,  the  only  possible  geometric 
shape  is  pyramidal.  Both  {110}  and  {111}  sidewalls  are  fea¬ 
sible.  Experimentally,  however,  {lll}-type  sidewall  pyramids 
were  not  observed.  As  seen  in  Fig.  4,  only  {110}-type  side- 
wall  pyramids  are  generated. 

According  to  the  Wulff  construction  rule,  the  crystal 
shape  is  determined  through  the  minimization  of  total  sur¬ 
face  free  energy  at  a  given  volume.  Once  the  surface  free 
energy  of  each  orientation  of  GaAs  is  known,  then  the  ECS 
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of  a  GaAs  crystal  can  be  built  up  by  the  Wulff  construction. 
The  distance  of  a  particular  facet  having  surface  free  energy 
per  unit  area  of  a  from  the  center  of  symmetry  is  propor¬ 
tional  to  r  determined  by9 


r(n)  =  min 

m 


o-(m) 

m*n 


a) 


where  m  is  a  unit  vector  normal  to  the  given  orientation  and 
n  is  a  normalized  vector  directing  from  the  center  of  sym¬ 
metry  to  any  point  on  a  given  facet  surface.  With  the  con¬ 
straint  of  a  fixed  volume  of  a  crystal,  Eq.  (1)  means  that  r  of 
each  facet  for  max[m‘n]  is  proportional  to  cr.  (For  the  pyra¬ 
mid  we  treat  in  this  work,  the  center  of  symmetry  is  on  the 
line  perpendicular  to  the  base  plane  passing  through  the  top 
vertex,  and  max[m*n]  then  has  the  same  value  for  all  four 
sidewalls  regardless  of  their  orientation.)  In  Eq.  (1),  how¬ 
ever,  cr  can  change  with  growth  conditions  such  as  tempera¬ 
ture  and  As  overpressure.  To  our  knowledge,  no  absolute 
comparison  of  surface  free  energy  between  different  orienta¬ 
tions  of  GaAs  under  a  specific  growth  condition  has  been 
reported.  But,  as  discussed  below,  Eq.  (1)  can  explain  the 
absence  of  a  {lll}-type  sidewall  pyramid. 

For  {111},  there  are  two  possible  orientations:  (lll)A 
and  (111)5.  In  general,  they  have  different  as  and  then, 
from  Eq.  (1),  the  ECS  must  be  a  pyramid  elongated  along 
either  [110]  or  [110]  if  all  the  sidewall  facets  belong  to 
{111}.  A  theoretical  model  suggests  the  order  of  the  lowest 
available  a  as  cr(111)fi  =  43  meV/A2<o-{110}  =  45  meV/A2 
<o-(ih)A  =  51  meV/A2  under  As-rich  conditions  where  cry 
represents  the  surface  free  energy  of  facet  j}°  Then,  the  ECS 
might  be  thought  to  be  a  pyramid  elongated  along  [110]  and, 
as  shown  in  Fig.  5,  have  an  expanded  (111)5  facet  and  a 
contracted  (111) A  facet.  This  elongated  pyramid  geometri¬ 
cally  may  not  be  probable  as  the  ECS  in  our  experiments 
since  the  base  area  allowed  for  epitaxial  growth  is  close  to 
circular,  requiring  equal  size  and  shape  sidewalls.  In  the 
elongated  {111}  pyramid  shown  in  Fig.  5,  alb  is  the  ratio  of 
the  width  to  the  length  of  the  base,  equal  to  o-(111)A  /o-(111)5  . 
Reliable  comparison  of  the  total  surface  free  energy 
( =  'LjCTjAj ,  where  Aj  is  the  area  of  facet  j  of  the  {111}  pyra¬ 
mid)  with  that  of  the  {110}  pyramid  is  not  available.  But,  its 
rectangular  base  could  explain  the  absence  of  a  crystal  shape 
consisting  of  {111}  type  sidewalls  in  this  work.  This  suggests 
that  the  pattern  shape  for  selective  deposition  as  well  as  the 
growth  conditions  affect  the  ECS  in  nanopatterned  growth. 
Indeed,  we  have  already  shown  that  the  (113)  and  (111)  ori¬ 
entations  appear  as  the  major  facets  on  the  sidewalls  of  GaAs 
selectively  grown  in  a  one-dimensional  nanoscale  stripe  open 
area  fabricated  along  the  [110]  direction.11  This  confirms 
that  the  ECS  is  not  unique,  but  depends  on  pattern  shape  as 
well  as  on  the  pattern  area  and  the  growth  conditions.  In  the 
modeling,  the  calculated  cr|110|  is  comparable  to  0-(111)5 
while  it  has  a  larger  difference  from  o-(111)A  .  Intuitively,  the 
formation  of  four  identical-cr  {110}  facets  could  be  more  fa¬ 
vorable  than  the  simultaneous  formation  of  two  different 
high-  and  low- a  facets. 

For  our  growth  conditions,  the  surface  migration  length 
of  Ga  atoms  on  (001)  is  of  micrometer  order,  significantly 


greater  than  the  typical  size  of  the  epi-islands.3  There  has 
been  no  quantitative  measurement  of  Ga  surface  migration 
length  on  different  orientations  with  variation  of  growth  tem¬ 
perature.  But,  it  can  be  conjectured  from  the  data  on  (001) 
and  the  comparison  between  different  orientations  that  the 
Ga  surface  migration  lengths  on  {111}  and  {110}  would  be 
greater  than  the  lateral  dimension  of  the  epi-islands  in  these 
experiments.12  There  could  be  several  different  facets  on  the 
sidewalls  of  an  epi-island  at  the  initial  stages  of  growth.  De¬ 
pending  on  pattern  shape  and  growth  conditions,  however, 
the  epi-island  evolves  to  the  most  stable  shape  having  the 
lowest  surface  free  energy  with  facet  expansion  and  contrac¬ 
tion  as  growth  is  continued.  By  relying  on  Wulff  construction 
and  the  observed  geometric  shape  of  a  GaAs  island,  there¬ 
fore,  it  can  be  concluded  that  a  {110}-type  sidewall  pyramid 
is  the  ECS  of  GaAs  grown  on  a  nanoscale  circular  patterned 
(001)  substrate  under  our  growth  conditions. 

Finally,  we  comment  on  the  reason  that  lateral  over¬ 
growth  proceeds  on  some  GaAs  islands  [square  or  rectangu¬ 
lar  shapes  in  the  top  view  of  Figs.  3(a)  and  3(b)]  with  the 
formation  of  additional  facets.  We  do  not  fully  understand 
this  observation  but  probably  this  could  be  due  to  a  distorted 
shape  of  the  original  hole  pattern  that  can  induce  asymmetric 
geometrical  shapes  on  an  island  surface.  Irregular  initial 
faceting  in  a  distorted  circular  hole  can  result  in  a  growth 
rate  of  a  particular  facet  that  is  considerably  faster/slower 
than  other  neighboring  ones,  which  asymmetrically  modifies 
the  overall  shape  of  the  epi-island.  Thus,  pattern  uniformity 
and  substructure  even  on  nanometer  and  subnanometer 
scales  is  an  important  issue  for  further  study.  The  investiga¬ 
tion  of  the  ECS  of  GaAs  islands  undergoing  lateral  over¬ 
growth  is  presently  underway  and  will  be  reported  else¬ 
where. 

IV.  CONCLUSION 

We  report  the  ECS  of  GaAs  grown  on  a  nanoscale 
Si02-patterned  substrate  by  MBE.  The  overall  shape  of  a 
GaAs  epilayer  selectively  grown  in  a  nanoscale  circular  hole 
is  strongly  affected  by  faceting,  which  results  in  a  pyramidal 
shape  with  {110}  sidewalls  and  minimal  lateral  growth  at  our 
growth  conditions.  Comparison  of  the  actual  deposited  vol¬ 
ume  with  the  supplied  amount  of  material  implies  that 
growth  on  a  nanoscale  circular  hole  is  slowed  or  terminated 
once  {110}  facets  are  formed.  Generation  of  these  facets 
across  the  entire  range  of  hole  diameters  investigated  means 
that  the  observed  pyramidal  shape  is  energetically  very 
stable.  From  the  experimental  results  and  Wulff  construction 
rule,  we  conclude  that  a  {110}-type  sidewall  pyramid  is  one 
of  the  ECS’s  of  GaAs  on  a  (001)  plane  in  nanoscale- 
patterned  growth  under  our  growth  conditions. 
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We  describe  the  growth  and  characterization  of  InAs  quantum  dots  (QDs)  on  a  patterned  GaAs 
substrate  using  metalorganic  chemical  vapor  deposition.  The  QDs  nucleate  on  the  (001)  plane  atop 
GaAs  truncated  pyramids  formed  by  a  thin  patterned  Si02  mask.  The  base  diameter  of  the  resulting 
QDs  varies  from  30  to  40  nm  depending  on  the  size  of  the  mask.  With  specific  growth  conditions, 
we  are  able  to  form  highly  crystalline  surface  QDs  that  emit  at  1.6  jam  under  room-temperature 
photopumped  conditions.  The  crystalline  uniformity  and  residual  strain  is  quantified  in 
high-resolution  transmission  electron  microscopy  images  and  high-resolution  x-ray  reciprocal  space 
mapping.  These  strained  QDs  may  serve  as  a  template  for  selective  nucleation  of  a  stacked  QD 
active  region.  ©  2004  American  Institute  of  Physics.  [DOI:  10.1063/1.1792792] 


In  the  strain-driven  self-assembled  quantum  dot  (QD) 
growth  process,  nonuniformity  in  the  wetting  layer  gives  rise 
to  QD  nucleation.1  The  nucleation  sites  are  weakly  linked  to 
surface  steps  that  are  not  uniformly  distributed;  the  QD 
nucleation  sites  are  thus  distributed  randomly  on  the  growth 
surface.  The  random  nucleation  results  in  a  nonuniform  QD 
size  distribution  and  a  broadened  inhomogeneous  linewidth, 
typically  >20  meV. 

Several  methods  have  been  used  to  define  the  nucleation 
sites  in  the  growth  plane  including  electron-beam 
lithography,2-4  scanning  tunneling  microscope-assisted 
nanolithography,5-7  and  optical  lithography.8  The  regrowth  of 
patterned  QDs  has  been  accomplished  using  metalorganic 
chemical  vapor  deposition  (MOCVD),2  chemical  beam 
epitaxy,3  and  molecular  beam  epitaxy.4-8  The  ultimate  goal 
of  this  research  is  to  demonstrate  an  inhomogeneously 
broadened  linewidth  similar  to  the  homogeneously  broad¬ 
ened  linewidth  (10  meV  at  300  K,  <1  meV  at  4  K).9  The 
formation  of  a  large,  dense,  and  uniform  QD  array  is  a  com¬ 
plex  task  requiring  a  stable,  uniform  patterning  process  that 
yields  a  clean,  undamaged  surface  for  regrowth.  To  date, 
even  room-temperature  photoluminescence  (RTPL)  from  re¬ 
grown  QDs  has  not  been  demonstrated. 

In  our  work,  we  have  used  an  optical  lithography  pat¬ 
terning  method  for  the  benefits  of  fast,  large-scale  exposure. 
Two  co-authors  (S.C.L.  and  S.R.J.B.)  have  pioneered  this 
patterning  process  for  homoepitaxial  regrowth  using  molecu¬ 
lar  beam  epitaxy.  In  this  manuscript,  we  describe  patterned 
QD  formation  using  MOCVD  utilizing  the  H2  carrier  gas  to 
produce  an  atomically  clean  postprocessing  surface.  Further¬ 
more,  we  have  designed  the  epistructure  to  reduce  nonradia- 
tive  centers  caused  by  dangling  bonds  and  strain-related  de¬ 
fects  that  result  from  patterning.  In  our  structures,  an 
InGaAs/GaAs  buffer  separates  the  QD  active  material  from 
the  regrowth  interface.  The  buffer  thickness  is  designed  to 
form  a  truncated  pyramid  on  which  the  QDs  nucleate.  The 
InAs  QDs  are  formed  by  strain  driven  processes  where  the 
nucleation  site  is  predefined  since  the  In  and  As  atoms  pref- 
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erentially  adhere  to  the  available  (001)  surface  atop  the 
GaAs  truncated  pyramid.  This  preferential  growth  on  the 
(001)  surface  has  been  noted  in  previous  nanopatterning 
work.3 

The  resulting  patterned  QDs  demonstrate  RTPL  mea¬ 
sured  using  a  5  mW  HeNe  pump  laser  (500  mW/cm2),  a 
lock-in  amplifier,  and  an  InGaAs  detector.  This  patterning 
and  regrowth  methodology  results  in  highly  crystalline  InAs 
pyramids  confirmed  by  high  resolution  transmission  electron 
microscopy  (HRTEM)  images  and  high-resolution  x-ray  re¬ 
ciprocal  space  mapping  (HRXRSM). 

The  basic  structure  for  processing  the  growth  mask  con¬ 
sists  of  a  negative  photoresist  (PR)  film,  a  45-nm-thick  Si02 
layer,  and  a  GaAs(001)  substrate.8  A  two-dimensional  array 
of  holes  is  formed  in  the  PR  using  an  interferometric  lithog¬ 
raphy  technique.  The  pattern  is  transferred  to  the  Si02  film 
with  dry  etching.  A  very  high  quality  Si02  film  is  necessary 
to  prevent  pin-hole  formation  during  the  etching  process. 
The  patterning  technique  and  subsequent  processing  to  gen¬ 
erate  the  Si02  pattern  are  fully  described  in  Ref.  8.  The  re¬ 
sulting  pattern  features  100-  to  200-nm-diam  holes  on  a 
360  nm  pitch  along  the  [110]  direction. 

The  patterned  pyramids  and  QDs  are  grown  in  a  vertical 
MOCVD  reactor  at  60  Torr  using  TMGa,  TMIn,  and  pure 
AsH3  as  source  materials.  Before  regrowth,  the  patterned 
GaAs  sample  is  deoxidized  at  900  °C  for  5  min.  Hydrogen  is 
flowed  during  this  process  to  remove  carbon,  residual  oxy¬ 
gen,  and  other  contamination  from  the  surface  prior  to  re¬ 
growth.  Growth  is  initiated  with  a  GaAs  buffer  layer  at  a 
growth  rate  of  1.5  A/s  and  V/III  ratio  of  200  at  800 °C  and 
results  in  a  truncated  pyramid  (—30  nm  in  height).  The 
growth  mode  is  characterized  by  a  low  growth  rate  and  high 
V/III  ratio  to  provide  a  very  smooth  GaAs  buffer  layer.  The 
high  V/III  ratio  is  necessary  as  the  group  III  elements  have 
higher  sticking  coefficient  than  group  V  elements;  this  effect 
is  amplified  by  the  patterned  surface.  From  initial  regrowth 
experiments  on  thin  stripe  patterns  in  Si02,  we  know  that  Ga 
atoms  diffuse  —2  jam  under  these  conditions.  This  diffusion 
length  translates  into  a  very  high  Ga  atom  concentration  in 
the  patterned  holes.  The  resulting  high  Ga  atom  concentra- 
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FIG.  2.  High-resolution  TEM  micrographs  of  (a)  patterned  surface  pyramids 
with  QDs,  (b)  HRTEM  of  a  single  patterned  InAs  QDs/GaAs  pyramid,  (c) 
HRTEM  of  an  InAs  QD  opportunistically  formed  at  the  pyramid  base. 


FIG.  1.  A.  Scanning  electron  microscopy  images  of  (a)  Si02  pattern  on  a 
GaAs(lOO)  substrate,  (b)  patterned  InAs  QDs  atop  GaAs  pyramids. 


tion  requires  a  high  column  V  source  flow  to  maintain  sto¬ 
ichiometry.  After  the  GaAs  truncated  pyramids  are  formed, 
the  temperature  is  lowered  and  stabilized  at  500 °C.  Quan¬ 
tum  dot  growth  is  commenced  with  5  ML  of  In0 15Gao  85As 
and  3  ML  of  InAs  at  a  growth  rate  of  1  ML/s  and  V /III  ratio 
of  25. 

Figure  1  shows  SEM  images  at  several  stages  of  pat¬ 
terned  growth  including  (a)  patterned  holes  in  the  Si02  mask, 
(b)  patterned  InAs  QDs  atop  GaAs  pyramids,  and  (c)  InAs 
QDs/GaAs  pyramids  formed  under  unoptimized  growth  con¬ 
ditions.  From  Fig.  1(a),  the  center-to-center  spacing  between 
two  holes  is  —360  nm  in  [110]  direction  and  510  nm  in 
[010].  Figure  1(b)  shows  a  clean  Si02  masked  surface  be¬ 
tween  the  InAs  pyramidal  features  demonstrating  a  highly 
selective  growth.  The  general  shape  of  pyramid  base  is  cir¬ 
cular  with  a  base  diameter  of  —  1 10  nm.  The  pyramidal  den¬ 
sity  is  >1  X  109  cm-2. 


InAs  QD  at  the  pyramid  base 
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Transmission  electron  microscopy  allows  measurement 
of  the  limiting  crystal  planes  and  process  uniformity.  Higher 
resolution  images  delineate  InAs  from  GaAs  materials  and 
confirm  crystal  integrity  especially  at  the  regrowth  interface. 
Figure  2  includes  several  cross-sectional  TEM  images  look¬ 
ing  at  the  (110)  crystal  plane  to  show  (a)  an  array  of  pat¬ 
terned  surface  InAs  QDs  on  GaAs  pyramids,  (b)  a  single 
InAs  QDs/GaAs  pyramid,  and  (c)  an  InAs  QD  that  nucleated 
(opportunistically)  at  the  pyramid  base  due  to  a  fabrication 
defect.  In  Fig.  2(a),  the  variation  in  Si02  hole  diameter  from 
120  to  150  nm  can  be  observed.  The  pattern  variation  results 
in  GaAs  pyramid  height  that  varies  from  30  to  40  nm  and 
QD  diameters  ranging  30  to  35  nm.  The  limiting  crystal 

plane,  which  is  a  function  of  substrate  material,  appears 
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highly  uniform  throughout  the  observed  images.  Figure  2(b) 
shows  crystal  faceting  along  the  (111)  plane  near  the  base 
and  (311)  near  the  apex.  As  discussed  in  previous  work,8  the 
pyramid  shape  changes  from  trapezoidal  to  triangular  de¬ 
pending  on  the  size  of  the  nanohole  and  the  amount  of  ma¬ 
terial  deposition.  We  have  designed  our  growth  sequence  to 
form  a  GaAs  pyramid  truncated  at  the  (001)  plane.  As  the 
(001)  plane  is  the  fastest  growing  plane,  the  InAs  prefers 
nucleation  on  this  plane.  Figure  2(c)  focuses  at  the  base  of 
one  pyramid  and  shows  the  formation  of  an  InAs  QD  be¬ 
tween  GaAs  buffer  and  Si02  mask  where  the  (001)  plane  has 
been  exposed  due  to  a  processing  defect. 

Figure  3  shows  RTPL  spectra  of  (a)  surface  QDs  on  a 

planar  substrate,  (b)  well-formed  patterned  surface  QDs,  and 
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FIG.  3.  Room  temperature  PL  spectra  from  (a)  self-assembled  QDs,  (b) 
patterned  QDs,  (c)  incompletely  nucleated  QDs  on  GaAs  pyramids. 

(c)  incomplete  nucleation  on  a  patterned  substrate.  The  self- 
assembled  (a)  and  the  well-formed  patterned  (b)  ensembles 
emit  very  similar  spectra  peaked  near  —1610  nm  with  a  full- 
width  at  half  maximum  (FWHM)  of  —140  meV.  The  broad 
FWHM  is  a  typical  characteristic  of  surface  SAQD 
emission.10  The  peak  intensity  of  the  patterned  QDs  is  curi¬ 
ously  two  times  greater  than  the  SAQDs.  While  this  may 
indicate  better  material  quality,  further  experimental  study  is 
necessary.  Finally,  there  is  no  PL  emission  from  the  irregular 
InAs/GaAs  pyramids  suggesting  poly  crystalline  nature. 

Figures  4(a)-4(c)  show  the  (044)  diffraction  space  maps 
for  (a)  the  SAQDs,  (b)  the  patterned  InAs  QDs,  and  (c) 
poorly  nucleated  QDs.  The  procedure  for  collection  and  cal¬ 
culations  from  angular  units  into  reciprocal  space  coordi¬ 
nates  of  these  maps  were  presented  in  previous 
publications.11,12  The  HRXRSM  from  the  SAQDs  in  Fig. 
4(a)  shows  only  the  projection  of  intensity  around  the  GaAs 
substrate  along  the  k-perpendicular  [001]  direction.  This  re¬ 
sult  indicates  the  wetting  layer  strain  in  the  growth  direction 
without  ordering  of  dots  in  the  lateral  direction.  The  interest¬ 
ing  features  appear  in  the  map  of  patterned  InAs  QDs,  Fig. 
4(b),  where  the  periodic  peaks,  located  symmetrically  around 
the  GaAs  substrate  peak  in  k-parallel  [110]  direction,  are 
observed  and  indicate  lateral  ordering  of  the  patterned  InAs 
QDs.  The  broadening  of  periodic  peaks  in  the  co  (omega) 
direction  indicates  crystal  homogeneity  in  each  ensemble. 
The  FWHM  values  are  (a)  2.3  X  10“4  A-1  for  the  SAQDs,  (b) 
1.9X  10“4  A-1  from  the  patterned  QDs,  and  (c)  3.0 
X  10-4  A-1  from  poorly  nucleated  QDs.  The  sharp  periodic 
peaks  of  Fig.  4(b)  suggest  uniform  crystal  plane  orientations 
compare  to  the  broad  spectrum  of  Fig.  4(c)  that  suggests  a 
highly  nonuniform  crystallographic  structure. 

In  conclusion,  we  have  demonstrated  growth  of  InAs 
QDs  atop  the  (001)  plane  of  patterned  GaAs  truncated  pyra¬ 
mids  using  MOCVD.  These  QDs  present  RTPL  from  pat¬ 
terned  InAs  QDs.  Data  from  HRTEM  and  HRXSM  analysis 
shows  high  crystalline  quality  material.  Of  course,  the  end 
goal  of  patterned  QD  work  is  to  demonstrate  an  inhomoge- 
neously  broadened  linewidth  that  is  very  similar  to  the  ho¬ 
mogeneously  broadened  linewidth.  Using  our  present 
method,  improved  QD  uniformity  requires  improved  pattern 
uniformity. 
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Selective  growth  of  In^Ga^As  on  a  Si02-mask  patterned  GaAs(001)  substrate  by  molecular  beam 
epitaxy  (MBE)  is  reported.  A  simple  model  for  selective  growth  is  derived  from  the  general  rate 
equation  of  thin  film  growth.  Based  on  this  model,  heteroepitaxial  selective  growth  of  In^Ga^As 
with  x  up  to  0.07  is  demonstrated  at  595  °C,  which  is  within  the  range  of  typical  MBE  growth 
temperatures  for  GaAs  but  is  above  the  desorption  temperature  of  In  atoms  from  a  GaAs  surface.  An 
incorporation  rate  of  In  atoms  of  about  0.22%  for  a  Ga  flux  of  2.0  X  1013  atoms /cm2  s  was  measured 
at  this  temperature.  The  concentration  of  In  atoms  effectively  incorporated  into  the  In^Ga^As  layer 
is  linearly  proportional  to  the  supplied  In  flux  in  the  range  0. 7-9. 2  X  1014  atoms/cm2  s.  The 
extremely  low,  but  finite,  In  incorporation  is  utilized  for  the  growth  of  5  nm  thick  In^Ga^As/GaAs 
quantum  wells  and  —300  nm  thick  In^Ga^As  layers.  Together  with  selective  growth,  the  surface 
morphology  and  optical  properties  of  high- temperature  grown  In^Ga^As  are  characterized.  © 
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I.  INTRODUCTION 

Selective  growth  is  deposition  restricted  to  a  desired  area 
on  a  patterned  surface.  In  group  IV  molecular  beam  epitaxy 
(MBE),  selective  growth  of  Si  and  Si^Ge^  utilizing  chemi¬ 
cal  reactions  between  a  Si02  mask  and  incoming  Si  and  Ge 
atoms  has  been  reported.1  In  III-V  MBE,  we  have  demon¬ 
strated  homoepitaxial  selective  growth  of  GaAs  on  a  Si02 
patterned  substrate  by  a  combination  of  desorption  of  Ga 
atoms  from  the  Si02  surface  and  Ga  migration  from  the  Si02 
to  a  nearby  open  GaAs  surface.2,3  Low-angle  incidence 
epitaxy4  and  migration  enhanced  epitaxy5  also  have  been  in¬ 
vestigated  to  achieve  selective  growth. 

Selective  growth  of  III-V  materials  has  strong  potential 
for  electronic  and  optoelectronic  devices,  especially  those 
requiring  large  lattice-mismatched  heterostructures.  To 
achieve  long  wavelength  emission  and  high  electron  mobility 
on  GaAs,  In^Ga^As  has  been  widely  used  as  an  active  re¬ 
gion  for  device  structures.  The  lattice  mismatch  between 
In  As  and  GaAs  is  7.2%. 6  For  device  applications,  however, 
this  is  too  large  for  In  As  or  In^Ga^As  on  GaAs  to  accom¬ 
modate  the  excess  strain  energy  with  a  layer-by-layer  Frank- 
Merwe  growth  mode,  and  results  in  a  change  to  Stranski- 
Krastanov  island  growth  mode  or  in  the  generation  of  misfit 
dislocations  severely  degrading  the  epilayer  quality.  Theoret¬ 
ical  considerations  predict  a  reduction  of  the  stress  of  a 
strained  epilayer  with  decreasing  deposition  area  in  the 
nanoscale  regime.7,8  Selective  growth  is  highly  flexible  in 
defining  the  epitaxial  area  on  a  planar  surface  from  macros¬ 
cale  to  nanoscale  regimes.  Then,  it  may  be  able  to  extend  the 
materials  available  for  better  device  performance  to  higher  In 
concentration,  lattice-mismatched  compositions. 

In  MBE  growth,  In  and  Ga  atoms  show  very  different 
sticking  behaviors.  Experimentally,  we  found  that  In  atoms 
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begin  to  desorb  from  a  GaAs  surface  around  560  °C  whereas 
Ga  atoms  desorb  at  —640  °C.9  Generally,  the  optimal 
growth  temperature  for  In^Ga^As  is  lower  than  the  desorp¬ 
tion  temperature  of  In  atoms  and  thus  is  below  —560  °C 
while  the  optimal  temperature  for  GaAs  growth  is 
580-600  °C.  To  grow  a  high  quality  In^Ga^As/GaAs  het¬ 
erostructure,  it  is  desirable  to  vary  the  substrate  temperature 
as  different  materials  are  grown.  Before  increasing  the 
growth  temperature  for  the  deposition  of  GaAs  on  an 
In^Ga^As  layer,  the  surface  is  covered  with  a  few  nanom¬ 
eters  thick  low-temperature  GaAs  layer  to  avoid  the  desorp¬ 
tion  of  In  atoms.10 

Although  no  systematic  investigation  of  the  desorption 
of  In  and  Ga  atoms  on  a  Si02  surface  has  been  reported,  the 
relationship  between  their  respective  desorption  temperatures 
is  expected  to  be  similar  to  that  obtained  on  a  GaAs  surface. 
To  achieve  selective  growth  on  a  Si02-patterned  GaAs  sub¬ 
strate,  negligible  deposition  on  Si02  must  be  maintained 
throughout  all  layers  of  the  hetero structure  growth.  We  have 
demonstrated  selective  growth  of  GaAs  and  InAs  separately 
at  very  different  temperatures:  630  °C  for  GaAs  and  535  °C 
for  InAs.3,11  Thus,  the  requirement  of  zero  deposition  on  a 
Si02  surface,  meaning  no  nucleation  and  incorporation  under 
a  given  growth  condition,  is  another  restriction  which  makes 
it  more  difficult  to  grow  GaAs  selectively  on  In^Ga^As 
without  In  desorption.  This  is  because  the  low-temperature 
grown  GaAs  protection  layer  must  be  also  prepared  in  a  se¬ 
lective  growth  mode  at  a  temperature  below  560  °C.  As  seen 
below,  lowering  the  selective  growth  temperature  below 
560  °C  may  be  possible  by  reducing  the  Ga  flux  to  a  very 
low  level  (<^2.0  X  1013  atoms/cm2  s  used  in  this  work). 
However,  it  is  difficult  to  precisely  control  the  Ga  flux  at 
these  low  values.  Also,  the  Ga  vapor  pressure  at  a  given 
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temperature  must  be  considered  for  finite  deposition  on  a 
GaAs  surface  if  the  Ga  flux  is  extremely  low. 

In  this  work,  we  take  a  different  approach  to  heteroepi- 
taxial  selective  growth.  The  optimal  growth  temperature  for 
GaAs  is  580-600  °C.  Setting  the  growth  temperature  for 
both  GaAs  and  In^Ga^As  (with  low  x)  in  this  range,  we 
examine  the  sticking  behavior  of  In  atoms.  It  has  been  re¬ 
ported  that  In  atoms  have  a  nonzero  sticking  coefficient 
above  560  °C.12,13  If  selective  growth  of  GaAs  can  be  real¬ 
ized  for  580-600  °C,  In  atoms  definitely  do  not  adsorb  on  a 
Si02  surface  at  this  range,  and  selective  growth  of 
In^Ga^As  as  well  as  GaAs  will  be  achieved  at  the  same 
temperature.  This  has  a  strong  advantage  for  the  subsequent 
deposition  of  additional  layers  on  In^Ga^As  without  re¬ 
growth. 

In  this  work,  we  first  present  a  simple  model  for  selec¬ 
tive  growth  derived  from  the  general  rate  equation  of  thin 
film  growth.  A  relation  between  the  incoming  flux  and 
growth  temperature  is  proposed  in  Sec.  II.  Relying  on  this 
model,  we  demonstrate  selective  growth  of  InxGaj_xAs  on  a 
Si02-patterned  GaAs  substrate  with  the  variation  of  growth 
temperature  and  In  flux.  Experimental  details  are  summa¬ 
rized  in  Sec.  III.  In  Sec.  IV,  we  examine  selective  growth  of 
GaAs  depending  on  growth  temperature  for  a  fixed  Ga  flux 
and  find  the  low-temperature  limit.  We  measure  an  incorpo¬ 
ration  rate  of  In  atoms  at  this  temperature  that  is  extremely 
low  (~2X  10-3)  but  sufficient  for  growth  of  quantum  wells 
(QWs)  and  even  thick  bulk  In^Ga^As  layers.  Selective 
growth  of  In^Ga^As  with  v~0.07  at  595  °C  is  achieved 
with  variation  of  the  In  and  Ga  fluxes.  Material  properties  of 
high-temperature  grown  In^Ga^As  such  as  surface  mor¬ 
phology  and  optical  properties  are  reported.  Finally,  a  sum¬ 
mary  and  conclusions  are  presented  in  Sec.  V. 


II.  A  THEORY  OF  SELECTIVE  GROWTH 

Selective  growth  is  achieved  when  zero  deposition  oc¬ 
curs  on  mask  surface  while  finite  deposition  proceeds  on  the 
desired  portions  of  the  open  substrate  surface.  First,  we  focus 
on  zero  deposition  on  the  mask  surface  which  is  the  neces¬ 
sary  condition  for  selective  growth.  In  principle,  zero  depo¬ 
sition  on  any  ideal  surface  is  always  possible  in  the  follow¬ 
ing  way.  Here,  ideal  means  free  from  any  extrinsic 
nucleation  source  such  as  impurities  and  crystal  defects  or 
the  fluctuation  and  variation  of  surface  potential  by  atomic 
steps.  On  such  an  atomically  flat  perfect  surface,  a  single 
incident  atom  has  a  finite  surface  residence  time  r  and  even¬ 
tually  desorbs  from  it.  This  means  that  the  sequential  inci¬ 
dence  of  a  single  atom  with  a  time  interval  longer  than  r 
results  in  zero  deposition  on  a  given  surface.  Within  the  resi¬ 
dence  time  r,  the  atom  migrates  on  the  surface  until  it  des¬ 
orbs  from  it.  Statistically,  the  distance  of  migration  is  re¬ 
ferred  to  as  surface  diffusion  length  LD  and  is  represented  by 
V ,  where  D  is  the  surface  diffusion  constant.14 

In  crystal  growth,  however,  multiple  atoms  are  simulta¬ 
neously  incident  on  a  given  surface.  If  the  number  of  atoms 
incident  on  and  remaining  in  a  monatomic  state  on  a  unit 
area  at  time  t  is  n ,  then  n  satisfies  the  rate  equation:15 
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00 

dn  n  2 

—  =  F-~-2(jL>ln  -n2j(*)inh  (1) 

dt  t  i=2 

where  F  is  an  incident  flux  on  an  unit  area  per  unit  time, 

(i= 2,...)  is  the  incorporation  rate  of  a  single  atom  into  a 
cluster  consisting  of  i  atoms,  and  nt  is  a  number  density  of 
/-atom-sized  clusters.  The  third  term  describes  consumption 
of  single  atoms  through  the  formation  of  dimers,  and  this 
term  with  dimerization  rate  of  o)1  and  the  fourth  term  thus 
correspond  to  nucleation  and  clustering.  Every  n{  for  2 
will  have  an  equation  similar  to  Eq.  (1).  We  have  neglected 
desorption  of  single  atoms  from  an  existing  cluster  in  Eq.  (1) 
which  rarely  occurs  under  typical  thin  film  growth  condi¬ 
tions. 

Consider  deposition  of  GaAs  on  a  Si02-patterned  GaAs 
substrate.  Let  LD  denote  the  surface  diffusion  length  of  a  Ga 
atom  on  a  Si02  surface.  The  lateral  dimension  of  the  Si02 
mask  is  assumed  to  be  much  greater  than  LD  of  Ga  atoms  at 
a  given  growth  condition.  Then,  n  is  the  density  of  mon¬ 
atomic  Ga  atoms  on  a  unit  area.  The  basic  requirement  of 
selective  growth  is  zero  deposition,  that  is,  the  absence  of 
nucleation  or  clustering  on  the  Si02  surface.  That  is,  nt= 0 
for  /5s  2  and  the  absence  of  dimer  formation  on  the  Si02 
surface.  Then,  the  rate  equations  for  nt  for  i^2  do  not  need 
to  be  considered.  By  applying  these  conditions  to  Eq.  (1),  the 
rate  equation  for  n  of  the  Si02  surface  for  zero  deposition 
can  be  written  as 


which  is  the  equation  describing  zero  deposition  on  a  Si02 
surface.  Theoretically,  some  clusters  of  which  the  size  is  be¬ 
low  a  certain  critical  value  can  migrate  on  the  surface.  Also, 
not  only  Ga  atoms,  but  also  other  migrating  species  such  as 
GaAs  molecules  could  be  involved  in  the  equation  for  n{ 
similar  to  Eq.  (1)  for  III-V  semiconductors.  In  a  more  rigor¬ 
ous  approach,  the  rate  equations  for  nt  for  i^2  should  be 
considered.  In  this  work,  migrating  dimers  or  clusters  for  i 
^  2  are  not  considered  because  they  ultimately  play  the  role 
of  immobile  nucleation  centers  as  growth  is  continued.  Also, 
we  do  not  consider  migration  of  Ga  atoms  across  the  bound¬ 
ary  between  the  Si02  mask  and  the  open  GaAs  substrate 
surface,  since  that  effect  is  not  significant  in  achieving  selec¬ 
tive  growth  mode  if  the  lateral  dimension  of  the  Si02  mask  is 
much  greater  than  LD.  Monatomic  Ga  atoms  must  be  the 
major  migrating  species  in  deposition  of  GaAs  on  a  Si02 
surface  and  their  behavior  described  by  Eq.  (2)  is  very  im¬ 
portant  in  understanding  zero  deposition.  Again,  it  should  be 
emphasized  that  these  conditions  are  for  the  Ga  atoms  mi¬ 
grating  on  a  wide-area  unpatterned  Si02  surface. 

In  steady  state,  Eq.  (2)  simplifies  to  F=n/r.  To  keep  zero 
deposition  on  the  mask,  the  surface  diffusion  length  of  a  Ga 
atom  from  its  initial  point  of  incidence  without  encountering 
other  Ga  atoms  on  the  Si02  surface  before  their  desorption 
must  be  less  than  a  certain  characteristic  length  Lc~n~m 
obtained  from  the  assumption  of  uniform  distribution  of  n 
atoms  on  the  unit  area  of  the  mask.  Then,  Eq.  (2)  in  steady 
state  can  be  rewritten  as 
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under  the  condition  of  LD^LC.  In  Eq.  (3),  Fc  is  a  critical  flux 
explained  below.  In  Eq.  (3),  D  =  Cd  exp(-Edm/kBT)  where 
Fdiff  is  the  activation  energy  of  a  Ga  atom  for  surface  diffu¬ 
sion  on  a  Si02  surface,  Cd  is  a  proportionality  constant,  kB  is 
Boltzmann’s  constant,  T  is  the  growth  temperature,  and  r 
=  Pq1  exp (EdQJkBT)  where  v0  is  a  desorption  rate  constant 
and  Fdes  is  an  activation  energy  of  a  Ga  atom  for  desorption 
from  a  Si02  surface.  Then,  Fc  of  Eq.  (3)  roughly  satisfies  the 
following  condition: 

vl 

Ec  -  exp[-  (2 Fdes  -  Edm)/kBTs ] .  (4) 

Ed 

It  should  be  noted  that  Fdes  and  Ediff  are  only  weakly  depen¬ 
dent  on  T  and  temperature  dependence  of  the  growth  is  de¬ 
termined  largely  by  the  explicit  exponential  dependence.  In 
Eq.  (4),  Ts  is  the  growth  temperature  above  which  a  selective 
growth  mode  is  available  for  a  given  Ec.  If  2Edes> Edm 
(which  is  valid  in  most  thin  film  growth)  is  assumed,  zero 
deposition  on  the  mask  at  lower  T  is  possible  by  reducing  F 
and  conversely,  zero  growth  can  be  achieved  at  a  higher  flux 
if  the  growth  temperature  is  raised.  Thus,  Fc  gives  the  critical 
flux  below  which  zero  deposition  on  the  Si02  mask  is 
achieved  at  a  given  Ts. 

Selective  growth  requires  finite  deposition  on  the  desired 
substrate  surface  as  well  as  zero  deposition  on  the  mask  sur¬ 
face.  At  a  Ga  flux  below  Fc,  therefore,  the  Ga  atoms  incident 
on  a  substrate  surface  must  satisfy  Eq.  (1)  with  different  r 
and  different  (of  s  and  also  with  nonzero  nfs  (i  ^  2)  for  finite 
nucleation  to  lead  to  epitaxial  growth.  Thus,  the  actual 
growth  conditions  must  be  properly  set  to  satisfy  this  re¬ 
quirement.  In  previous  work,  we  have  reported  the  homoepi- 
taxial  selective  growth  condition  of  GaAs  on  a 
Si02-patterned  substrate.2  Basically,  Eq.  (4)  assumes  that  ev¬ 
ery  atom  incident  on  a  patterned  substrate  undergoes  a  pro¬ 
cess  of  finite-time  residence  for  subsequent  incorporation/ 
desorption.  Although  Eq.  (4)  is  a  rough  approximation,  it 
consistently  agrees  with  the  experimental  results  as  shown  in 
Sec.  IV. 

III.  EXPERIMENT 

Semi-insulating  GaAs(OOl)  wafers  were  used  as  sub¬ 
strates.  On  each  10  X  10  mm2  substrate,  a  45  nm  thick  Si02 
film  was  deposited  by  electron-beam  evaporation.  With  con¬ 
ventional  photolithography  and  chemical  etching  process,  a 
half  of  the  Si02  film  (—5  X  10  mm2)  was  removed  for  ex¬ 
amination  of  selective  growth.  All  substrates  were  cleaned  in 
an  oxygen  plasma  to  remove  residual  organic  contaminants 
and  by  diluted  HF  (HF :  H20=  1 : 400)  to  expose  a  fresh  Si02 
surface.  As  seen  later,  the  HF  treatment  removing  at  least  a 
5-6  nm  thick  layer  from  the  surface  is  very  critical  for 
avoiding  deposition  on  the  Si02  surface. 

According  to  the  model  proposed  in  Sec.  II,  Ts  and 
group  III  fluxes  are  important  for  selective  growth.  To  char¬ 
acterize  Ts  for  GaAs,  a  fixed  Ga  flux,  FGa,  —2.0 
X  1013  atoms /cm2  s  was  employed.  By  changing  growth 
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temperature,  Ts  for  GaAs  of  about  595  °C  at  this  FGa  was 
measured.  A  cracker  cell  was  used  to  supply  As2  as  a  major 
component  of  As  flux.  Beam  equivalent  pressure  of  each 
element  is  denoted  by  P  with  subscripts  Ga,  In,  and  As2.  The 
ratio  of  Ga  to  As2  in  beam  equivalent  pressure,  PASr  JPg* 
was  changed  from  25  to  100  to  characterize  the  effect  of  As 
overpressure  on  selective  growth.  For  7^=595  °C  and  FGa 
given  above,  In  flux,  FIn,  was  varied  from  0.7-9. 2 
X 1014  atoms  /cm2  s  to  examine  selective  growth  of 
In^Ga^As.  An  In  As  substrate  was  employed  to  calibrate  ^In 
with  reflection  high-energy  electron  diffraction  (RHEED)  os¬ 
cillations  at  T=510°C  where  the  incorporation  rate  of  In 
atoms  is  unity.  The  substrate  temperature  was  monitored  by 
optical  pyrometry. 

As-grown  samples  were  examined  with  field-emission 
electron  microscopy  [scanning  electron  microscopy (SEM)], 
77  K  photoluminescence  (PL),  and  double  crystal  x-ray  dif- 
fractometry  (DXRD).  Indium  composition  v  of  In^Ga^As 
layers  was  measured  by  DXRD  and  PL.  The  incorporation 
rate  of  In  atoms  at  high  temperature  was  measured  with 
5  nm  thick  In^Ga^As/GaAs  QWs  by  PL.  Selective  growth 
of  In^Ga^As  on  a  Si02-patterned  GaAs(001)  substrate  was 
examined  with  —300  nm  thick  layers  through  SEM.  The  In 
composition  and  strain  relaxation  were  evaluated  from  the 
(004)  symmetric  and  (115)  asymmetric  reflections  of  DXRD. 

IV.  RESULTS  AND  DISCUSSION 
A.  Selective  growth  of  GaAs 

Previously,2  we  have  reported  homoepitaxial  selective 
growth  of  GaAs  on  a  Si02-patterned  substrate  depending  on 
growth  temperature  for  a  fixed  FGa.  For  FGa=6. 5 
X  1013  atoms /cm2  s,  the  apparent  sticking  coefficient  of  Ga 
atoms  on  a  Si02  surface,  5'Ga.sio2,  has  a  negligible  value  of 
less  than  0.01  for  7^615  °C.2  Here,  5'Ga.SiQ2  is  defined  as 
the  fractional  area  covered  by  GaAs  on  a  Si02  surface  for  a 
given  set  of  growth  conditions.  In  Sec.  II,  an  ideal  surface 
was  assumed  for  simplicity,  but  this  surface  is  not  available 
in  actual  growth.  Generally,  external  nucleation  centers  al¬ 
ways  exist  on  a  Si02  surface.  Such  nucleation  centers  have  a 
nonzero  probability  of  being  activated  during  growth,  which 
increases  with  both  growth  time  and  supplied  fluxes.  In  other 
words,  nucleation  is  unavoidable  as  the  growth  time  is  ex¬ 
tended  and  ultimately  selective  growth  is  not  available  in 
actual  growth.  Then,  selective  growth  requires  a  certain  cri¬ 
terion  on  5'Ga.sio2  and  must  be  characterized  under  specific 
growth  conditions  including  growth  time  or  supplied 
amount.  In  this  work,  for  the  consistency  with  the  previous 
work,  we  set  5,Ga_sio2~0.01  on  a  Si02  surface  as  the  upper 
limit  of  selective  growth  at  a  given  growth  condition. 

By  reducing  FGa  from  6.5  to  2.0  X  1013  atoms/cm2  s,  Ts 
of  GaAs  was  lowered  from  —615  to  —595  °C.  This  ex¬ 
perimental  result  is  consistent  with  Eq.  (4).  Figure  1  shows 
the  45°-tilted  SEM  images  taken  near  the  boundary  between 
the  Si02  mask  and  the  substrate  of  the  samples  grown  at  585, 
591,  and  595  °C  with  PAs2/PGa— 25.  To  enhance  the  preci¬ 
sion  of  the  temperature  measurement,  deposition  was  exam¬ 
ined  at  a  given  growth  temperature  at  least  two  or  three 
times.  The  temperatures  referred  in  this  work  were  measured 
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FIG.  1.  45°  tilted  SEM  images  of  GaAs  grown  on  a  Si02-patterned  substrate 
at  585,  591,  and  595  °C.  A  dashed  line  indicates  the  boundary  between 
GaAs  and  Si02. 

before  starting  growth.  More  than  an  hour  preheating  at  a 
given  growth  temperature  under  an  As-rich  environment  pro¬ 
vides  a  well- stabilized  temperature.  Run-to-run  variation  of 
growth  temperature  between  before  and  after  deposition  was 
about  +1  °  C  by  optical  pyrometry.  The  deposition  thickness 
for  each  sample  was  200  nm.  The  dashed  line  roughly  indi¬ 
cates  the  boundary  between  a  GaAs  epilayer  and  a  Si02 
mask.  In  each  image  of  Fig.  1,  the  islands  on  the  Si02  sur¬ 
face  are  GaAs.  As  seen  in  Fig.  1,  the  Si02  surface  is  almost 
covered  by  GaAs  islands  at  T=585  °C.  As  T  is  increased,  the 
number  and  size  of  GaAs  islands  are  reduced  and  finally 
become  negligible  (5,Ga-sio2<0.01)  at  T- 595  °C.  This 
growth  condition  was  used  for  selective  growth  of 
In^Ga^As  in  the  following  experiments.16 

In  selective  growth,  FGa  has  been  considered  as  a  major 
factor  but  the  As  flux,  FAs  can  be  also  important  in  As-based 
MBE  because  of  the  formation  of  GaAs  molecular  species 
which  may  play  the  role  of  a  nucleation  center  on  a  Si02 
surface.  To  examine  whether  As  overpressure  influences  the 
selective  growth  mode,  P^JPq^  was  increased  from  25  to 
100.  Figure  2(a)  shows  a  45°  tilted  SEM  image  from  the 
sample  grown  at  595  °C  under  Pp&  IP qvl~  100.  As  seen  in 
this  figure,  selective  growth  mode  is  kept  without  nucleation 
on  the  Si02  mask.  Thus,  in  the  range  examined  in  this  work, 
high  As  overpressure  does  not  induce  nucleation  on  a  Si02 
surface  confirming  that  FGa  is  the  major  parameter  for  selec¬ 
tive  growth  of  GaAs. 

All  the  Si02-patterned  substrates  used  for  selective 
growth  were  treated  with  diluted  HF  to  remove  a  very  thin 


FIG.  2.  (a)  A  45°  tilted  SEM  image  of  GaAs  grown  on  a  Si02-patterned 
substrate  at  595  °C  with  PA$2/PGa~  100.  (b)  A  45°  tilted  SEM  image  of 
GaAs  grown  on  a  Si02-patterned  substrate  at  630  °C  with  PA&2/  PGa~40 
without  HF  treatment.  A  dashed  line  indicates  the  boundary  between  GaAs 
and  Si02. 

layer  (—5-6  nm  in  thickness)  from  the  Si02  surface  imme¬ 
diately  before  loading  them  into  the  MBE  growth  chamber. 
This  process  is  very  critical  for  selective  growth.  Figure  2(b) 
shows  a  45°  tilted  SEM  image  of  the  sample  grown  at 
630  °C  with  the  same  FGa  but  without  the  HF  treatment.  In 
Fig.  2(b),  significant  nucleation  of  GaAs  occurs  on  the  Si02 
mask  at  a  temperature  considerably  higher  than  Ts.  Before 
being  loaded  into  the  MBE  chamber,  the  Si02-patterned  sub¬ 
strate  undergoes  a  wafer  patterning  and  cleaning  process. 
The  comparison  of  Fig.  2(b)  with  Fig.  1  suggests  that  the 
Si02  surface  is  contaminated  or  damaged  and  some  external 
nucleation  centers  are  generated  during  the  patterning  pro¬ 
cess.  Therefore,  removal  of  such  nucleation  sources  by  di¬ 
luted  HF  is  very  important  to  achieve  selective  growth  rely¬ 
ing  on  desorption  of  Ga  atoms  from  the  Si02  surface. 

B.  Incorporation  of  In  atoms  at  high  temperature 

Selective  growth  of  In^Ga^As  on  GaAs  is  the  major 
goal  of  this  work  and  the  desorption  temperature  of  InAs 
—  560  °C  is  referred  to  as  Tdes.  In  Sec.  IV  A,  Ts  for  selective 
growth  of  GaAs  was  determined  as  595  °C.  This  is  in  the 
optimal  range  of  GaAs  growth  but  higher  than  ^des-  From  Eq. 
(4),  Ts  can  be  lowered  further  by  decreasing  ^Ga-  However, 
the  accuracy  in  control  of  FGa  at  very  low  range  and  the 
required  growth  time  for  a  given  thickness  which  is  inversely 
proportional  to  ^Ga  make  this  strategy  problematic.  Instead, 
it  is  useful  to  investigate  the  incorporation  of  In  atoms  on  a 
GaAs  substrate  which  is  significantly  affected  by  tempera¬ 
ture  near  and  above  Tdes.  Then,  it  is  desirable  to  fix  T  at 
595  °C  and  to  examine  selective  growth  of  In^Ga^As  with 
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FIG.  3.  (a)  77  K  PL  spectra  of  5  nm  thick  In/ja^As/GaAs  QWs  with 
variation  of  Fin  from  4.2  to  9.2  X  1014  atoms/cm2  s.  (b)  PL  spectra  shown 
in  (a)  with  focusing  on  peak  1 .  PL  intensity  was  normalized  with  respect  to 
that  of  Fln= 9.2X  1014  atoms/cm2  s. 

variation  of  Fw  A  major  concern  is  then  to  determine  the 
incorporation  rate  of  In  atoms  on  a  GaAs  surface  at  595  °C. 
Another  issue  in  these  high  temperature  grown  In^Ga^As 
layers  is  the  quality  of  the  epilayer  such  as  crystallinity,  sur¬ 
face  morphology,  and  optical  properties.  First,  we  examined 
the  sticking  behavior  of  In  atoms  at  this  temperature  and 
applied  the  results  to  selective  growth  of  In^Ga^As  on  a 
Si02-patterned  GaAs(OOl)  substrate.  While  the  In  incorpora¬ 
tion  at  this  temperature  is  extremely  low  as  seen  below,  we 
keep  PAs2/PGa~  100  considering  the  addition  of  FIn  to  FGa 
for  growth  of  In^Ga^As. 

With  the  variation  of  Fm,  the  incorporation  rate  of  In 
atoms  at  T=  595  °C  was  measured  for  the  growth  of 
LqGa^As/GaAs  QWs.  The  In  flux  was  changed  from 
4.2  to  9.2  X  1014  atoms/cm2  s.  Based  on  the  preliminary 
data  about  In  incorporation,  the  width  of  the  QW  was  con¬ 
trolled  to  5  nm  by  growth  time  determined  by  both  ^Ga  and 
Fib.  On  every  In^Ga^As  layer,  a  150  nm  thick  GaAs  cap¬ 
ping  layer  was  deposited.  The  range  of  used  in  this  sec¬ 
tion  is  for  the  measurement  of  the  In  incorporation  rate  at 
595  °C.  As  discussed  in  the  following  section,  selective 
growth  of  In^Ga^As  at  the  given  growth  conditions  (FGa 
=  2.0 X  1013/cm2  at  595  °C)  is  achieved  only  for  /qn<6.0 
X  1016/cm2. 

Figure  3  shows  PL  spectra  obtained  from  5  nm  thick 
In^Ga^As/GaAs  QWs  grown  with  different  FIn’s.  All  the 
spectra  were  taken  at  77  K  with  a  632.8  nm  He-Ne  laser  and 


(b) 

FIG.  4.  (a)  Relation  between  FIn  and  x  measured  from  PL  shown  in  Fig.  3. 
(b)  Relation  between  Fm  and  Gn,inc- 

a  room  temperature  Ge  detector.  In  Fig.  3,  three  PL  peaks 
were  observed.  Among  them,  the  PL  peak  at  1.51-1.52  eV  is 
very  weak  and  is  almost  independent  of  Fi„  and  is  thus  from 
the  GaAs  substrate.  As  Fln  is  increased,  the  remaining  two 
major  peaks  are  shifted  to  lower  energy  but  with  a  nearly 
constant  peak  separation.  The  peak  separation  shows  a  little 
fluctuation  and  is  about  38-45  meV  without  particular  trend 
depending  on  For  convenience,  the  high  and  the  low 
intensity  PL  peak  in  each  spectrum  are  denoted  by  peak  1 
and  2,  respectively.  Figure  4(b)  shows  the  variation  of  peak  1 
energy  depending  on  Fin  with  PL  intensity  normalized  with 
respect  to  that  of  FIn= 9.2  X  1014  atoms/cm2  s. 

As  Fln  is  increased  from  4.2  to  9.2  X  1014  atoms/cm2  s, 
the  energy  of  peak  1  varies  from  1.4813  to  1.4305  eV.  The 
shift  of  PL  peak  1  to  low  energy  with  increase  of  FIn  means 
an  increase  in  the  In  composition  of  the  QW.  Also,  as  seen  in 
Fig.  4,  the  PL  intensity  is  rapidly  increased  with  Fln.  These 
results  suggest  that  quantum  confinement  is  proportionally 
enhanced  by  a  potential  energy  barrier  which  depends  on  the 
In  amount  in  a  QW  structure,  and  therefore,  imply  that  peak 
1  originates  from  the  transition  between  the  lowest  conduc¬ 
tion  subband  and  the  lowest  valence  subband  of  the  QW. 
Then,  peak  2  in  each  spectrum  could  be  the  transition  from 
the  lowest  conduction  subband  to  impurities. 

With  the  empirical  formula  for  the  relation  between  PL 
peak  energy  and  i  of  a  5  nm  thick  In^Ga^As/GaAs  QW 
reported  in  our  previous  work,  the  In  composition  was  esti¬ 
mated  from  the  PL  peak  energies  and  is  summarized  in  Table 
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TABLE  I.  Summary  of  experimental  results  obtained  from  PL  and  DXRD. 


FIn(X  1014/cm2  s 

PL  (77  K) 

DXRD 

Peak  separation  (arc  sec) 

In  composition 

X 

Gn,inc(X1012/cm2s) 

)  Peak  1  (eV) 

(004) 

(115) 

0.7 

180 

324 

0.010 

0.2 

4.2 

1.4813 

0.039 

0.8 

4.2 

544 

1163 

0.041 

0.8 

5.2 

1.4730 

0.048 

1.0 

6.0 

746 

1310 

0.051 

1.1 

6.5 

1.4531 

0.070 

1.5 

7.8 

1.4489 

0.074 

1.6 

9.2 

1.4305 

0.094 

2.1 

I.17  As  Fln  is  increased  from  4.2  to  9.2  X  1014  atoms/cm2  s, 
x  varies  linearly  from  0.039  to  0.094  as  shown  in  Fig.  4(a). 
For  this  range  of  misfit,  the  critical  thickness  of  v= 0.094 
In^Ga^As  is  at  least  three  times  5  nm  and  the  QWs  of  Fig. 
4(a)  should  be  assumed  to  be  strained  even  with  the  high 

growth  temperature  justifying  the  use  of  the  empirical  for- 

18 

mula  to  evaluate  the  In  mole  fraction  in  Table  I. 

The  incorporation  rate  of  In  atoms  on  a  GaAs  surface  at 
595  °C,  a ,  defined  as 


can  be  extracted  from  Table  I  or  Fig.  4(a).  In  Eq.  (5),  ^ln,inc 
is  an  amount  of  an  effective  In  flux  which  is  incorporated 
into  an  In^Ga^As  layer.  From  the  In  compositions  mea¬ 
sured  by  PL,  FIn  inc  of  Table  I  can  be  calculated  and  is  plotted 
in  Fig.  4(b).  The  least  square  fit  of  the  experimental  results 
with  linear  dependence  of  EIn  inc  on  FIn  is  indicated  by  the 
solid  line.  At  595  °C  with  FGsl= 2.0 X  1013  atoms/cm2  s  and 
^As2/^Ga~  100,  ot  is  —0.0022. 

The  measured  a  is  extremely  low  compared  with  the 
reported  data  which  is  about  0. 3-0.4  at  similar  growth 
temperatures.12,13  The  previous  work  used  mass  spectrometry 
and  RHEED  oscillations  to  measure  the  desorption  In  flux 
during  growth  of  an  In^Ga^As  layer.  These  experimental 
techniques  are  very  sensitive  to  surface  state  such  as  In  cov¬ 
erage  and  segregation.  On  the  other  hand,  we  characterized  a 
through  the  bulk  In^Ga^As  layers  by  PL  and  DXRD. 

From  in  situ  RHEED,  we  have  observed  that  the  (2 
X  4)  reconstruction  pattern  from  the  GaAs  surface  is 
changed  to  (4X2)  when  InAs  begins  to  grow  on  GaAs  at 
595  °C.  This  transition  was  observed  even  for  the  lowest  FIn 
examined  in  this  work,  where  PAs2/PIn  — 20.  Depending  on 
Fm  there  is  a  finite  time  delay  from  the  starting  point  of  InAs 
deposition  to  the  buildup  of  the  (4  X  2)  pattern.  The  higher 
Fin  the  shorter  the  time  delay.  The  time  delay  changes  from 
less  than  1  to  —  20  s  as  FIn  is  reduced  from  9.2  to  0.7 
X  1014  atoms /cm2  s.  When  the  In  supply  is  stopped,  the  (4 
X  2)  pattern  immediately  changes  back  to  the  (2  X  4)  pattern. 
A  similar  variation  also  occurs  for  growth  of  In^Ga^As  (FIn 
and  FGa)  with  a  time  delay  shorter  than  that  observed  with 
only  Fjn-  This  means  that  the  change  of  RHEED  pattern  is 
related  to  the  presence  of  In  atoms  on  the  surface,  but  that 


the  Ga  atoms  also  play  a  role,  perhaps  in  assisting  the  In¬ 
atom  adsorption. 

At  FIn=7.0X  1013  atoms/cm2  s  and  PAs2/PIn  — 20,  the 
change  of  RHEED  pattern  mentioned  above  does  not  occur 
for  T<  Tdes  in  growth  of  InAs  on  GaAs.  On  the  contrary, 
RHEED  exhibits  a  change  to  a  spotty  pattern  indicating  the 
formation  of  quantum  dots.  Also,  growth  of  GaAs  at  the 
same  As  overpressure  with  the  ^Ga  equal  to  Fi„=7.0 
X  1013  atoms /cm2  s  proceeds  keeping  (2X4)  pattern  at 
595  °C  without  change  of  a  RHEED  pattern.  These  suggest 
that  the  change  of  RHEED  pattern  in  the  In^Ga^As  growth 
is  not  due  to  As  deficiency.  As  measured  above,  the  sticking 
coefficient  of  In  atoms  is  extremely  low  at  595  °C,  and 
FIninc  is  far  below  FIn  as  shown  in  Fig.  4(b)  or  equivalently 
much  less  than  ^Ga  under  the  given  growth  conditions. 

The  change  of  RHEED  pattern  in  the  In^Ga^As  growth 
at  595  °C  implies  that  the  front  growth  surface  of  InAs 
growth  becomes  In  rich.  In  other  words,  like  InAs  growth 
below  ^des  where  an  In  floating  layer  formed  by  segregation 
effect  on  a  GaAs  surface,  an  In  adlayer  is  also  generated  on 
the  surface  even  at  595  °C.19  Unlike  the  growth  below  Tdes, 
however,  most  of  In  atoms  trying  to  form  the  adlayer  evapo¬ 
rate  from  the  surface  as  the  In  supply  stops  because  of  high 
growth  temperature.  Quantitative  analysis  about  the  incorpo¬ 
ration  of  In  atoms  above  Tdes  will  be  reported  elsewhere  but 
the  change  of  RHEED  pattern  and  the  desorption  of  the  In 
adlayer  from  the  front  growth  surface  suggest  that  a  signifi¬ 
cant  amount  of  In  atoms  resides  on  the  front  surface  during 
growth  and  that  the  In  amount  at  the  surface  of  In^Ga^As 
could  be  very  different  from  the  incorporation  of  In  atoms 
effectively  measured  from  bulk  In^Ga^As.  Even  if  In  atoms 
are  incorporated  into  an  In^Ga^As  layer,  they  can  diffuse 
out  to  or  segregate  to  and  subsequently  desorb  from  the  front 
growth  surface  if  they  are  very  close  to  it.  The  comparison  of 
a  bulk  with  a  surface  in  the  measurement  of  In  incorporation 
rate  must  consider  this  difference  for  consistent  interpreta¬ 
tion. 

Another  possibility  for  the  large  discrepancy  in  a  be¬ 
tween  our  measurement  and  the  reported  data  is  the  differ¬ 
ence  of  Ga  flux.  As  mentioned  above,  Ga  atoms  seem  to 
positively  influence  In  incorporation.  In  this  work,  FGa  is 
very  low  and  less  than  10%  of  that  in  one  of  the  reported 
works.12  If  Ga  atoms  having  almost  unit  sticking  coefficient 
at  our  growth  condition  play  a  certain  role  in  the  incorpora- 
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tion  of  In  atoms  into  an  In^Ga^As  layer,  the  change  of  ^Ga 
may  affect  a  of  In  atoms.  Also,  some  difference  in  the  cali¬ 
bration  of  growth  temperature  could  partly  explain  the  dis¬ 
crepancy  in  a.  Further  study  is  required  to  understand  the 
incorporation  mechanism  of  In  atoms  above  Tdes. 


C.  Selective  growth  of  In^Ga^As 

Based  on  the  incorporation  rate  measured  in  the  preced¬ 
ing  section,  selective  growth  of  In^Ga^As  on  a 
Si02-patterned  GaAs(OOl)  substrate  was  examined.  Accord¬ 
ing  to  Fig.  1,  the  incorporation  of  Ga  atoms  is  considerably 
reduced  with  a  few  degrees  difference  in  growth  temperature 
(e.g.,  between  591  and  595  °C).  Then,  from  Eq.  (4),  the 
addition  of  small  amount  of  Fln  to  FGa= 2.0 
X  1013  atoms /cm2  s  may  result  in  incomplete  selective 
growth  at  595  °C.  For  this  reason,  the  range  of  FIn  was  low¬ 
ered  to  0.7-6.0  X  1014  atoms  /cm2  s.  As  previously  men¬ 
tioned,  selective  growth  relying  on  desorption  can  be  af¬ 
fected  by  supplied  amount  or  growth  time.  In  the 
consideration  of  the  application  to  QW  growth  or  a  strain 
buffer,  selective  growth  was  examined  with  —300  nm  thick 
In^Ga^As  layers. 

Figure  5  shows  45°-tilted  SEM  images  of  the  In^Ga^As 
layers  grown  on  a  Si02-patterned  GaAs(100)  substrate  with 
Fln= 0.7,  4.2,  and  6.0  X  1014  atoms/cm2  s.  Also,  Fig.  6  shows 
DXRD  (004)  and  (115)  (glancing  incidence)  reflection  rock¬ 
ing  curves  obtained  from  the  samples  shown  in  Fig.  5.  In 
Fig.  6(a),  the  angular  separation  between  the  peaks  from 
GaAs  and  In^Ga^As  in  both  (004)  and  (115)  glancing 
incidence  rocking  curves  is  increased  with  FIn.  In  (004) 
[(115)  glancing  incidence]  reflection  rocking  curves,  the 
peak  separation  is  varied  from  180  to  746  arc  sec 
(324  to  1310  arc  sec)  as  Fln  is  changed  from  0.7  to  6.0 
X  1014  atoms /cm2  s.  This  means  that  In  composition  is  en¬ 
hanced  with  Fln  as  observed  in  the  PL  results  shown  in  Fig. 
3.  From  the  peak  separations,  the  corresponding  v  varying 
from  0.01  to  0.05  was  obtained.  Relaxation  is  also  increased 
from  19.2%  to  39.7%.20  According  to  a  theoretical  calcula¬ 
tion  based  on  the  Matthews  and  Blakeslee  model,  the  critical 
thickness  of  In^Ga^As  for  v=0.01  is  less  than  300  nm.18 
Thus,  partial  relaxation  of  the  In^Ga^As  layers  of  Fig.  5 
was  expected.  The  x-Fln  relation  obtained  from  DXRD  was 
plotted  in  Fig.  4  together  with  the  results  of  PL.  As  seen  in 
Fig.  4,  the  DXRD  results  are  consistent  with  the  PL  results  in 
measurement  of  v  and  the  incorporation  rate  of  In  atoms.  The 
measured  In  composition  and  relaxation  of  each  sample  were 
summarized  in  Table  I. 

The  surface  of  the  In^Ga^As  layers  shown  in  Fig.  5  is 
severely  degraded  except  for  that  of  v= 0.01.  The  surfaces  of 
both  samples  shown  in  Figs.  5(b)  and  5(c)  were  hazy  in 
visual  inspection  and  revealed  island-type  morphology  in 
SEM.  In  Fig.  5,  higher  v  results  in  a  more  severe  islanding 
surface  by  three-dimensional  growth  mode.  This  island 
growth  mode  is  partly  affected  by  low  growth  rate  employed 
in  this  work.  Compared  with  other  heterostructures  (e.g., 
Si^Ge^/Si),  the  surface  degradation  by  islanding  growth  at 


FIG.  5.  45°  tilted  SEM  images  of  In^Ga^As  grown  on  a  Si02-patterned 
GaAs  substrate  at  595  °C  with  FIn=0.7  (x=0.01)  (a),  4.2  (x=0.04)  (b),  and 
6.0  X  1014  atoms/cm2  s  (x=0.05)  (c).  A  dashed  line  indicates  the  boundary 
between  In^Ga^As  and  Si02. 

the  same  range  of  misfit  —0.0035  for  v=0.05  is  more  severe 
in  In^Ga^As/GaAs.21  This  could  be  due  to  high  growth 
temperature  above  Tdes. 22,23 

As  seen  in  Fig.  5,  negligible  deposition  occurs  on  a  Si02 
surface  for  v=0.04  while  noticeable  nucleation  was  observed 
on  a  Si02  mask  for  v=0.05.  Figure  7  presents  a  magnified 
SEM  image  of  the  sample  shown  in  Fig.  5(c).  As  seen  in  Fig. 
7,  In^Ga^As  islands  were  grown  on  the  Si02  surface 
but  their  physical  size  is  less  than  1  jmm  which  is  much 
smaller  than  those  on  the  GaAs  surface.  Therefore,  selective 
growth  of  In^Ga^As  for  ^Ga-2.0  X  1013  atoms/cm2  s  at 
595  °C  is  achieved  by  reducing  Fm  from  6.0  to  4.2 
X  1014  atoms /cm2  s.  This  also  agrees  with  Eq.  (4)  in  that 
selective  growth  occurs  by  reduction  of  F  for  a  given  T.  With 
corresponding  FIn?inc,  Fc  at  595  °C  is  increased  to  2.08 
X  1013  atoms /cm2  s  which  is  the  sum  of  FGa  and  FIn?inc.  But 
it  should  be  noted  that  more  than  4%  increment  of  F  results 
in  incomplete  selective  growth.  This  means  that  the  margins 
in  F  and  T  from  Fc  and  Ts  for  selective  growth  are  not  very 
much  (—4%  in  F  and  —4-5  °C  in  growth  temperature). 
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FIG.  6.  (004)  (a)  and  (115)  glancing  incidence  (b)  x-ray  rocking  curves  of 
the  samples  shown  in  Fig.  5. 


FIG.  8.  A  top  (a)  and  a  side  view  (b)  of  the  selectively  grown 
In^Ga^As  (x=0.07)  on  a  Si02-patterned  GaAs  substrate  at  595  °C  with 
FGa=  1.4  X  1013  atoms/cm2  s  and  Fln= 6.5  X  1014  atoms /cm2  s.  (c)  A  side- 
view  SEM  image  of  a  single  In^Ga^As  epi-island  indicated  by  a  dashed 
box  in  (b). 


To  increase  x  while  keeping  a  selective  growth  mode, 
one  possible  way  is  to  reduce  ^Ga  and  increase  Fm  with 
keeping  their  effective  sum  below  2.08  X  10 13  atoms /cm2  s. 
For  higher  x,  FGa  is  reduced  to  1.4  X  1013  atoms/cm2  s  with 
Fm=6.5  X  1014  atoms/cm2  s  of  which  the  effective  sum  is 
about  1.55  X  1013  atoms/cm2  s.  Figure  8  shows  a  top-  and  a 
side-view  SEM  image  of  the  sample  grown  under  these  con¬ 
ditions.  Figure  8(a)  was  taken  near  the  boundary  between  the 
In^Ga^As  layer  and  the  Si02  mask.  As  seen  in  this  figure, 
no  deposition  occurs  on  the  Si02  surface  and  selective 
growth  is  achieved  at  the  given  fluxes,  as  expected.  Figures 
9(a)  and  9(b)  show  a  (004)and  (115)  glancing  incidence  re¬ 
flection  x-ray  rocking  curve  of  the  300  nm  thick  In^Ga^As 


In^Ga^^As  Si02 

- — — - 


FIG.  7.  A  magnified  45°  tilted  SEM  image  of  the  sample  shown  in  Fig.  5(c). 


on  GaAs  and  a  77  K  PL  spectrum  of  a  5  nm  thick 
In^Ga^As/GaAs  QW,  respectively.  Both  In^Ga^As  layers 
in  these  samples  were  grown  under  the  same  growth  condi¬ 
tions.  The  angular  peak  separation  between  GaAs  and 
In^Ga^As  in  a  (004)  and  a  (115)  glancing  incidence  reflec¬ 
tion  rocking  curve  are  939  and  1613  arc  sec,  respectively.  In 
the  PL  spectrum,  like  those  in  Lig.  3(a),  three  PL  peaks  were 
observed.  The  PL  peak  energy  from  the  In^Ga^As/GaAs 
QW  corresponding  to  peak  1  of  Lig.  3  is  1.445  eV.  Thus 
selective  growth  is  achieved  with  v  increased  to  0.069  in 
DXRD  (with  relaxation  of  56%)  and  0.078  in  PL.  Such  an 
increment  of  v  is  consistent  with  the  results  shown  in  Lig.  4 
with  similar  a.  Enhancement  of  v  by  reduction  of  how¬ 
ever,  may  have  a  restriction  since,  as  previously  mentioned, 
Ga  atoms  have  a  certain  role  in  incorporation  of  In  atoms 
onto  GaAs. 

In  Lig.  8,  the  three-dimensional  shape  and  size  of  an 
epitaxially  grown  In^Ga^As  island  (epi-island)  is  clearly 
revealed.  While  the  lateral  dimension  is  about  3-7  /zm,  its 
height  in  Lig.  8(c)  is  about  2.5  /zm  which  is  considerably 
greater  than  the  deposition  amount  per  unit  area.  Also,  inter¬ 
island  distance  (center- to-center)  is  about  10  /zm.  The  over¬ 
all  shape  of  an  In^Ga^As  epi-island  seems  to  be  a  spherical 
section  without  faceting.  The  observation  of  (004)  and  (115) 
reflection  peaks,  however,  shows  that  this  rounded  shape 
In^Ga^As  epi-island  is  epitaxially  grown  on  a  GaAs  sur¬ 
face.  Lormation  of  islands  suggests  the  redistribution  of  Ga 
and  In  atoms  on  the  front  growth  surface  by  migration.  Ac¬ 
cording  to  the  measurement  of  surface  diffusion  length  of  Ga 
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FIG.  9.  (a)  (004)  (a)  and  (115)  glancing  incidence  x-ray  rocking  curves  of 
the  sample  shown  in  Fig.  8.  (b)  77  K  PL  spectrum  of  a  5  nm  thick 
In^Ga^As/GaAs  QW  of  which  the  In^Ga^As  layer  was  grown  at  the 
same  growth  condition  as  that  shown  in  Fig.  8. 


atoms  on  a  GaAs  surface,  Ga  atoms  can  migrate  a  distance 
greater  than  —3  jum  at  595  °C  for  FGa  — 2 
X  1014  atoms /cm2  s  which  is  almost  ten  times  the  flux  used 
in  this  work.24  Reduction  of  ^Ga  enhances  the  surface  diffu¬ 
sion  length  and  may  enable  Ga  atoms  to  migrate  the  interis- 
land  distance.  As  previously  mentioned,  the  islanding  mode 
shown  in  Figs.  5  and  8  is  unusual  at  the  given  range  of  misfit 
but  such  a  mode  is  feasible  as  a  partial  release  of  strain 
energy  due  to  lattice  mismatch  like  self-assembly  of  quantum 
dots.26 

Stress  release  in  lattice  mismatched  heterostructures  is 
one  of  the  important  issues  in  selective  growth.  As  seen  in 
Fig.  8,  the  surface  morphology  of  the  selectively  grown 
In0  07Ga0  93As  layer  which  was  severely  degraded  is  critical 
for  device  applications.  The  size  of  the  In^Ga^As  epi-island 
is  in  micrometer  scale.  If  a  nanoscale  patterning  of  which  the 
period  is  much  smaller  than  the  lateral  dimension  of  the  epi- 
island  is  applied  for  the  growth  of  such  an  In^Ga^As  island, 
it  significantly  improves  surface  morphology  and  crystallin¬ 
ity  by  relieving  strain  without  relying  on  the  generation  of 
misfit  dislocations.27,28  Detailed  experimental  results  of 
strain-relieved  dislocation-free  In^Ga^As  selectively  grown 
on  a  nanoscale  Si02-pattemed  GaAs  will  be  reported  else¬ 
where. 

In  this  work,  selective  growth  was  demonstrated  up  to  a 
300  nm  thick  layer  with  v  —  0.07.  Thus,  it  is  possible  to  grow 
a  very  thin  layer  such  as  a  QW  layer  with  selective  growth 


mode.  Assisted  by  surface  migration,3  selective  growth  can 
be  achieved  at  lower  temperature  or  higher  In  composition 
with  better  crystallinity  and  surface  morphology. 

V.  SUMMARY  AND  CONCLUSIONS 

Heteroepitaxial  selective  growth  of  In^Ga^As  on 
Si02-patterned  GaAs  by  molecular  beam  epitaxy  (MBE)  has 
been  reported.  A  simple  model  for  selective  growth  has  been 
derived  from  the  general  rate  equation  for  thin  film  growth. 
Based  on  this  model,  selective  growth  of  In^Ga^As  has 
been  achieved  on  a  Si02-patterned  GaAs(001)  substrate  with 
variation  of  growth  temperature  and  Ga  and  In  fluxes. 
Growth  temperature  is  set  to  595  °  C  which  is  in  the  range  of 
typical  growth  temperature  of  GaAs  in  MBE  but  is  above  the 
desorption  temperature  of  In  atoms  from  GaAs  surface.  At 
this  temperature,  the  incorporation  rate  of  In  atoms  is  about 
0.22%.  The  incorporation  rate  which  is  extremely  low  but 
finite  is  utilized  for  the  growth  of  5  nm  thick 
In^Ga^As/GaAs  quantum  wells  and  —300  nm  thick 
In^Ga^As  layers.  The  amount  of  In  atoms  effectively  incor¬ 
porated  into  an  In^Ga^As  layer  is  linearly  proportional  to 
the  supplied  In  flux  in  the  range  of  0.7 -9.2 
X  1014  atoms /cm2  s.  The  QWs  show  a  systematic  variation 
of  PL  peak  energy  and  intensity  depending  on  In  flux.  The 
high-temperature  grown  In^Ga^As  has  a  severely  degraded 
surface  morphology  due  to  islanding  growth  mode  but 
clearly  shows  selective  growth  on  a  Si02-patterned  GaAs 
substrate.  The  experimental  results  are  consistent  with  the 
model  for  the  relation  between  growth  temperature  and  in¬ 
coming  flux.  In  future  work,  we  will  report  selective  growth 
of  high  In  composition  In^Ga^As  on  GaAs  in  the  nanoscale 
regime  to  investigate  the  effect  of  a  finite  epitaxial  area  on 
the  stress  distribution  in  large  lattice-mismatched  hetero¬ 
structures. 
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Lattice  mismatch  is  an  inherent  problem  in  heteroepi- 
taxial  growth.  As  a  solution  for  this  classic  conundrum,  the¬ 
oretical  models  have  proposed  the  possibility  of  strain  relief 
by  decreasing  the  initiating  epitaxial  areas  to  the  nanoscale 
regime.1-4  Among  these,  the  Luryi-Suhir  model1  derives  an 
effective  critical  thickness  depending  on  the  characteristic 
linear  dimension  of  individual  epitaxial  areas.  According  to 
this  continuum  mechanics  model,  the  effective  critical  thick¬ 
ness  becomes  infinite  as  the  linear  dimension  of  individual 
nanoscale  areas  is  reduced  to  the  point  where  the  stress  is 
relieved  before  sufficient  energy  is  accumulated  to  create  a 
defect  such  as  a  misfit  dislocation.  Thus,  there  is  no  thickness 
limit  to  the  growth  of  dislocation-free  heterostructures  for 
sufficiently  small  nanoscale  epitaxial  areas.  Recently,  we  in¬ 
troduced  nanoscale-patterned  growth  (NPG)  as  a  frontier  of 
epitaxial  growth  for  semiconductor  nanostructures.5,6 
In  this  work,  we  apply  NPG  to  the  growth  of 
InJCGa1_JCAs/GaAs(001)  and  examine  the  strain  relief  for  a 
lattice-mismatched  hetero structure  selectively  grown  and  lat¬ 
erally  coalesced  on  a  nanoscale  Si02-patterned  substrate  by 
molecular  beam  epitaxy  (MBE). 

Figure  1  shows  45°-tilted  scanning  electron  microscope 
(SEM)  images  of  the  sample  at  various  stages  of  the  process. 
The  inset  to  each  image  is  a  schematic  illustration  of  the 
corresponding  process  step  viewed  in  cross  section.  First,  as 
shown  in  Fig.  1(a),  a  two-dimensional  (2D),  periodic,  nanos¬ 
cale  circular-hole  pattern  was  fabricated  in  a  45-nm-thick 
Si02  film  that  had  been  deposited  on  a  semi-insulating 
GaAs(001)  substrate.  The  patterning  was  performed  by 
large-area  interferometric  lithography  and  dry  etching.5,7  The 
pattern  period  is  355  nm  and  the  hole  diameter  is  about 
300  nm. 

On  this  Si02-patterned  substrate,  a  50-nm-thick  GaAs 
buffer  layer  was  selectively  deposited  at  595  °C  by  MBE. 
The  resulting  surface  is  shown  in  Fig.  1(b).  For  this  50  nm 
deposition,  the  GaAs  epilayer  selectively  grown  within  each 
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hole  (GaAs  epi-island)  does  not  show  clear  faceting  but  has  a 
roughly  triangular  cross  section  as  seen  below. 

On  the  surface  shown  in  Fig.  1(b),  a  200-nm-thick 
In^Ga^As  layer  was  then  selectively  deposited  at  the  same 
growth  temperature  by  MBE.  The  deposition  conditions  for 
selective  growth  of  In^Ga^As/  GaAs (001)  have  been  re¬ 
ported  elsewhere.8  At  this  high  temperature,  the  incorpora¬ 
tion  rate  of  In  is  very  low  (—0.2%);  measurements  reported 
below  established  the  In  composition  of  the  film  at  v  — 0.06. 
While  the  starting  GaAs  buffer  surface  of  Fig.  1(b)  consisted 
of  a  periodic  array  of  discontinuous  nonplanar  growth  seeds, 
the  surface  of  the  as-grown  In^Ga^As  is  flat  and  smooth 
with  only  a  few  incompletely  planarized  areas  that  corre¬ 
spond  to  imperfections/contamination  in  the  original  pattern 
[Fig.  2(c)].  The  surface  morphology  of  this  InxGaj_xAs  layer 
is  significantly  improved  as  compared  with  comparable 
thickness  In^Ga^As  layers  grown  on  unpatterned  wide-area 
GaAs(001)  substrates  under  the  same  growth  conditions, 
which  are  severely  degraded  by  islanding  growth.8 

Figure  2  shows  the  details  of  the  In^Ga^As  layer.  Fig¬ 
ure  2(a)  is  a  cross  sectional  transmission  electron  micrograph 
(TEM)  of  the  as-grown  sample.  The  inset  of  Fig.  2(a)  is  a 
material  map  obtained  from  this  TEM  image.  As  seen  again 
in  this  figure,  the  In^Ga^As  layer  has  a  flat  surface  that 
completely  planarizes  the  GaAs  epi-island  buffer  and  the 
Si02  mask.  Figure  2(b)  is  a  cross  sectional  scanning  TEM 
image  of  a  single  period  shown  in  Fig.  2(a).  A  slight  contrast 
change  indicates  the  boundary  between  the  GaAs  buffer  and 
the  In^Ga^As  layer  and  demonstrates  that  the 
In^Ga^As/GaAs  heterointerface  is  confined  to  the  original 
circular  holes  in  the  Si02  mask.  Figure  3  is  a  high  resolution 
cross  sectional  TEM  image  corresponding  to  the  white  box 
of  Fig.  2(a).  As  seen  in  Fig.  3,  the  In^Ga^As  layer  initiated 
from  the  GaAs  buffer  surface  grows  laterally  over  the  Si02 
mask.  The  plan- view  TEM  image  of  Fig.  2(c)  shows  an  out¬ 
line  of  the  circular  GaAs  epi-island  buffer  bounded  by  the 
Si02  mask  which  is  buried  under  the  In^Ga^As  layer  after 
coalescence  and  planarization.  Figures  1(c)  and  2  demon¬ 
strate  the  achievement  of  epitaxial  lateral  growth  of  an 
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FIG.  1.  45°-tilted  SEM  images  of  (a)  a  355  nm  period  2D  array  of  circular 
holes  fabricated  in  a  45-nm- thick  Si02  him,  (b)  the  surface  of  the  sample 
shown  in  (a)  after  selective  deposition  of  a  50  nm  GaAs  layer,  and  (c) 
planarized  surface  of  an  as-grown  sample  after  200  nm  deposition  of 
In^Ga^As  on  the  surface  shown  in  (b).  The  inset  of  each  figure  is  a  sche¬ 
matic  illustration  of  the  corresponding  process  step. 


In^Ga^As/GaAs  heterostructure  over  a  Si02  mask  by 
MBE. 

The  thickness  of  the  In^Ga^As  layer  (—200  nm)  is  sig¬ 
nificantly  greater  than  the  critical  thickness  of  an  In^Ga^As 
layer  of  the  same  In  composition  grown  on  unpatterned 
large-area  GaAs(OOl).9  As  seen  in  Fig.  2(c),  however,  no 
dislocations  were  observed  in  the  plan-view  TEM  over  an 
area  of  about  20  X  10  /mi2.  Also,  no  other  crystal  defects  are 
observed  in  the  In^Ga^As  layer  of  Fig.  2(a).  Additional 
experiments  providing  a  more  accurate  measure  of  the  dislo¬ 
cation  density  are  presently  under  way;  nonetheless,  these 
results  imply  that  the  200-nm- thick  In^Ga^As  layer  grown 
on  the  nanoscale  Si02-patterned  GaAs(001)  substrate  has  a 
dislocation  density  too  low  (<106  cm-2)  to  estimate  in  TEM 
and  thus  is  substantially  free  from  misfit  dislocations. 

Double-crystal  x-ray  diffractometry  (DXRD)  provides 
critical  information  on  strain  relaxation  in  this  heterostruc¬ 
ture.  From  the  (004)  symmetric  and  (115)  asymmetric  reflec¬ 
tion  rocking  curves  shown  in  Fig.  4(a),  the  In  composition,  x, 
and  the  relaxation,  R ,  can  be  calculated.  The  separations  be¬ 
tween  the  In^Ga^As  peak  and  the  GaAs  peak  are  614.3  and 

979.8  arc  sec,  respectively.  From  peak  separation  analysis, 
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FIG.  2.  (a)  A  cross  sectional  TEM  image  of  the  In^Ga^As  layer  taken 
along  the  direction  parallel  to  the  dashed  line  in  Fig.  1(c).  The  inset  shows 
the  material  map  obtained  from  this  image,  (b)  A  cross  sectional  scanning 
TEM  image  of  a  single  period,  (c)  A  plan- view  TEM  image  of  the  as-grown 
sample. 

an  In  composition,  x,  of  0.06  (Misfit  /—  0.0042)  is  obtained. 
The  relaxation,  R ,  which  is  defined  as  a  ratio  of  the  differ¬ 
ence  between  the  in-plane  lattice  constant  of  the  In^Ga^As 
layer,  <2||jnGaAs>  and  the  lattice  constant  of  the  GaAs  substrate, 
^GaAs’  t0  the  difference  between  the  lattice  constant  of  an 
unstrained  In^Ga^As  layer,  <2inGaAs>  and  aGaAs  is  0-90.8  Thus 
«n,inGaAs  is  very  close  to  tfInGaAs  and  the  In^Ga^As  layer 
selectively  grown  on  the  nanoscale  Si02-patterned 
GaAs(001)  is  substantially  strain  relaxed. 

In  TEM,  as  mentioned  above,  the  dislocation  density  of 
the  In^Ga^As  layer  is  significantly  lowered  as  compared 
with  that  of  a  relaxed  heterostructure  which  has  a  similar 
thickness  and  misfit.10  From  DXRD,  however,  the 
In0  06Ga0  94As  layer  is  almost  fully  relaxed.  From  these  ex¬ 
perimental  results,  it  can  be  concluded  that  the  In0  06Ga0  94As 
layer  grown  on  the  nanoscale  Si02-patterned  GaAs(001) 

substrate  is  dislocation  free  but,  is  also  strain  relieved. 

to  AIP  license  or  copyright,  see  http://apl.aip.org/apl/copyright.jsp 
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FIG.  3.  A  high  resolution  TEM  image  of  the  region  indicated  by  a  white  box 
in  Fig.  2(a). 

By  growing  the  GaAs  buffer,  the  structure  of  our  as- 
grown  sample  becomes  analogous  to  that  of  Luryi-Suhir 
model.  The  model  thickness  beyond  which  the  stress  expo¬ 
nentially  decays  for  the  given  hole  diameter  (—300  nm)  and 
In0  06Ga0  94As  is  —50  nm.  This  is  considerably  smaller  than 
the  In0  06Ga0  94As  thickness  of  our  film.  Although  this  com¬ 
parison  provides  qualitative  justification  that  nanoscale  pat¬ 
terning  contributes  to  the  observed  strain  relief,  our  sample 
structure  has  several  differences  from  that  suggested  in  the 
model.  As  seen  in  Figs.  1(b)  and  2(b),  the  convex  shape  of 
the  GaAs  epi-island  buffer  results  in  an  expansion  of  the 
effective  area  of  the  GaAs  buffer/In0  06Ga0  94As  heterointer¬ 
face  within  each  circular  hole.  Also,  as  seen  in  the  inset  of 
Fig.  2(a),  this  further  leads  to  the  modulation  of  the 
In0  06Ga0  94As  thickness  across  each  pattern  unit  cell.  These 
effects  could  partly  contribute  to  strain  relief  of  the 
In0  06Ga0  94As  layer  observed  in  DXRD. 

There  have  been  several  studies  of  strain  relaxation  of 
limited-area,  lattice-mismatched  hetero  structure 

growth.2-4,11,12  While  the  Luryi-Suhir  model  focuses  on  the 
variation  of  the  biaxial  strain  as  the  growth  thickness  in¬ 
creases  on  a  periodically  patterned  substrate,  other  models 
concentrate  on  the  variation  of  the  stress  across  the  growth 
area  by  considering  edge  relaxation.2,4,12  According  to  all  of 
these  models,  the  isolated  In^Ga^As  epilayer  grown  on 
each  GaAs  epi-island  buffer  (In^Ga^As  nano-island)  of  Fig. 
1(b)  undergoes  stress  relaxation  leading  to  an  exponential 
decay  along  the  entire  surface.  If  this  stress  relaxation  is 
completed  before  coalescence  of  the  individual  nano-islands, 
the  resulting  film  could  be  also  strain  relieved  after  coales¬ 
cence.  The  stress  decay  length,  which  is  a  characteristic 
length  effective  only  in  nanoscale  patterned  growth,  un¬ 
doubtedly  depends  on  the  geometrical  shape  and  orientation 
of  each  nano-island  as  well  as  its  elastic  properties  and  the 
lattice  mismatch.  The  stress  decay  length  of  —50  nm  for  the 
In006Ga094As  on  our  nanopatterned  GaAs(001)  estimated 
from  the  Luryi-Suhir  model  is  comparable  to  the  growth 
amount  of  the  In0  06Ga0  94As  required  for  coalescence  which 
is  50-100  nm  deposition  under  the  present  growth  condi¬ 
tions.  This  implies  that  the  strain  relief  before  coalescence  is 
likely  a  major  relaxation  process  for  this  growth. 

Our  experimental  results  include  coalescence  effects  that 
have  not  yet  been  theoretically  evaluated  in  detail.  Coales¬ 
cence  between  fully  or  partially  relaxed  nano-islands  and  as¬ 
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FIG.  4.  DXRD  rocking  curves  from  the  as-grown  sample  in  (004)  symmet¬ 
ric  and  (115)  asymmetric  reflections. 

sociated  quenching/healing  of  crystal  imperfections  as  two 
(or  more)  nano-islands  conjoin  is  a  complex  process  deserv¬ 
ing  of  significant  further  investigation.  Nonetheless,  it  is  ex¬ 
perimentally  evident  that  nanoscale  patterning  plays  an  im¬ 
portant  role  in  the  strain  relief  of  a  lattice-mismatched 
hetero  structure  resulting  in  a  very  significant  reduction  in  the 
density  of  misfit  dislocations. 

We  have  demonstrated  MBE  growth  of  strain-relieved 
dislocation-free  In006Ga094As  on  nanoscale  Si02-patterned 
GaAs(001)  with  consistent  material  characterizations.  The 
experimental  results  are  qualitatively  consistent  with  the 
Luryi-Suhir  model.  Once  coalescence  and  planarization  are 
achieved  over  a  Si02  mask  in  In^Ga^As  deposition,  the 
growth  temperature  and  growth  rate  can  be  adjusted  to  the 
optimal  conditions  for  the  growth  of  a  device  structure  to 
suppress  contamination  issues  and  to  improve  the  quality  of 
the  epilayers.  These  conditions  are  typically  very  different 
from  those  for  selective  growth.  Growth  of  InAs  quantum 
dots  on  such  a  strain-relieved  dislocation-free  In^Ga^As 
layer  is  a  clear  future  research  direction,  aimed  at  long  wave¬ 
length  emission  on  GaAs.  Additionally,  residual  strain  and  its 
dependence  on  the  In^Ga^As  thickness  and  on  pattern  pa¬ 
rameters  such  as  period,  hole  size,  pattern  symmetry,  etc.,  as 
well  as  on  thermal  expansion  mismatch,  requires  further  in¬ 
vestigation. 

Support  for  this  work  was  provided  by  the  DOE  and  by 
DARPA. 
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A  fabrication  technique  for  nanoscale  two-dimensional  (2D)  patterning  by  large-area  interferometric 
lithography  (IL)  and  anisotropic  wet  etching  (AWE)  is  reported.  On  a  Si(001)  substrate  corrugated 
by  one-dimensional  (ID)  periodic  V-grooves  in  a  first  IL/AWE  step,  an  additional  ID  IL/AWE 
patterning  perpendicular  to  the  groove  direction  is  performed.  In  this  process,  {111}  facets — the 
sidewalls  of  a  V-groove  having  an  extremely  low  etch  rate  in  AWE — are  utilized  as  an  etch  barrier 
to  confine  AWE  to  a  nanoscale- wide  (001)  stripe  facet  between  V-grooves.  IL  on  the  corrugated  Si 
surface  results  in  an  array  of  photoresist  (PR)  lines  localized  on  each  (001)  stripe.  The  subsequent 
AWE  with  a  Cr  etch  mask  fabricated  from  this  unusual  PR  pattern  thus  begins  from  the  inner  side 
of  (001)  stripe  but  slows  down  as  it  reaches  the  (001)-{1 1 1}  facet  intersection  of  the  V-groove.  The 
{111}  planes  at  each  facet  intersection  play  the  important  role  of  an  etch  barrier  laterally  confining 
the  etch  area  within  each  (001)  stripe.  The  direction  along  the  stripe  is  also  confined  with  the 
formation  of  2D  {111}  inverted  pyramids.  A  355  nm  period,  2D  array  of  Si  mesas  having  a 
rectangular  top  (001)  facet  of  area  —150  nmXllO  nm  and  surrounded  by  various  (111)  facets  in  a 
unit  period  is  fabricated  on  a  Si(001)  substrate.  ©  2004  American  Vacuum  Society. 
[DOI:  10.1116/1.1771663] 


I.  INTRODUCTION 

Recent  progress  in  nanoscale  semiconductor  technology 
shows  potential  for  realizing  the  novel  characteristics  and 
high  performance  required  for  the  next-generation  of 
electronic/optoelectronic  devices.  By  scaling  feature  sizes  to 
the  nm-range  in  patterned  growth  of  semiconductor  thin 
films,  we  have  demonstrated  various  growth  phenomena 
which  are  not  available  with  jam-  or  larger-scale  pat¬ 
terning.1-3  One  such  phenomenon  is  orientation-dependent 
migration  and  incorporation  which  utilizes  surface  migration 
of  atoms  incident  on  a  nanoscale  faceted  (nanofaceted)  sur¬ 
face  and  crystal  orientation-dependent  adatom  incorporation. 
Reducing  the  lateral  dimension  of  a  facet  to  less  than  the 
surface  migration  length  of  an  adatom  allows  new  regimes  of 
crystal  growth  that  will  lead  to  new  device  capabilities. 

In  our  previous  work,  we  reported  a  deposition  technique 
for  the  fabrication  of  nanofaceted  surfaces  (nanofaceting).1,2 
Nanofaceting  can  be  used  for  the  fabrication  of  one-  to  three- 
dimensional  (ID  to  3D)  periodic  structures  or  particular 
shaped  features  required  for  nano-  and  pt m-scale  Si  electron¬ 
ics  and  machining.4,5  Anisotropic  wet  etching  (AWE)  is  a 
reliable  and  cost-effective  alternative  for  nanofaceting.4-8  In 
this  work,  we  present  a  fabrication  technique  using  a  {111} 
facet  which  has  a  significantly  lower  etch  rate  than  the  (001) 
orientation  in  AWE,  as  an  etch  barrier.  The  first  step  is  for¬ 
mation  of  a  nanoscale  ID  pattern  by  large  area  interferomet¬ 
ric  lithography  (IL)  and  etching  to  form  a  traditional 
V-groove  pattern  revealing  alternate  (100)  flats  and  {111} 
V-grooves.  After  a  second  orthogonal  IL  step,  AWE  is  con¬ 
fined  to  a  nanoscale- wide  (001)  stripe  facet  that  is  already 
bounded  by  neighboring  {111}  facets  in  one  direction.  This 
effectively  results  in  the  confinement  of  AWE  to  the  forma¬ 


tion  of  inverted  {111}  pyramids  inside  the  etch  area  and  re¬ 
sults  in  a  2D  pattern  restricted  to  the  (001)  stripe.  This  is  the 
basic  motivation  of  this  work.  With  this  technique,  we  have 
achieved  a  355  nm  period  2D  array  of  Si  mesas  with  a  rect¬ 
angular  top  (001)  facet  having  an  area  —150  nmXllO  nm 
surrounded  by  various  (111)  type  facets  in  a  unit  period.  To 
our  knowledge,  this  is  the  smallest  period  2D  pattern  ever 
made  by  AWE. 

In  Sec.  II,  ID  PR  patterning  on  a  corrugated  Si(001)  sur¬ 
face  by  IL  is  described.  This  is  a  critical  step  for  the  confine¬ 
ment  of  AWE  to  each  (001)  stripe.  IL  on  a  nanoscale  non- 
planar  surface  is  very  different  from  that  on  a  wide-area 
planar  surface  and  is  utilized  for  the  localized  photoresist 
(PR)  patterning  on  a  (001)  stripe.  The  intensity  distribution 
of  UV  light  on  a  corrugated  Si  surface  leading  to  this  unusual 
PR  pattern  is  discussed.  In  Sec.  Ill,  the  fabrication  of  a  Cr 
etch  mask  from  the  PR  pattern  is  described.  In  Sec.  IV,  the 
results  of  AWE  performed  on  the  periodic  V-grooved  Si(001) 
with  the  Cr  etch  mask  is  presented.  Finally,  we  summarize  ah 
of  the  results  in  Sec.  V. 

II.  PR  PATTERNING  ON  PERIODIC  V-GROOVED 
Si(001) 

Figure  1  shows  a  45°-tilted  scanning  electron  microscopy 
(SEM)  image  of  a  355  nm  period  ID  PR  pattern  generated 
atop  a  periodically  V-grooved  Si  surface  by  large-area  IL 
which  is  a  low-cost  technology  for  sub-  100  nm-scales.9  A 
355  nm  wavelength,  frequency-tripled  YAG  laser  was  used 
as  an  ultraviolet  (UV)  light  source.  A  boron-doped  Si(001) 
substrate  having  a  resistivity  of  about  8  to  15  12  cm  was 
used.  Aqueous  KOH,  which  exhibits  strongly  orientation  de¬ 
pendent  etching  on  a  Si  surface,  was  employed.6  Each 
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Fig.  1.  (a)  A  45°-tilted  SEM  image  of  a  ID  stripe  PR  pattern  generated  on  a 
periodic  V-grooved  Si  surface,  (b)  Magnification  of  (a).  The  discontinuity  of 
the  PR  stripes  near  the  (001)-{1 1 1}  facet  intersections  is  evident. 


V-groove  consists  of  (111)  and  (111)  sidewalls  oriented 
along  a  [110]  direction.  These  grooves  were  fabricated  by  IL 
and  AWE  using  a  Cr  etch  mask.  The  width  of  the  (001) 
stripes  between  V-grooves  is  about  130-170  nm.  The  direc¬ 
tion  of  the  PR  lines  was  chosen  to  be  perpendicular  to  that  of 
the  prepattemed  (001)  stripes.  The  PR  was  Shipley  5206E 
diluted  to  provide  a  thickness  of  about  200  nm. 

The  PR  pattern  of  Fig.  1(a)  is  very  different  from  that  on 
a  planar  surface.  There  are  voids  in  the  PR  lines  at  every 
(001)-{1 1 1}  intersection.  Here,  a  (001)-{111}  intersection 
means  the  line  along  which  each  (001)  stripe  facet  meets  the 
neighboring  {111}  facets;  each  period  has  two  such  intersec¬ 
tions.  Figure  1(b)  clearly  reveals  the  discontinuity  of  the  PR 
stripe  near  the  (001)-{1 1 1}  intersection.  Thus,  the  PR  film 
separately  remains  on  top  of  each  (001)  facet  and  inside  each 
V-groove. 

The  cross  section  SEM  image  of  Fig.  2(a)  shows  the  pho¬ 
toresist  film  before  the  development  step;  both  the  top  (001) 
faces  and  the  V-grooves  are  filled  with  PR.  The  PR  thickness 
near  the  (001)-{1 1 1}  intersections  is  comparable  to  that  on 
the  (001)  stripes.  As  seen  in  Fig.  2(b),  development  mainly 
removes  the  PR  at  the  intersections.  Thus,  the  unusual  dis¬ 
continuity  in  the  PR  lines  of  Fig.  1  is  not  due  to  the  thickness 
variation  across  the  V-groove. 

Generally,  IF  has  been  performed  on  a  planar  surface  and 


(a) _ pp 


Si  substrate 


Fig.  2.  Side  view  SEM  images  of  (a)  a  PR  film  (a)  before  and  (b)  after 
development  (Fig.  1). 


the  intensity  distribution  of  UV  light  within  a  PR  film  is 
readily  calculated.10  The  major  factors  that  determine  the  UV 
intensity  are  the  incident  power  and  polarization,  the  incident 
angle,  and  the  reflection  from  the  PR- substrate  interface.  If 
IF  is  applied  to  a  nonplanar  surface,  many  of  these  factors 
are  affected  by  the  surface  topography.  The  PR  persistently 
remaining  within  each  V-groove  in  Fig.  3(c)  can  be  the  result 
of  a  lower  UV  intensity  in  this  region  than  on  a  (001)  stripe. 
If  UV  incident  angle  is  set  to  30°  on  a  (001)  plane  for  a  355 
nm  period,  it  becomes  52.1°  on  a  tilted  (111)  facet  in  our  IF 
apparatus.  Thus,  the  effective  UV  dose  supplied  on  (111) 
plane  is  lower  than  that  on  a  (001)  plane  and  the  inside  of  a 
V-groove  is  relatively  under-exposed.  At  PR-Si  interface 
around  a  (001) -{111}  intersection,  the  UV  radiation  is  scat¬ 
tered  rather  than  reflected.  This  means  that  UV  reflection 
near  this  area  is  very  different  from  that  of  the  remaining 
region  in  a  single  period.  Therefore,  the  disconnected  PR 
lines  of  Fig.  1  could  be  due  to  the  nonuniform  dose  arising 
from  the  different  reflectivity  of  the  UV  beam  on  each  facet 
and  facet  intersection  in  this  complex  IF  geometry.  As  seen 
in  Sec.  IV,  this  discontinuity  enables  nanoscale  2D  patterning 
localized  on  each  (001)  stripe. 

III.  PREPARATION  OF  A  Cr  ETCH  MASK 

Figure  3(a)  shows  a  top  view  SEM  image  of  the  PR  pat¬ 
tern  similar  to  Fig.  1(a).  In  Fig.  3(a),  the  PR  on  each  (001) 
stripe  is  close  to  a  linear  array  of  ellipses.  These  PR  features 
generate  openings  localized  on  a  (001)  stripe  in  a  metal 
deposition  and  lift-off  process.  On  the  PR  pattern  of  Fig. 
3(a),  a  10-nm- thick  Cr  film  which  will  be  used  as  an  etch 
mask  in  the  subsequent  AWE  was  deposited  by  electron 
beam  evaporation  at  a  deposition  rate  of  0.2  nm/s. 

Figure  3(b)  shows  a  top- view  SEM  image  of  a  Cr  etch 
mask  generated  on  the  periodic  V-grooved  Si(001)  after  PR 
lift-off.  As  indicated  by  the  arrows,  two  different  openings 
through  which  AWE  can  occur  were  generated  in  the  Cr  film: 
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^  Opening  A  Opening  B 
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Fig.  3.  Top  view  SEM  images  of:  (a)  the  PR  pattern  shown  in  Fig.  1;  (b)  a 
10-nm-thick  Cr  etch  mask  after  PR  lift-off;  and  (c)  the  Si  surface  after  a 
50  s  AWE.  In  (b),  two  different  types  of  openings  in  the  Cr  etch  mask  are 
indicated:  A  [on  the  top  (100)  facet];  and  B  [atop  the  {111}  sidewalls].  In  (c), 
the  dashed  box  represents  the  unit  cell  of  the  2D  pattern. 


one  (labeled  A)  is  on  each  (001)  stripe  and  the  other  (labeled 
B)  is  at  the  valley  of  each  V-groove.  The  opening  A  has  very 
different  size  roughly  ranging  from  70  to  150  nm. 

IV.  ANISOTROPIC  WET  ETCHING  (AWE) 

For  AWE,  25  wt.  %  aqueous  KOH  solution  at  60±5°C 
was  used.  A  KOH-based  etchant  has  a  high  etch  selectivity 
for  (001)  [anisotropy  ratio  of  (001)  to  (111)^400  at  80  °C].6 
The  etch  rate  calibrated  on  a  wide-area  Si(001)  surface  for 
our  conditions  was  about  7±1  nm/s.  For  AWE,  the  sample 
shown  in  Fig.  3(b)  was  dipped  in  the  etchant  for  50  s. 


(a)  tUTo]  (c)  f[iio] 


Fig.  4.  (a)  A  top  view  and  (b)  a  side  view  SEM  image  of  the  50  s  etched 
sample,  (c)  A  45°-tilted  and  (d)  a  side  view  SEM  image  of  the  same  sample. 
Sample  orientations  are  indicated  with  various  facet  intersection  lines.  See 
the  text  for  details,  (e)  Schematic  illustration  of  a  single  Si  mesa  correspond¬ 
ing  to  the  dashed  box  in  (c)  with  the  facet  identification. 


Figure  3(c)  shows  a  top  view  image  of  the  etched  Si  sur¬ 
face  after  removing  the  Cr  film.  In  Fig.  3(b),  as  mentioned 
above,  there  are  two  classes  of  openings  in  the  Cr  mask 
where  the  etchant  can  access  the  Si  surface.  As  seen  in  Fig. 
3(c),  however,  AWE  proceeded  only  through  the  “A”  open¬ 
ings  on  the  (001)  stripes.  This  is  because  the  “B”  openings 
are  atop  the  very  low  etch-rate  (111)  V-groove  facets.  Thus, 
the  disconnected  PR  pattern  of  Fig.  1  results  in  selective 
etching  of  the  (001)  stripe  regions. 

Generation  of  facets  inside  of  the  etch  area  is  observed  in 
Fig.  3(c).  Figure  4  shows  magnified  images  that  were  taken 
from  four  different  directions.  Figures  4(a)  and  4(b)  were 
taken  along  a  [110]  direction,  whereas  Figs.  4(c)  and  4(d) 
were  obtained  along  a  [110]  direction.  Figures  4(b)  and  4(d) 
are  both  side  view,  Figs.  4(a)  and  4(c)  are  a  top  view  and  a 
45°-tilted  SEM  image  of  the  same  sample,  respectively.  Fig- 
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ures  4(b) -4(d)  clearly  reveal  the  formation  of  facets  inside 
of  the  etch  area.  All  the  facets  inside  the  etch  area  generated 
by  AWE  are  close  to  {111}.  Thus,  the  3D  shape  of  the  etch 
area  on  the  (001)  stripe  in  Fig.  4(a)  [or  Fig.  3(c)]  is  an  in¬ 
verted  pyramid.  Also^AWE  generates  several  Jacet  intersec¬ 
tions  consisting  of  (lll^  and  (111)  facets,  (11)  and  (111) 
facets,  and  (001)  and  (111)  facets  labeled  a,  (3,  and  y,  re¬ 
spectively.  Here,  a  which  originally  corresponded  to  a  (001)- 
(111)  intersection  (a0)  is  changed  to  a  (lll)-(lll)  intersec¬ 
tion  after  etching.  The  dashed  box  of  Fig.  3(c)  represents  a 
unit  period  of  the  2D  pattern.  A  Si  mesa  with  a  rectangular 
top  (001)  facet  having  a  size  of  about  150±20X110±10  nm2 
is  surrounded  by  different  {111}  facets  within  each  period. 
Figure  4(e)  shows  a  schematic  illustration  of  a  single  Si  mesa 
corresponding  to  the  dashed  box  in  Fig.  3(c)  along  with  the 
facet  identification.  In  the  nanoscale  range,  therefore,  AWE 
is  effectively  terminated  revealing  four  different  internal 
{111}  facets  for  a  50  s  etch.  This  facet  geometry  also  restricts 
AWE  longitudinally  along  a  (001)  stripe  and  leads  to  a  con¬ 
fined  2D  patterning. 

Generally,  AWE  inside  the  etch  area  should  be  terminated 
once  it  completely  reveals  the  four  different  {111}  facets.  As 
seen  in  Fig.  4(b),  however,  the  progression  of  the  interface 
position  a  which  was  originally  at  the  starting  (001)-(111) 
intersection  (a0)  to  a  new  position  further  along  the  (111) 
face  after _AWE  implies  that  the  etching  does  not  perfectly 
stop  at  a  (lll)-(lll)  intersection  but  removes  the  {111}  facet 
with  a  finite  etch  rate.  Then,  the  etch  rate  of  the  (111)  facet 
deduced  from  the  progression  of  a  is  about  2  nm/s  if  a  is 
taken  as  the  distance  of  —80  nm  from  a0  to  its  current  loca¬ 
tion  in  Fig.  4(b)  in  —50  s.  This  is  considerably  slower  than 
the  calibrated  etch  rate  of  —7  nm/s  on  a  (001)  plane  under 
the  same  conditions.  Even  with  this  finite  etch  rate,  therefore, 
a  (111)  V-groove  facet  plays  the  role  of  an  etch  barrier  in 
AWE. 

The  —2  nm/s  etch  rate  of  the  {111}  face  in  the  KOH 
solution  is  much  higher  than  the  value  predicted  from  the 
reported  large-area  anisotropy  ratio.5  In  Fig.  4,  AWE  forms 
another  unusual  structure,  the  undercutting  of  the  interfaces 
perpendicular  to  the  original  stripes  to  form  a  retrograde 
(lll)-(lll)  interface,  denoted  by  f3.  The  shape  of  the  Si 
mesa  shown  in  Figs.  4(b)  and  4(d)  suggests  that  the  forma¬ 
tion  of  f3  is  due  to  the  lower  etch  rate  of  the  (001)-(1 11) 
interface,  y,  from  which  the  current  intersection  evolves  [see 
Fig.  4(e)]  than  that  of  a.  This  means  that  the  (111)  facet  of  a 
(001) -{111}  intersection  has  an  etch  rate  much  lower  than  —2 
nm/s  in  our  process,  as  expected.  This  is  also  supported  by 
the  inactive  etching  through  the  “B”  openings.  Further  study 
is  necessary  to  understand  the  etching  mechanism  of  (111) 
facet  of  a  {11 1}-{1 11}  intersection  on  a  Si  surface. 

Figure  3(c)  suggests  that  the  “A”  openings  having  differ¬ 
ent  sizes  and  shapes  in  the  Cr  mask  were  forced  into  a 


uniform-size,  square-shape  etch  area  on  the  Si  surface  in  the 
top  view  of  Fig.  3(c).  This  improvement  is  likely  also  due  to 
AWE  on  this  complex  multiple- surface  substrate.  This  also 
means  that  the  uniformity  of  the  width  of  each  (001)  stripe 
along  and  across  the  groove  direction  which  is  determined  in 
the  first  step  ID  patterning  is  more  important  to  the  final  2D 
pattern  than  that  of  the  additional  patterning  for  the  “A” 
openings. 

V.  SUMMARY  AND  CONCLUSIONS 

A  fabrication  technique  for  2D  nanoscale  patterning  by 
large-area  IL  and  AWE  has  been  reported.  On  a  Si(001)  sub¬ 
strate  corrugated  by  ID  periodic  V-grooves,  a  second  ID 
patterning  perpendicular  to  the  groove  direction  was  per¬ 
formed  by  IL  and  AWE.  Nanoscale  2D  patterning  has  been 
demonstrated  utilizing  {111}  facets,  the  V-groove  sidewalls, 
as  etch  barriers  for  confining  AWE  to  the  top  of  the 
nanoscale- wide  (001)  stripe.  AWE  initiated  on  a  (001)  stripe 
extends  laterally  but  slows  down  after  reaching  both  (001)- 
(111)  facet  intersections.  Laterally  confined  AWE  induces  the 
same  effect  on  the  longitudinal  direction  along  the  (001) 
stripe.  Such  2D  confinement  of  AWE  leads  to  a  uniform  and 
reproducible  nanoscale  2D  pattern  on  Si(001).  With  this 
technique,  a  355  nm  period  2D  array  of  Si  mesas  with  a 
rectangular  top  (001)  facet  of  an  area  —150  nmXllO  nm 
surrounded  by  various  (111)  type  facets  in  a  unit  period  has 
been  achieved.  Nanofaceting  by  IL  and  AWE  might  be  im¬ 
portant  for  future  semiconductor  technologies  such  as  nanos¬ 
cale  patterned  growth  and  nanoscale  electromechanical  sys¬ 
tems  (NEMS). 
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Abstract 

Phase  stability  of  GaN  on  (1  1  1)-(001)  nanoscale  faceted  Si  is  investigated.  Nanoscale  faceting  (nanofaceting)  is 
accomplished  on  a  Si(00  1)  substrate  with  interferometric  lithography  and  anisotropic  wet  etching.  By  relying  on  the 
nucleation  and  associated  lateral  growth  depending  on  orientation  and  crystal  structure,  spatial  separation  of  a  cubic- 
phase  region  from  hexagonal-phase  GaN  regions  which  initiate  on  the  facing  Si(l  1  1)  sidewalls  of  a  355-nm  period  V- 
grooved  Si  surface  is  achieved.  The  appearance  of  cubic-phase  region  is  associated  with  a  rapid  surface  planarization 
with  its  principal  crystal  axes  parallel  to  those  of  the  Si(00  1)  substrate.  The  boundary  between  cubic  and  hexagonal 
phases  is  revealed  along  the  Si(l  1  1)  sidewalls  and  extends  from  each  V-groove  to  the  GaN  surface  for  a  600-nm 
deposition.  The  phase  separation  of  GaN  selectively  grown  on  a  nanoscale  faceted  Si  surface  and  the  comparison  of  its 
material  properties  (photoluminescence  and  Raman  scattering  for  stress  measurement)  with  those  of  GaN  grown  on 
wide-area  Si(l  1  1)  and  Si(001)  imply  that  nanofaceting  can  be  utilized  for  phase  control  of  GaN  on  a  Si(001) 
substrate. 

©  2004  Elsevier  B.V.  All  rights  reserved. 
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1.  Introduction 

Growth  of  GaN  on  a  Si  substrate  offers  the 
exciting  potential  of  integration  of  GaN  devices 
with  well-established  Si  technology.  In  spite  of  this 
advantage,  however,  the  large  lattice  and  thermal 
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mismatches  between  these  materials  remain  a 
critical  problem.  Several  attempts  for  growth  of 
GaN  on  a  Si(l  1  1)  substrate  such  as  wafer 
patterning  to  induce  lateral  growth  [1,2]  or  to 
confine  the  growth  area  [3-5]  and  various  kinds 
of  buffer  or  intermediate  layers  [3-6]  have 
yielded  improved  optical/electrical  properties  and 
suggest  a  possibility  of  high-quality  GaN  on  a  Si 
substrate. 
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For  integration  with  most  Si  electronics  tech¬ 
nology,  growth  on  Si(0  0  1)  is  much  more  favorable 
than  on  Si(l  1  1).  However,  deposition  of  GaN  on 
a  (0  01)  plane  results  in  poly-type  rough  surface 
morphology  which  is  not  suitable  for  most  device 
fabrication  [5-8].  Some  theoretical  calculations 
suggest  a  relatively  small  energy  difference  be¬ 
tween  hexagonal  and  cubic  phases  of  GaN  com¬ 
pared  with  other  Ill-nitrides  as  one  of  the  possible 
explanations  [9,10].  Together  with  the  problem 
already  mentioned,  the  phase  instability  of  GaN 
on  Si(0  0  1)  is  another  important  issue  which  must 
be  investigated  for  application  to  Si  technology. 

Recent  progress  in  lithography  technology  and 
its  application  to  molecular  beam  epitaxy  and 
metalorganic  vapor-phase  epitaxy  (MOYPE)  has 
made  accessible  a  new  regime  of  growth — nanos¬ 
cale  patterned  growth  and  nano-heteroepitaxy 
[11,12].  In  this  work,  we  address  the  issue  of  phase 
instability  with  a  nanoscale  patterning  technique. 
Experimentally,  we  found  that  GaN  begins  to 
grow  with  hexagonal  phase  on  the  Si{  1  1  1}  side- 
walls  of  a  nanoscale  Y-grooved  (nanogrooved)  Si 
surface,  but  changes  to  cubic  phase  in  continued 
deposition.  The  cubic-phase  GaN  is  thus  spatially 
separated  from  the  hexagonal-phase  region  in  the 
resulting  crystalline  structure  with  its  principal 
crystal  axes  parallel  to  those  of  the  Si(0  01) 
substrate.  The  morphology  of  the  GaN  surface  is 
also  significantly  improved.  These  imply  that 
nanofaceting  on  Si(0  01)  is  a  promising  phase 
control  technique  for  single-phase  GaN  on  a 
limited  area  of  Si(0  01).  In  this  work,  the  phase 
stability  and  stress  distribution  of  GaN  grown  on  a 
nanogrooved  Si  surface  is  investigated  and  com¬ 
pared  with  those  of  GaN  grown  on  wide-area 
Si(l  1  1)  and  Si(0  01). 


2.  Nanofaceting  and  MOVPE  growth 

A  355-nm  period  one  dimensional  (ID)  nanos¬ 
cale  Y-grooved  surface  is  fabricated  on  a  boron- 
doped  8-1 5  Q  cm  Si(0  01)  substrate  by  i- line 
interferometric  lithography  and  aqueous  KOH 
anisotropic  wet  etching  [13].  The  pattern  direction 
was  chosen  so  that  the  anisotropic  wet  etching 
revealed  {111}  type  sidewall  orientations  in  the 


Fig.  1.  Cross-sectional  SEM  images  of  a  355-nm  period 
nanogrooved  surface  fabricated  on  a  Si(00  1)  substrate. 


grooves.  Fig.  1  shows  cross  section  scanning 
electron  microscopy  (SEM)  images  of  a  nano¬ 
grooved  Si  surface.  As  shown  in  the  bottom  part 
of  Fig.  1,  each  Y-groove  has  well-defined  two 
facing  {111}  type  sidewalls  connected  to  neigh¬ 
boring  grooves  by  a  sub-lOOnm-wide  (0  0  1)  stripe 
facet.  Thus,  each  Y-groove  is  isolated  by  the  (0  0  1) 
stripe  facets. 

Three  different  substrates  were  examined  in  a 
single  run  growth;  the  nanogrooved  Si(0  01) 
shown  in  Fig.  1,  wide-area  Si(l  1  1),  and  wide-area 
Si(0  0  1).  On  these  substrates,  a  GaN  epilayer  was 
grown  by  MOYPE  with  trimethylgallium  (TMG) 
and  NH3.  First,  the  substrates  were  preheated  up 
to  1100°C  under  hydrogen  environment  and 
cooled  down  to  650  °C.  At  this  temperature,  a 
M.5-nm  thick  GaN  buffer  was  grown  on  each  Si 
substrate.  The  low-temperature  buffer  was  em¬ 
ployed  to  assist  nucleation  over  the  whole  sub¬ 
strate  surface.  To  avoid  reactions  between  NH3 
and  the  Si  surface,  which  could  result  in  Si- 
nitridation  at  the  beginning  of  growth,  both  TMG 
and  NH3  were  simultaneously  supplied  to  the 
reactor  for  the  buffer  growth.  On  top  of  this 
buffer,  a  600-nm  thick  GaN  epilayer  was  grown  at 
1050  °C.  Nominal  growth  rates  were  1.5nm/min 
for  the  buffer  and  10  nm/min  for  the  GaN  epilayer. 


3.  Results  and  discussion 

Fig.  2(a)  shows  a  top-view  SEM  image  of  the  as- 
grown  GaN  on  the  Si  nanogrooved  substrate.  As 
confirmed  in  the  cross-sectional  transmission 
microscopy  (XTEM)  below,  the  surface  shown  in 
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Fig.  2.  Top  view  SEM  images  of  the  600-nm  thick  GaN  on  (a) 
Si  nanogrooves,  (b)  wide-area  Si(l  1  1),  and  (c)  wide-area 
Si(00  1)  grown  under  the  same  conditions. 

Fig.  2(a)  is  smooth  and  flat.  Figs.  2(b)  and  (c)  are 
top-view  SEM  images  of  the  GaN  deposited  on 
wide-area  Si(l  1  1)  and  Si(0  01)  under  the  same 
growth  conditions  as  that  of  Fig.  2(a),  respectively. 


Although  the  GaN  on  Si(l  1  1)  is  degraded  by  a 
number  of  pm-size  irregular  dips,  it  has  a  relatively 
flat  surface.  On  the  other  hand,  the  GaN  on 
Si(0  01)  has  a  severely  degraded  morphology 
which  is  analogous  to  poly  crystalline  film  surfaces. 
This  granular  or  poly-type  surface  is  the  typical 
surface  morphology  of  GaN  on  wide-area  Si(0  0  1) 
[7,8].  Although  the  original  orientation  of  the 
substrate  surface  was  (0  0  1),  the  GaN  surface  of 
Fig.  2(a)  is  very  different  from  that  of  Fig.  2(c) 
because  of  the  impact  of  the  nanofaceting  on 
epitaxial  growth  [4-8].  The  comparison  of  Fig.  2(a) 
with  Fig.  2(c)  implies  that  nanofaceting  signifi¬ 
cantly  improves  the  surface  morphology  of  GaN 
onSi(OOl). 

Fig.  3(a)  shows  an  XTEM  image  of  the  as- 
grown  GaN  on  the  Si  nanogrooves.  This  figure 
confirms  that  the  GaN  surface  is  relatively  smooth 
and  flat.  As  indicated  by  arrows  in  Fig.  3(a),  an 
array  of  voids  located  on  Si(0  01)  stripe  facets 
between  Y-grooves  is  observed  at  the  GaN/Si 
interface.  The  inset  of  Fig.  3(b)  reveals  that  the 


Fig.  3.  (a)  An  XTEM  image  of  the  600-nm  thick  GaN  on  Si 
nanogrooves.  The  arrows  indicate  an  array  of  voids  on  the 
Si(00  1)  stripe  facets,  (b)  A  high-resolution  XTEM  image  taken 
near  the  bottom  left  corner  of  a  void  shown  in  (a).  The  inset  is  a 
magnification  of  a  single  void. 
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shape  of  the  void  (or  a  nano-channel  along  the 
groove  direction)  is  triangular.  A  similar  shape 
void  which  is  due  to  selective  nucleation  and 
coalescence  of  GaN  epilayers  grown  on  the  facing 
Si(l  1  1)  sidewalls  of  a  single  groove  has  been 
reported  previously  [14].  Fig.  3(b)  which  is  a  high- 
resolution  XTEM  image  taken  near  the  bottom 
left  corner  of  the  void,  reveals  that  the  GaN  has 
hexagonal  phase  at  the  initial  stage  of  growth  of 
which  the  oaxis  ([0  0  0  1])  is  normal  to  Si{l  1  1} 
and  is  highly  defected  by  stacking  faults  because  of 
large  lattice  mismatch  between  GaN  and  Si.  As 
shown  in  Fig.  3(b),  lateral  overgrowth  from  the 
neighboring  V-grooves  toward  the  Si(0  0  1)  stripe 
facet  along  GaN[0  0  01]  is  involved  in  the  forma¬ 
tion  of  the  void.  Fig.  3,  therefore,  implies  that  the 
planarization  of  Fig.  2(a)  is  partly  associated  with 
lateral  overgrowth  and  coalescence  over  the 
Si(0  0  1)  stripe  facets. 

High-resolution  XTEM  and  selected  area  elec¬ 
tron  diffraction  (SAED)  provide  important  infor¬ 
mation  about  planarization  and  phase  separation. 
Fig.  4  is  a  magnified  XTEM  image  of  a  single  V- 
groove  (the  white  box  in  Fig.  3(a)).  Figs.  5(a)-(f) 
are  high-resolution  XTEM  images  and  SAED 
patterns  taken  from  several  areas  of  Fig.  4. 
The  regions  labeled  A-F  of  Fig.  4  correspond  to 
Figs.  5(a)-(f).  As  seen  in  these  figures,  the  600-nm 
thick  GaN  exhibits  a  spatial  variation  in  crystal- 


Fig.  4.  A  magnified  XTEM  image  of  the  white  box  in  Fig.  3(a). 


Fig.  5.  High-resolution  XTEM  images  [(a)— (d)]  and  SAED 
patterns  [(e)  and  (f)]  taken  from  the  cross  section  of  the  600-nm 
thick  GaN  on  Si  nanogrooves  shown  in  Fig.  4.  The  regions 
labeled  in  Fig.  4  correspond  to  (a)-(f)  in  alphabetical  order.  In 
(e),  the  array  of  the  white  arrows  indicates  the  diffraction  spots 
from  the  hexagonal-phase  GaN. 


line  phases  across  each  period.  While  regions  A 
and  E  are  hexagonal  phase,  regions  B  and  F  show 
clear  cubic  phase.  On  the  other  hand,  regions  C 
and  D  have  cubic  phase  over  a  highly  defected 
region  where  two  hexagonal  phase  regions  are 
crossing  with  an  angle  of  ~110°  in  their  oaxis 
alignment.  This  misalignment  is  due  to  the  ~55° 
tilting  of  the  GaN(0  0  01)  [or  Si(l  1  1)]  from  the 
Si(0  0  1)  plane. 

The  XTEM  examination  reveals  the  boundaries 
between  these  two  phases,  indicated  by  the  white 
dashed  lines  in  Fig.  4.  They  are  parallel  to 
Si{  1  1  1}.  The  hexagonal  phase  extends  from  each 
Si  V-groove  to  the  surface  with  forming  V-shaped 
cross  section.  The  white  arrows  of  Fig.  4  denote 


6 


S.C.  Lee  et  al.  /  Journal  of  Crystal  Growth  272  (2004)  2-8 


the  direction  of  the  oaxis  in  each  hexagonal-phase 
GaN.  Thus,  cubic-phase  regions  are  spatially 
separated  from  hexagonal  phases  along  the 
boundaries  in  the  GaN  grown  on  a  nanogrooved 
Si  surface.  Region  D  of  Fig.  4  (or  Fig.  5(d)) 
corresponds  to  the  starting  point  of  the  phase 
transition  within  a  single  Y-groove.  From  the 
(0  0  2)  spot  position  of  Fig.  5(f),  it  can  be  noticed 
that  the  orientations  of  the  principal  crystal  axes  of 
the  cubic-phase  GaN  are  the  same  as  those  of  the 
Si(0  0  1)  substrate.  The  transition  from  hexagonal 
to  cubic  phase  also  contributes  to  the  formation  of 
the  planar  GaN  surface  of  Figs.  2(a)  and  3(a) 
which  includes  GaN(OOl)  parallel  to  Si(0  01)  at 
the  top  surface.  In  Fig.  5(e),  because  of  the  finite 
aperture  size,  region  E  of  Fig.  4  (a  solid  circle) 
partly  covers  the  adjacent  cubic-phase  area  and 
Fig.  5(e)  consists  of  the  overlap  of  SAED  patterns 
from  cubic  and  hexagonal  phases. 

Fig.  6  presents  room-temperature  photolumi¬ 
nescence  (PL)  spectra  obtained  from  the  GaN 
epilayers  grown  on  Si  nanogrooves  and  on  wide- 
area  Si(l  1  1),  shown  in  Fig.  2.  In  Fig.  6,  two  PL 
peaks  at  3.389  and  3.200  eV  are  observed  from  the 
GaN  on  the  nanogrooved  Si  surface,  while  only  a 
single  PL  peak  at  3.387  eV  is  seen  from  that  on 
wide-area  Si(l  1  1).  The  peaks  at  3.389  and 
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Fig.  6.  Room-temperature  PL  spectra  of  the  GaN  grown  on  Si 
nanogrooves  and  on  wide-area  Si(l  1  1). 


3.387  eY  are  close  to  the  band  edge  transition  of 
hexagonal-phase  GaN  on  Si(l  1  1)  or  of  homo- 
epitaxial  GaN  [15,16].  On  the  other  hand,  the  peak 
at  3.200  eV  is  very  close  to  the  PL  peak  energy  of 
cubic-phase  GaN  on  GaAs(0  0  1)  [17,18].  Thus,  the 
GaN  on  wide-area  Si(l  1  1)  is  mainly  hexagonal 
phase  but  that  on  Si  nanogrooves  has  both 
hexagonal  and  cubic  phase,  consistent  with  the 
high-resolution  XTEM  results  of  Fig.  5. 

The  Si  nanogrooved  surface  of  Fig.  1  also 
contains  (111)  orientation  which  allows  nuclea- 
tion  and  incorporation  to  proceed  selectively  on 
the  (1  1  1)  facets  and  leads  to  growth  of  hexagonal- 
phase  GaN  at  the  initial  stages  of  growth,  as  for 
the  wide-area  Si(l  1  1)  shown  in  Fig.  2(b).  In 
continued  deposition,  the  GaN  on  Si  nanogrooves 
undergoes  transition  to  cubic  phase.  But,  the 
planar  wide-area  Si(l  11)  substrate  does  not  result 
in  a  transition  to  the  cubic  phase.  Together  with 
lateral  overgrowth  and  coalescence,  the  phase 
transition  leads  to  smooth  surface  morphology 
and  spatial  separation  of  cubic  phase  from 
hexagonal  phase  in  the  GaN  on  Si  nanogrooves 
of  which  the  original  surface  orientation  was 
identically  (0  0  1),  like  the  wide-area  Si(0  0  1)  used 
for  the  GaN  of  Fig.  2(c).  Consequently,  the  GaN 
grown  on  a  nanogrooved  Si(0  01)  substrate  is 
significantly  different  from  that  grown  on  Si(l  1  1) 
and  Si(0  0  1).  Once  the  phase  transition  occurs,  its 
boundary  is  maintained  up  to  600-nm  deposition. 
If  phase  transition  had  not  occurred,  the  GaN  on  a 
nanogrooved  Si  surface  would  have  been  forced  to 
keep  periodic  misaligned  hexagonal  phase  across 
the  groove  direction  (the  white  arrows  in  Fig.  4) 
which  is  energetically  very  unstable.  Thus,  the 
phase  transition  is  likely  to  reduce  the  total  crystal 
energy  in  epitaxial  growth.  Further  study  on  the 
phase  transition  depending  on  groove  width  and 
pattern  period  is  presently  under  way. 

Finally,  Fig.  7  presents  room-temperature  Ra¬ 
man  scattering  spectra  taken  from  the  GaN 
epilayers  grown  on  Si  nanogrooves  and  on  wide- 
area  Si(l  1  1)  in  backscattering  geometry  with  an 
incident  direction  of  the  laser  beam  normal  to 
Si(0  0  1)  (or  GaN(0  0  1))  of  Si  nanogrooves  and  to 
wide-area  Si(l  1  1),  respectively.  As  seen  in  Fig.  7, 
two  peaks  near  550  and  565  cm-1  are  observed 
from  the  former  while  a  single  peak  at  564  cm-1 
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Fig.  7.  Room-temperature  Raman  scattering  spectra  of  the 
GaN  grown  on  Si  nanogrooves  and  on  wide-area  Si(l  1  1). 

appears  in  the  latter  one.  The  peaks  near  564  and 
565  cm-1  represent  the  high-frequency  E2  mode  of 
hexagonal-phase  GaN  and  the  peak  near  550  cm-1 
is  close  to  the  TO  mode  of  cubic-phase  GaN  on 
GaAs(OOl)  [19-22].  Thus,  the  absence  of  a  TO 
mode  in  the  GaN  on  wide-area  Si(l  1  1)  suggests 
that  the  dominant  phase  of  GaN  on  wide-area 
Si(l  1  1)  is  hexagonal,  confirming  the  PL  results  of 
Fig.  6.  Comparing  the  frequency  shifts  of  the  E2 
mode  with  that  of  the  corresponding  phonon 
mode  of  bulk  GaN  (~568cm-1),  the  E2  modes  of 
the  GaN  on  Si  nanogrooves  and  on  wide-area 
Si(l  1  1)  have  about  —3  and  —4  cm-1  shift  from 
that  of  unstrained  GaN,  respectively  [23,24].  Using 
a  linear  approximation  to  the  frequency  shift- 
biaxial  stress  relationship,  these  shifts  imply  ~0.5 
and  ~0.65  GPa  tensile  stress  [25].  This  is  consistent 
with  the  slight  difference  of  PL  peak  energies 
shown  in  Fig.  6  [16,26].  Thus,  the  GaN  on  Si 
nanogrooves  exhibits  lowered  tensile  stress  than 
the  growth  on  wide-area  Si(l  1  1).  This  could  be 
partly  explained  by  the  large  number  of  crystal 
defects  near  the  valley  of  the  V-grooves  and  at  the 
coalescence  regions  over  the  Si(0  0  1)  stripe  facets 
which  can  play  a  role  of  additional  local  relaxation 


centers  for  the  GaN  on  the  Si  nanogrooved 
substrate. 

The  TO  mode  of  GaN  on  GaAs(0  0  1)  is  at  a 
frequency  shift  of  ~555cm-1,  close  to  the  peak 
observed  near  550 cm-1  in  Fig.  7  [19,20].  The  TO 
mode  from  cubic  phase  is  forbidden  in  the  given 
backscattering  geometry  and  its  intensity  is  thus 
very  weak.  The  TO  mode  of  the  cubic-phase  GaN 
on  Si  nanogrooves  then  shows  about  —5  cm-1 
frequency  shift,  implying  higher  tensile  stress  than 
that  on  GaAs(OOl).  This  contradicts  the  predic¬ 
tion  based  on  lattice  mismatch.  However,  it  should 
be  noted  that  the  cubic-phase  GaN  on  a  Si 
nanogrooved  substrate  is  not  directly  equivalent 
to  that  on  wide-area  GaAs(0  0  1).  Further  investi¬ 
gation  is  required  to  understand  the  stress 
distribution  of  cubic-phase  GaN  on  Si  nano¬ 
grooves. 

In  the  GaN  on  Si  nanogrooves  of  Fig.  4,  cubic 
and  hexagonal  phases  are  periodically  separated 
across  the  groove  direction.  Since  both  phases  are 
generated  in  every  groove  and  comparably  extend 
to  the  surface,  probably  none  of  them  would  be  a 
dominant  phase  in  the  growth  direction  even 
though  deposition  is  continued  further.  For  device 
applications,  however,  it  is  possible  to  obtain  pure 
cubic-phase  GaN  on  a  limited  area  of  Si  nano¬ 
grooves.  In  Fig.  4,  the  extension  of  hexagonal- 
phase  GaN  to  neighbor  grooves  after  the  coales¬ 
cence  over  the  Si(0  01)  stripe  facets  prevents  the 
formation  of  large-area  cubic-phase  GaN  at  the 
top  surface.  Such  coalescence  could  be  avoided  or 
retarded  if  the  Si(0  01)  stripe  width  is  increased.  By 
controlling  pattern  period  and  duty  cycle  in 
grooving  process,  therefore,  growth  of  pure 
cubic-phase  GaN  on  a  desired  area  of  Si(0  01) 
can  be  realized. 


4.  Summary  and  conclusions 

Phase  behavior  of  GaN  on  nanofaceted  Si(0  01) 
has  been  investigated.  On  355-nm  period  V- 
grooves  fabricated  on  a  Si(0  01)  substrate,  600- 
nm  deposition  of  GaN  by  MOVPE  results  in 
spatial  separation  of  cubic-  and  hexagonal-phase 
GaN.  Orientation-dependent  nucleation  and  asso¬ 
ciated  lateral  overgrowth  and  coalescence  lead  to 
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formation  of  cubic-phase  GaN  for  which  the 
direction  of  the  principal  crystal  axes  are  identical 
to  those  of  the  Si(0  0  1)  substrate  and  result  in  a 
significant  improvement  of  the  surface  morpho¬ 
logy.  Conclusively,  nanofaceting  is  highly  predict¬ 
able  in  phase  control  of  GaN  on  Si(0  01),  and  can 
be  applied  to  growth  of  cubic-phase  GaN  on  a 
limited  area  of  Si(0  01). 
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Abstract — Nanoscale  spatial  phase  modulation  of  GaN  gorwn 
on  a  355-nm  period  array  of  V-grooves  fabricated  in  a  Si(001) 
substrate  is  reported.  Orientation-dependent  selective  nucleation 
of  GaN  in  metal-organic  vapor  phase  epitaxy  begins  from  the 
opposing  Si{lll}  sidewalls  and  rapidly  fills  each  V-groove.  At 
the  initial  stages  of  growth,  the  GaN  deposited  on  the  sidewalls 
has  hexagonal  phase  with  the  c-axis  normal  to  the  Sijlll}.  As 
the  growth  continues,  the  filling  of  the  V-groove  over  these  mis¬ 
aligned  hexagonal  phase  regions  results  in  a  transition  to  a  cubic 
phase  with  its  principal  crystal  axes  parallel  to  those  of  the  Si 
substrate.  In  a  cross-sectional  view  perpendicular  to  the  grooves, 
the  defected  hexagonal  phase  region  and  the  clean  cubic  phase 
region  above  it  form  a  boundary  at  the  inside  of  each  V-groove 
which  is  parallel  to  the  Sijlll}  sidewalls.  The  GaN  surface  is 
almost  planarized  for  only  75-nm  deposition  and  is  parallel  to  the 
original  (001)  plane  of  the  Si  substrate.  The  GaN  clearly  exhibits 
nanoscale  spatial  phase  modulation  with  a  periodic  separation  of 
hexagonal  and  cubic  crystal  structures  across  the  groove  direction 
for  600-nm  deposition,  implying  a  possibility  of  cubic  phase  GaN 
on  an  isolated  single  V-groove  fabricated  in  a  Si(001)  substrate 
for  monolithic  integration.  The  structural/optical  properties  and 
stress  measurements  of  this  phase-modulated  GaN  grown  on  a 
nanoscale  faceted  Si  surface  are  presented. 

Index  Terms — GaN  on  Si,  nanofaceting,  phase  transition. 


I.  Introduction 

GALLIUM  nitride  (GaN)  and  related  materials  have  at¬ 
tracted  much  attention  since  the  achievement  of  blue 
laser  diodes  on  a  sapphire  substrate  [1].  Because  of  their  large 
bandgap,  these  materials  are  also  attractive  for  high-power  and 
high-speed  electronic  applications.  While  sapphire  has  a  crystal 
symmetry  that  is  identical  to  that  of  GaN,  the  large  lattice  mis¬ 
match  (16%)  and  low  thermal  conductivity  (~  0.3  W/cm  •  K) 
are  disadvantageous  [2] .  The  choice  of  a  substrate  for  GaN  thus 
remains  an  important  issue. 

Lattice  mismatch,  thermal  expansion  mismatch,  elec¬ 
trical/thermal  conductivity,  and  surface  orientation  which 
affects  the  crystal  structure  and  material  properties  of  GaN 
grown  on  it  are  important  factors  in  the  selection  of  substrates. 
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Growth  of  Ill-nitrides  on  various  substrates  has  been  reported. 
Sapphire  is  most  widely  used  but  SiC,  GaN,  GaAs,  Si,  and 
ZnO  have  been  examined  [3].  With  the  problematic  exception 
of  GaN  homoepitaxy,  SiC  has  the  smallest  lattice  mismatch 
(3.5%  for  6H  SiC)  and  the  highest  thermal  conductivity  (4.9 
W/cm-K),  but  is  very  expensive  and  of  lower  overall  quality 
than  traditional  semiconductor  substrates  such  as  Si  [2].  Re¬ 
cently,  growth  on  ZrB2  which  has  0.5%  lattice  mismatch  with 
thermal  conductivity  of  0.9-1. 1  W/cm-K  has  been  reported 
[4].  Currently,  no  substrate  for  GaN  which  is  cost-effective  for 
high-power  operation  is  available. 

Si  has  potential  as  a  substrate  for  GaN  as  a  result  of 
the  availability  of  low-cost  large-area  substrates  (to  300-mm 
diameter)  with  extremely  high  quality.  Si  has  a  large  lattice 
mismatch  with  GaN  (—17%),  but  has  a  thermal  conductivity 
comparable  with  GaN  (1.3- 1.5  W/cm  K),  considerably  larger 
than  that  of  sapphire  [2].  However,  GaN  grown  on  Si(001) 
is  known  to  have  a  polytype  phase  instability,  degrading  the 
crystallinity  and  resulting  in  an  uncontrolled  mixture  of  cubic 
and  hexagonal  phases  with  poor  surface  morphology  [5]-[7]. 
Theoretical  calculations  imply  that  the  polytypism  results  from 
the  small  difference  of  total  crystal  energy  between  the  hexagonal 
and  cubic  phases  [8].  Experimentally,  this  could  be  related  to 
the  stoichiometry  of  the  front  growth  surface  [9].  For  device 
applications,  phase  control  or  phase  separation  into  large-area 
single-phase  GaN  on  Si(001)  is  required.  Recently,  progress 
in  the  growth  of  single  phase  GaN  on  Si(001)  such  as  cubic 
phase  GaN  with  a  low-temperature  GaN  buffer  or  hexagonal 
phase  GaN  with  an  AIN  buffer  layer  has  been  reported  [5], 
[10]. 

In  previous  work,  we  have  observed  orientation-dependent 
nucleation  and  the  growth  of  spatially  separated  hexagonal 
and  cubic  phase  crystals  of  GaN  on  a  nanoscale  faceted 
(nanofaceted)  Si  surface  referred  to  as  a  flat-bottom  groove 
[1 1],  [12].  Fig.  1  shows  a  schematic  cross-sectional  view  of  this 
structure  which  consists  of  two  opposing  Sijlll}  sidewalls 
isolated  by  Si(001)  top  and  bottom  facets  in  each  period.  Coa¬ 
lescence  of  the  two  hexagonal  phase  GaN  epilayers  separately 
grown  from  the  opposing  Sijlll}  sidewalls  with  misaligned 
c-axes  results  in  a  structural  instability  that  is  relieved  by  a 
transition  to  the  cubic  phase  in  the  central  V-groove  region 
above  the  bold  dashed  lines  of  Fig.  1. 

In  this  work,  utilizing  the  phase  transition  on  a  Si  grooved 
surface,  we  investigate  nanoscale  spatial  phase  modulation  of 
GaN  on  one-dimensional  (1-D)  periodic  V-grooves  fabricated  in 
a  Si(001)  substrate.  Growth  of  GaN  on  a  Si  V-groove  then  may 
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Fig.  2.  (a)  45  °  tilted  and  (b)  a  cross-sectional  SEM  image  of  a  355-nm  period 

1-D  V-grooves  fabricated  on  a  Si(001)  substrate. 

Fig.  1.  Schematic  illustration  of  the  material  phase  map  for  GaN  grown  on  a 
(001)-(1 1 1)  nanofaceted  Si  surface.  See  [11]  for  details. 

II.  Nanoscale  V-Grooving  and  MOVPE  Growth 


be  able  to  provide  more  information  for  polytypism  of  GaN  on 
Si.  Additionally,  the  completed  V-groove  is  easier  to  fabricate 
uniformly  over  large  areas  on  a  Si  substrate  since  it  is  self-lim¬ 
iting  for  simple  anisotropic  etching  processes. 

Several  advantages  of  cubic  phase  GaN  in  device  applications 
already  have  been  suggested.  It  has  been  reported  that  cubic  phase 
GaN  can  offer  a  higher  electron  mobility  and  the  possibility  of 
higher  electrical  activation  in  p-doping  than  hexagonal  one,  both 
of  which  promise  important  device  benefits  [9] ,  [  1 3] .  Realization 
of  high  quality  cubic  phase  GaN  is  essential  to  prove  these  merits. 
Especially,  realization  of  device-quality  cubic  phase  GaN  on 
Si(00 1 )  which  is  most  widely  used  in  semiconductor  technologies 
will  strongly  impact  overall  semiconductor  technology  and 
manufacturing  through  monolithic  integration  of  GaN  devices 
with  mature  Si  microelectronics.  Also,  cleaving  for  the  mirror 
facets  of  GaN  lasers  will  be  one  of  the  strong  advantages 
for  the  usage  of  Si(001)  substrates  in  optoelectronics. 

In  this  work,  we  examine  the  phase  transition  mechanism 
with  a  more  reliable  process — V-grooving — and  present 
in-depth  analysis  of  optical  and  structural  properties  of  cubic 
phase  GaN  grown  by  metal-organic  vapor  phase  epitaxy 
(MOVPE)  which  is  very  important  to  the  fabrication  of  micro- 
and  optoelectronic  GaN  devices  on  Si(001).  With  variation 
of  duty  cycle  and  depth  in  V-grooving,  as  suggested  in  this 
work,  spatial  phase  modulation  will  lead  to  the  ability  to  realize 
cubic-phase  GaN  on  a  limited  area  of  a  Si(001)  substrate.  This  is 
a  very  promising  solution  for  low-cost  high-speed  high-power 
GaN  functional  devices.  The  GaN  on  nanofaceted  Si(001) 
presented  in  this  work  is  thus  expected  to  play  a  significant 
role  in  the  monolithic  integration  of  GaN  devices  on  Si(001) 
which  is  a  very  attractive  candidate  for  future  semiconductor 
technologies  but  currently  out  of  interest  for  the  low-quality 
epitaxy  on  a  planar  substrate. 

In  this  paper,  fabrication  of  the  V-grooves  on  a  Si  substrate 
and  the  epitaxial  growth  of  GaN  are  described  in  Section  II.  Ex¬ 
perimental  results  obtained  from  GaN  grown  on  this  V-grooved 
Si  surface,  including  orientation-dependent  nucleation,  crystal 
structure,  lateral  growth,  formation  of  the  cubic  phase,  and 
the  optical  properties  of  phase-modulated  GaN  are  presented 
in  Section  III.  Discussion  of  these  results  is  described  in  Sec¬ 
tion  IV  with  suggestions  for  future  device  applications.  We 
summarize  all  the  results  in  Section  V  with  conclusions. 


Using  large-area  interferometric  lithography  (IL)  and 
anisotropic  wet  etching  (AWE),  a  nanoscale  1-D  array  of 
V-grooves  consisting  of  (lll)-type  faceted  sidewalls  was  fab¬ 
ricated  on  a  Si  surface.  A  boron-doped  Si(001)  substrate  with  a 
resistivity  of  about  8-15  Q-cm  was  used.  A  single  longitudinal 
and  transverse  mode,  frequency-tripled  yttrium  aluminum 
garnet  (YAG)  laser  at  a  355-nm  wavelength  was  used  as  an 
ultraviolet  light  source  in  IL.  A  355-nm  period  1-D  stripe  pho¬ 
toresist  (PR)  pattern  was  generated  on  a  Si(001)  substrate  and 
was  coated  with  a  10-nm- thick  Cr  film.  By  liftoff  process,  the 
PR  pattern  was  transferred  into  the  Cr  film,  which  was  used  as 
an  etch  mask  for  AWE  to  generate  Si(l  1  l)-sidewall  V-grooves. 
The  remaining  Cr  was  then  stripped,  leaving  a  nanofaceted 
(001)-(lll)-(m)-(001)  V-groove.  The  details  of  IL  and  AWE 
processing  were  presented  elsewhere  [14],  [15].  Fig.  2  shows 
45°  tilted  and  cross-sectional  scanning-electron  microscopy 
(SEM)  images  of  the  patterned  Si  substrates.  In  each  V-groove, 
the  width  of  the  top  (100)  facet  is  60-80  nm,  while  the  depth  of 
the  groove  is  1 80-200  nm. 

The  nanofaceted  substrates  were  passivated  with  hydrogen  by 
dipping  them  in  diluted  HF  immediately  before  loading  into  an 
MOVPE  reactor.  In  the  reactor,  they  were  preheated  to  1 100  °C 
and  cooled  down  to  650  °C  under  a  hydrogen  environment.  On 
the  nanofaceted  Si  surface  shown  in  Fig.  2,  two  different  thick¬ 
ness  GaN  epilayers  were  separately  deposited  using  (CPG^Ga 
and  NH3  source  gases.  A  initial  ~1.5-nm-thick  GaN  buffer  was 
grown  at  650  Ton  both  samples.  This  low-temperature-grown 
buffer  provided  better  nucleation  of  GaN  on  the  Si  surface.  On 
top  of  this  buffer,  75-nm  and  ~600-nm- thick  GaN  epilayers 
were  grown  on  separate  samples  at  1050  °C.  The  thinner  growth 
corresponds  to  about  a  half  of  the  depth  of  the  V-groove  while 
the  thicker  growth  is  about  twice  as  thick  as  the  groove  period. 
The  MOVPE  reactor  is  horizontal  type.  The  V/III  ratio  for  the 
buffer  and  the  epilayer  of  both  samples  were  ~  4.5  x  103  and 
~  10  4,  respectively.  Nominal  growth  rates  were  1.5  nm/min  for 
the  buffer  and  10  nm/min  for  the  GaN  epilayers. 

As  seen  in  the  next  section,  the  actual  deposition  amount  is 
almost  twice  that  estimated  from  the  nominal  growth  rate  mul¬ 
tiplied  by  the  growth  time  in  the  75-nm  layers.  The  growth  rates 
were  calibrated  with  a  planar  sapphire  substrate  since  growth 
of  GaN  on  planar  wide-area  Si(001)  or  Si(lll)  did  not  result 
in  a  reasonably  flat  surface  morphology.  In  MOVPE,  all  the 
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Fig.  3.  (a)  Top  view  SEM  image  of  a  75-nm-thick  GaN  film  deposited  on  the 

nanofaceted  Si  surface  shown  in  Fig.  2.  The  black  and  a  white  arrows  indicate 
surface  imperfections  parallel  and  normal  to  the  groove  direction,  respectively. 
The  inset  is  a  magnified  45  °  tilted  SEM  image  taken  at  the  cleaved  edge  of  the 
as-grown  sample,  (b)  XTEM  image  of  the  GaN  shown  in  (a). 

gases  supplied  to  a  reactor  are  not  fully  decomposed  for  the 
chemical  reactions  for  epitaxial  growth.  The  actual  growth  rate 
thus  can  be  affected  by  surface  conditions.  Generally,  filling  of 
V-grooves  which  typically  results  in  surface  planarization  in  epi¬ 
taxy  is  a  fast  process  compared  with  growth  on  a  nearby  planar 
surface  under  the  given  growth  condition.  Also,  gas  flow  dy¬ 
namics  on  the  nanofaceted  Si  surface  may  not  be  equivalent  to 
that  on  a  planar  substrate.  Until  the  V-grooves  are  completely 
filled  by  deposition,  therefore,  the  actual  growth  rate  of  GaN  on 
the  nanofaceted  Si  surface  can  be  different  from  that  measured 
on  a  planar  substrate. 

The  samples  were  characterized  with  SEM,  transmission 
electron  microscopy  (TEM),  atomic  force  microscopy  (AFM), 
photoluminescence  (PL),  Raman  scattering,  and  wide-angle 
X-ray  diffractometry  (XRD).  For  PL,  a  325-nm  wavelength 
He-Cd  laser  and  a  cooled  GaAs  photomultiplier  were  used. 
Raman  spectra  were  taken  in  backscattering  geometry  from 
Si(001)  with  a  488-nm  Ar-ion  laser  excitation  source.  X-ray 
diffraction  ( 0-20  scan)  was  performed  at  the  Cu  Ka\  wave¬ 
length. 

III.  Results 

Fig.  3(a)  shows  a  45  °  tilted  SEM  image  of  a  75-nm-thick 
GaN  film  deposited  on  the  nanofaceted  Si  surface  shown  in 
Fig.  2.  The  inset  of  Fig.  3(a)  is  a  magnified  45  °  tilted  SEM 
image  taken  at  the  cleaved  edge  of  the  as-grown  sample.  The 
dashed  line  in  the  inset  denotes  the  GaN/Si  interface.  Fig.  3(b) 
is  a  cross-sectional  TEM  (XTEM)  image  of  the  same  sample. 
As  seen  in  these  figures,  the  original  Si  V-grooves  were  filled 
with  GaN  and  the  V-shaped  Si  surface  evolves  to  a  smooth, 
horizontal  GaN  surface  with  only  this  75 -nm  deposition.  While 
the  GaN  surface  above  the  valley  of  each  groove  is  almost  flat, 
that  above  each  Si(001)  stripe  facet  shows  some  fluctuation  in 
Fig.  3(b).  These  uneven  growths  atop  the  Si(001)  stripes  cor¬ 
respond  to  small  variations  parallel  the  groove  direction  which 
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Fig.  4.  High-resolution  XTEM  images  of  the  white  boxes  labeled  (a)  1  and  (b) 
2  of  Fig.  3(b). 

are  indicated  by  the  black  arrows  in  Fig.  3(a)  and  its  inset.  Ad¬ 
ditionally,  cracks  crossing  the  grooves,  but  bounded  within  a 
single  groove,  are  also  observed.  One  of  them  is  highlighted 
by  the  white  arrow  in  Fig.  3(a).  In  Fig.  3(b),  a  nanoscale  trian¬ 
gular  cross  section  channel  (nanochannel)  or  a  nanoscale  void 
along  the  groove  direction  is  observed  atop  each  Si(001)  stripe 
facet.  Thus,  the  GaN  also  covers  the  Si(001)  stripe  facets  but 
with  voids  atop  each  stripe. 

Fig.  4  shows  high  resolution  XTEM  images  of  selected  areas 
of  Fig.  3(b).  Fig.  4(a)  and  (b)  correspond  to  the  white  boxes 
labeled  1  and  2  in  Fig.  3(b),  respectively.  Fig.  4(a)  shows  the  area 
directly  above  the  narrow  Si(001)  stripe  facet.  The  GaN  in  this 
region  is  hexagonal  phase  but  exhibits  a  grain  boundary  along 
the  center  line  of  the  channel,  resulting  from  lateral  overgrowth 
from  the  GaN  nucleated  on  the  two  adjacent  Si  { 1 1 1 }  facets  with 
oppositely  directed  c-axes.  On  the  other  hand,  the  GaN  in  box  2, 
which  is  near  the  inside  center  of  a  Si  V-groove,  is  cubic  phase  as 
shown  in  Fig.  4(b).  This  cubic  phase  GaN  emerges  from  a  large 
number  of  stacking  faults  as  the  growth  proceeds,  and  shows 
a  phase  transition  similar  to  that  observed  on  the  flat-bottom 
groove  of  Fig.  1  [11]. 

Fig.  5(a)  shows  an  XTEM  image  of  a  600-nm-thick  GaN  film 
grown  on  a  nanofaceted  Si  surface.  Contrast  difference  clearly 
reveals  the  GaN/Si  interface.  The  top  GaN  surface  is  flat  and 
smooth.  As  indicated  by  white  arrows,  a  nanochannel  is  again 
observed  on  each  Si(001)  stripe  facet. 

Fig.  6(a)  is  an  AFM  image  of  the  surface  morphology  of  the 
as-grown  sample  shown  in  Fig.  5(a).  The  original  Si  grooves 
are  fully  covered  with  GaN  by  this  600-nm  deposition  with  im¬ 
proved  surface  morphology  as  compared  with  Fig.  3.  This  result 
is  also  significantly  flatter  and  smoother  than  other  reported  re¬ 
sults  on  wide-area  planar  Si(001)  surfaces  [5]— [7] .  Fig.  6(b)  is  a 
45  °  tilted  SEM  image  of  the  as-grown  sample.  The  cracks  and 
surface  defects  parallel  to  the  groove  direction  which  were  seen 
in  Fig.  3(a)  are  not  observed  on  the  600-nm-thick  GaN  film. 
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Fig.  7.  High-resolution  XTEM  images  of  the  boxes  labeled  A  (a)  through  C 
(c)  near  the  GaN/Si  interface  in  Fig.  5(a). 


Fig.  5.  (a)  XTEM  image  of  a  600-nm-thick  GaN  grown  on  a  nanofaceted  Si 

surface  shown  in  Fig.  2.  The  white  arrows  indicate  nanochannels  on  Si(001) 
stripe  facets,  (b)  Material  phase  map  of  the  cross  section  of  the  as-grown  sample 
shown  in  (a),  (c)  Schematic  illustration  of  cubic  phase  GaN  grown  on  a  single 
V- groove  (left)  and  hexagonal  GaN  grown  on  a  single  Si(l  1 1)  type  facet  (right). 
The  width  of  the  V- groove  ~  1  fim  is  assumed. 


(a) 


Groove  direction 


Fig.  6.  (a)  AFM  image  of  the  as-grown  600-nm-thick  GaN  surface,  (b)  45  ° 

tilted  SEM  image  of  the  same  sample. 


Fig.  7  shows  high  resolution  XTEM  images  taken  from 
the  cross  section  along  the  GaN-Si  interface  in  Fig.  5(a). 
Fig.  7(a)-(c)  correspond  to  the  white  boxes  of  Fig.  5(a)  la¬ 
beled  A  to  C.  As  seen  in  Fig.  7(a),  the  GaN  layer  exhibits  an 
amorphous-like  phase  at  the  beginning  of  deposition.  After  a 
few  nanometers  of  growth,  this  amorphous  phase  evolves  to  a 
hexagonal  phase  crystalline  structure,  but  with  a  large  number 
of  stacking  faults.  Fig.  7(b)  and  (c)  show  high  resolution  XTEM 
images  taken  at  the  vertexes  of  a  nanochannel  on  a  Si(001) 
stripe  facet.  The  nanochannel  in  Fig.  7(b)  is  large  enough  and 
has  a  sufficiently  clear  shape  to  identify  the  orientation  of  the 
sidewall  facets.  The  angle  between  the  sidewall  of  this  channel 
which  is  parallel  to  Si{  111}  and  the  Si(001)  at  the  bottom  is 


Fig.  8.  High-resolution  XTEM  images  of  the  areas  indicated  by  the  boxes 
labeled  (a)  D  and  (b)  E  near  the  GaN  surface,  and  (c)  F  and  (d)  G  near  the 
crossing  points  of  two  hexagonal  phase  GaN  regions  in  Fig.  5(a). 

about  55  °.  The  sidewall  orientations  of  these  channels  are  thus 
approximately  GaN  {0001}.  The  same  identification  can  be 
applied  to  the  channels  shown  in  Fig.  3(b).  As  in  Fig.  4(a),  the 
two  c-axes  of  the  GaN  over  the  nanochannel  are  misaligned 
with  an  angle  of  ~  1 10  °  in  Fig.  7(b)  and  (c). 

Fig.  8(a)  and  (b)  shows  high-resolution  XTEM  images  taken 
near  the  GaN  top  surface  regions  (the  boxes  labeled  D  and  E) 
in  Fig.  5(a),  respectively.  Also,  Fig.  8(c)  and  (d)  is  a  high-res¬ 
olution  XTEM  image  taken  near  the  crossover  regions  of  two 
hexagonal  phase  GaN  epilayers  which  are  indicated  by  boxes  F 
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Fig.  9.  SAED  patterns  taken  from  the  regions  indicated  by  the  circles  in 
Fig.  5(a).  The  regions  labeled  with  numeric  in  Fig.  5(a)  (1-4)  correspond  to 
(a)-(d).  The  circles  enclosing  each  letter  in  Fig.  5(b)  represent  the  aperture  size 
for  SAED. 


and  G  in  Fig.  5(a),  respectively.  The  GaN  near  box  D  has  hexag¬ 
onal  phase  but  that  around  box  E  has  cubic  phase.  On  the  other 
hand,  the  boxes  F  and  G  have  cubic  phase  over  and  under  the 
crossing  GaN  hexagonal  phases,  respectively. 

From  the  XTEM  on  the  whole  cross  section  of  the  600-nm- 
thick  GaN  on  the  nanoscale  V-grooved  Si  surface  of  Fig.  5(a), 
the  phase  map  shown  in  Fig.  5(b)  was  obtained.  As  indicated  by 
the  dashed  lines  in  Fig.  5,  the  hexagonal  phase  GaN  begins  to 
grow  from  the  V-grooved  Si  surface  and  extends  up  to  the  sur¬ 
face  of  the  epilayer.  The  cubic  phase  GaN  appears  almost  at  the 
same  point  in  each  groove.  It  was  consistently  formed  above  the 
defected  hexagonal  phase  regions  and  was  also  generated  above 
the  crossing  points  of  hexagonal  phase  GaN.  Fig.  5(b)  reveals 
that  phase  separations  similar  to  those  observed  in  Figs.  1  and  4 
periodically  occur  across  the  groove  direction.  That  is,  the  phase 
of  the  GaN  is  modulated  across  the  groove  direction  with  the 
period  same  as  that  of  the  V-grooves.  In  Fig.  5,  both  cubic  and 
hexagonal  phase  regions  have  boundaries  parallel  to  Si{  111}, 
extending  to  the  surface  without  noticeable  phase  intermixing. 

Selected  area  electron  diffraction  (SAED)  confirms  the  re¬ 
sults  of  high  resolution  XTEM  of  Fig.  8.  In  Fig.  9,  SAED  pat¬ 
terns  obtained  from  regions  1  through  4  which  are  indicated  by 
solid  circles  in  Fig.  5(a),  clearly  show  the  crystal  structure  varia¬ 
tion.  As  seen  in  Fig.  9(a),  region  1  is  hexagonal  phase.  Region  2 
of  Fig.  9(b)  is  also  hexagonal  but  its  c-axis  is  1 10  °  off  from  that 
of  region  1.  On  the  other  hand,  region  3  of  Fig.  9(c)  has  a  clear 
cubic  phase.  An  SAED  pattern  of  Fig.  9(d)  corresponding  to  re¬ 
gion  4  which  is  taken  at  the  GaN-Si  interface  near  the  apex  of 
a  V-groove  includes  diffraction  spots  from  the  Si  substrate  to¬ 
gether  with  those  from  a  crossing  hexagonal  phase  region.  The 
circles  of  Fig.  5(a)  correspond  to  the  aperture  size  for  SAED. 


Energy  (eV) 


Fig.  10.  Room  temperature  PL  of  the  600-nm-thick  GaN  sample  shown  in 
Fig.  5(a). 


530  570  610  650  6S0  730  770 
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Fig.  11.  Raman  scattering  spectra  of  the  600-nm-thick  GaN  sample  shown  in 
Fig.  5(a)  with  both  incident  and  scattered  polarizations  parallel  to  (a)  [110]  (x' , 
the  groove  direction),  (b)  [100]  (x),  and  (c)  [1 10]  (y').  The  dashed  line  indicates 
a  weak,  broad  peak  near  550  cm-1  in  each  spectrum. 


In  Fig.  9(a)  and  (b),  because  of  the  finite  aperture  size,  an  elec¬ 
tron  beam  partly  covers  the  adjacent  cubic  as  well  as  hexagonal 
phase  areas  and  results  in  an  overlapped  SAED  pattern  from 
these  two  different  phase  regions. 

Fig.  10  shows  a  room- temperature  PF  spectrum  obtained 
from  the  600-nm-thick  sample.  The  PF  exhibits  two  peaks  of 
comparable  intensity  at  3.389  and  3.200  eV,  which  are  close 
to  the  reported  PF  peak  energies  of  band-edge  transitions  in 
hexagonal  and  cubic  phase  GaN,  respectively  [16]— [19].  The 
line  width  of  each  PF  peak  is  about  100  meV,  comparable  to 
other  reports  [13]— [16] 

Fig.  11  shows  room-temperature  Raman  spectra  from  the 
600-nm-thick  GaN  sample  taken  in  a  backscattering  geometry 
with  the  incident  laser  beam  normal  to  Si(001)  [or  GaN(001)]. 
As  indicated  in  the  figure,  three  different  polarizations  were 
investigated:  electric  field  parallel  to  [110]  (the  groove  direc¬ 
tion)  [(a)],  [100]  [(b)],  and  [110]  [(c)].  The  polarizations  of 
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Fig.  12.  XRD  pattern  (0-20  scan)  near  the  GaN(002)  direction.  The  inset 
shows  a  wide-angle  scan  of  the  600-nm-thick  GaN  sample  shown  in  Fig.  5(a). 

both  the  incident  laser  and  the  scattered  beam  were  parallel  in 
each  case.  The  sample  was  rotated  with  respect  to  the  Si[001], 
the  backscattering  direction,  to  vary  the  geometry.  In  Fig.  11, 
the  Raman  peak  intensities  are  not  strongly  dependent  on 
polarization  direction.  There  are  three  Raman  peaks  in  all  three 
spectra  of  Fig.  11  at  ~  550  cm-1,  565.8  ±0.5  cm-1  and  735.3 
±1.1  cm-1.  The  most  probable  phonon  mode  close  to  550 
cm-1  is  a  transverse  optical  (TO)  phonon  mode  of  cubic  phase 
GaN  while  that  close  to  565.8  cm-1  is  a  high-frequency  E 2 
peak  of  hexagonal  phase  GaN  [20]— [27].  The  peak  at  735.3 
cm-1  will  be  discussed  in  the  next  section.  The  remaining 
peaks  indicated  by  arrows  including  a  peak  at  520.3±0.1  cm-1 
(not  shown  in  Fig.  1 1  but  providing  background  intensity  near 
the  E2  peak)  are  from  the  Si  substrate  [25]— [27]. 

Fig.  12  is  an  XRD  pattern  ( 0-20  scan)  from  the  600-nm- 
thick  sample  showing  a  GaN(002)  peak  at  20  =  40.18°  [25], 
[28]— [30].  The  inset  is  a  wide-angle  scan  spectrum  which  re¬ 
veals  the  Si(004)  peak  together  with  GaN(002).  The  vertical  lat¬ 
tice  constant  of  the  cubic  phase  GaN  of  Fig.  5(a)  estimated  from 
the  angular  position  of  the  GaN(002)  peak  is  4.485  A,  which  is 
approximately  0.015-0.017  A  smaller  than  the  experimentally 
measured  relaxed  vertical  lattice  constant  of  cubic  phase  GaN 
on  GaAs(001)  [30]— [32].  No  peaks  from  hexagonal  phase  GaN 
such  as  a  GaN(0002)  peak  were  observed  since  the  (0001)  is 
tilted  from  Si(001)  by  54.7  °. 

IV.  Discussion 

A.  Phase  Transition  of  GaN  on  a  Nanofaceted  Si  Surface 

Fig.  4(b)  shows  the  initiation  point  of  the  cubic  phase  GaN 
epilayer  as  it  emerges  from  the  two  opposing  hexagonal  phases 
accompanied  by  a  dense  array  of  stacking  faults.  The  large  lat¬ 
tice  mismatch  between  GaN  and  Si  leads  to  the  formation  of 
these  stacking  faults  near  the  GaN-Si  interface.  Fig.  9(d)  is  a 
SAED  pattern  taken  at  the  region  labeled  4  in  Fig.  5(a)  which 
partly  includes  the  Si  substrate  as  a  reference.  In  Fig.  9(d),  the 
diffraction  spots  from  the  hexagonal  phase  GaN  are  observed 
together  with  those  from  the  Si  substrate.  The  comparison  of 
Fig.  9(c)  with  (d)  reveals  that  the  cubic  phase  GaN  has  a  lattice 


constant  smaller  than  the  Si  substrate,  as  expected,  and  that  the 
directions  of  the  principal  crystal  axes  of  the  cubic  phase  GaN 
are  identical  to  those  of  the  Si  substrate. 

The  phase  map  of  Fig.  5(b)  shows  that  the  boundaries  be¬ 
tween  hexagonal  and  cubic  phases  are  roughly  parallel  to  the 
Si{lll}sidewalls.  The  phase  boundary  observed  in  Figs.  3(b) 
and  5(a)  is  similar  to  the  results  obtained  from  the  GaN  depo¬ 
sition  on  a  flat-bottom  groove  under  similar  growth  conditions 
(Fig.  1).  The  lower  part  of  the  cubic  phase  GaN  region  in  a  single 
V-groove  of  Fig.  5(b)  corresponds  to  the  area  above  the  bold 
dashed  lines  in  Fig.  1  [11].  The  results  of  Fig.  5,  therefore,  sug¬ 
gest  that  nanofaceting  with  V-grooves  on  a  Si(001)  substrate  is 
sufficient  to  cause  the  transition  from  hexagonal  to  cubic  phase 
GaN. 

Not  only  before  the  appearance  of  the  cubic  phase,  but  also 
at  the  very  initial  stage  of  growth,  the  crystallinity  of  the  GaN  is 
severely  degraded  by  a  large  number  of  stacking  faults  similar  to 
the  highly  defected  region  of  Fig.  1.  Near  the  Si  V-groove  sur¬ 
face,  as  seen  in  Fig.  7(a),  the  crystalline  phase  of  GaN  embedded 
between  the  cubic  phase  GaN  region  and  the  Si  V-groove  sur¬ 
face  is  not  clearly  identified  in  XTEM  because  of  the  high-den- 
sity  of  stacking  faults.  The  generation  of  a  stacking  fault  in 
the  hexagonal  phase  region  corresponds  to  the  formation  of  the 
cubic  phase  GaN  (and  vice  versa)  and  can  be  regarded  as  a  phase 
fluctuation  or  an  aperiodic  variation  between  the  two  phases. 
Following  this  growth  region  further  from  the  V-groove,  to  the 
area  shown  in  high  resolution  in  Fig.  8(a)  and  SAED  in  Fig.  9(a) 
and  (b),  however,  it  is  clear  that  this  region  evolves  into  a  hexag¬ 
onal  crystal  phase.  Additionally,  the  SAED  pattern  shown  in 
Fig.  9(d)  confirms  the  existence  of  a  hexagonal  phase  at  re¬ 
gion  4  that  includes  the  GaN-Si  interface  near  the  apex  of  the 
V-groove.  Based  on  these  observations,  the  defected  region  near 
the  Si  V-grooves  was  identified  as  hexagonal  phase  in  Fig.  5(b). 

As  indicated  by  the  arrows,  a  V-shaped  hexagonal  phase  re¬ 
gion  which  extends  from  one  V-groove  is  blocked  by  another 
one  developed  from  the  adjacent  V-groove.  This  blocking  mech¬ 
anism  needs  further  investigation  but  could  be  related  with  the 
uniformity  of  Si  V-grooves  which  affects  the  filling  speed  and  as 
a  result  the  initiation  point  of  the  phase  transition  in  individual 
V-grooves. 

The  cubic  phase  regions  observed  in  Figs.  3(b)  and  5(a)  [or 
Figs.  4(b)  and  8(b)-(d)]  are  almost  free  from  stacking  faults  and 
have  significantly  lower  defect  density  than  the  hexagonal  re¬ 
gions.  Also,  as  seen  in  Fig.  8(a),  the  number  of  defects  in  the 
hexagonal  regions  near  the  surface  is  considerably  reduced  com¬ 
pared  with  the  region  near  the  GaN-Si  interface  in  the  600-nm- 
thick  GaN  film.  This  implies  that  the  overall  crystallinity  is  grad¬ 
ually  improved  as  growth  proceeds.  Together  with  surface  pla¬ 
narization  discussed  in  Section  IV  C,  the  crystallinity  improve¬ 
ment  of  GaN  on  a  nanofaceted  Si(001)  is  very  important  for 
device  applications. 

Energetically,  the  misaligned  hexagonal  phase  GaNs  of  Fig.  5 
are  unstable.  The  orientation-dependent  nucleation  and  crystal 
structure  of  GaN  observed  in  Figs.  3  and  5  indicate  that  GaN 
is  more  stable  with  hexagonal  rather  than  cubic  phase  at  the 
initial  stage  of  growth  under  our  deposition  conditions.  This 
is  supported  by  a  theoretical  model  for  the  calculation  of  total 
crystal  energy  [8].  According  to  this  model,  the  ground  state  of 
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GaN  is  hexagonal.  Also,  the  energy  difference  between  cubic 
and  hexagonal  phases  is  9.88  meV/atom  for  GaN.  This  is  rel¬ 
atively  small  compared  with  18.41  meV/atom  for  AIN  which 
also  has  a  stable  hexagonal  phase  in  typical  epitaxial  growths. 
In  Figs.  5(a)  and  (b),  the  initiation  of  the  cubic  phase  GaN  within 
each  a  Si  V-groove  is  located  at  almost  the  same  height  from  the 
vertex.  This  implies  that  there  could  be  a  certain  critical  depo¬ 
sition  amount  above  which  the  total  crystal  energy  of  GaN  on 
a  nanofaceted  Si  surface  is  reduced  or  minimized  by  changing 
phases  rather  than  by  retaining  unstable  misaligned  hexagonal 
phases.  The  relatively  small  energy  difference  between  the  two 
phases  of  GaN  which  could  explain  more  probable  polytypism 
of  GaN  than  of  AIN  might  be  one  of  the  physical  origins  driving 
the  phase  transition  to  lower  the  total  crystal  energy. 

In  the  direction  normal  to  Si(lll)  [or  GaN(OOOl)  or 
GaN(lll)]  in  Fig.  5(a),  the  lattice  mismatch  of  the  (111)  plane 
of  the  cubic  phase  GaN  with  respect  to  the  (0001)  plane  of  the 
hexagonal  phase  GaN  is  —0.0025,  which  is  much  smaller  than 
the  lattice  mismatch  of  —0. 17  between  Si(l  1 1)  and  GaN(0001). 
Although  the  boundary  between  hexagonal  and  cubic  phase 
GaN  which  corresponds  to  a  high-energy  plane  defect  is  not 
clearly  defined,  the  small  lattice  mismatch  implies  an  elas¬ 
tically  low-cost  phase  transition  since  it  accompanies  quite 
small  variation  of  strain  energy,  and  thus  indirectly  supports  the 
transition  from  hexagonal  to  cubic  phase  with  minimization  of 
the  total  crystal  energy. 

B.  Lateral  Overgrowth — Formation  of  Nanochannels 

In  previous  work,  we  reported  the  selectivity  of  GaN  nu- 
cleation  for  Si{  1 1 1 }  [11],  [12].  The  orientation  selectivity  of 
nucleation  results  in  lateral  growth  of  GaN  from  the  Si{  1 1 1 } 
sidewalls  inside  of  each  V-groove  toward  the  adjacent  Si(001) 
facets.  The  arrows  in  Fig.  7(b)  indicate  the  directions  of  lat¬ 
eral  growth  conjectured  from  the  TEM  images  of  Figs.  4(a), 
7(b)-(c).  They  are  perpendicular  to  GaN(0001)  for  our  sample 
structure.  Thus  the  sidewall  orientation  of  the  nanochannel  on 
the  top  Si(001)  is  determined  by  the  lateral  growth  along  the  ad¬ 
ditional  [0001] .  Lateral  overgrowth  of  GaN  over  Si02  patterning 
has  been  investigated  by  several  groups  [33]-[35].  In  contrast  to 
the  prior  reports,  the  lateral  growth  observed  in  this  work  is  en¬ 
tirely  caused  by  orientation-dependent  nucleation,  as  there  is  no 
Si02  masking  layer  present  during  the  growth. 

Continued  deposition  with  lateral  growth  parallel  to  the 
[0001]  over  the  adjacent  Si(001)  stripe  facet  leads  to  coales¬ 
cence  between  the  two  GaN  epilayers  separately  grown  on  the 
Si{  111}  sidewalls  of  the  neighboring  V-grooves,  resulting  in 
the  sub- 100-nm- wide  nanochannel  on  the  Si(001)  stripe  facet 
between  V-grooves  which  has  been  observed  in  Figs.  3(b), 
5(a)  and  7(b).  Since  the  growth  inside  of  the  nanochannel  is 
stopped  from  the  moment  of  coalescence  over  a  Si(001)  stripe 
facet  and  the  lateral  size  of  the  channel  can  be  varied  with  the 
pattern  period  and  the  width  of  the  Si(001)  stripe  facet,  the 
lateral  overgrowth  on  a  V-grooved  surface  can  be  utilized  for 
the  fabrication  of  an  array  of  nanochannels. 

C.  Planarization 

From  Fig.  3,  the  filling  of  the  Si  V-grooves  by  the  GaN  depo¬ 
sition  proceeds  rapidly.  For  the  75-nm  deposition,  such  filling 
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can  occur  since  the  depth  of  the  Si  V-grooves  is  180-200  nm 
which  is  roughly  twice  the  deposition  thickness.  As  mentioned 
previously,  the  selectivity  of  GaN  nucleation  for  Si{  1 1 1 }  results 
in  negligible  deposition  on  the  Si(001)  stripe  facet  located  be¬ 
tween  the  V-grooves.  Thus,  rapid  filling  could  be  also  partially 
attributed  to  selective  nucleation  on  the  Si{  111}  sidewalls  in 
each  V-groove. 

Continued  deposition  planarizes  the  GaN  surface,  leading  to 
the  smooth  and  flat  surface  shown  in  Figs.  5(a)  and  6.  The  short 
and  shallow  trenches  on  the  GaN  surface  over  a  Si(001)  stripe 
facet  along  the  groove  direction  in  Fig.  3(a)  were  due  to  in¬ 
complete  coalescence  for  the  relatively  thin  75-nm  deposition 
and  were  simply  filled  in  for  the  600-nm  deposition  shown  in 
Fig.  6(b).  The  direction  and  spatial  confinement  of  the  cracks  in 
Fig.  3(a)  suggest  that  their  generation  may  be  related  to  stress 
relief.  Since  the  GaN  surface  of  Fig.  3  already  consists  of  cubic 
and  hexagonal  phases  for  75-nm  deposition,  the  inner  part  of 
the  cracks  is  in  a  cubic  phase  region  and  is  parallel  to  Si[l  10] 
which  is  one  of  the  cleavage  directions.  The  confinement  of  the 
cracks  to  a  single  period  of  the  grooves  in  Fig.  3(a)  could  be 
due  to  incomplete  coalescence.  In  the  coalesced  regions,  how¬ 
ever,  its  propagation  may  be  blocked  by  the  hexagonal  phase 
GaNs  which  are  located  at  the  edge  sides  of  each  groove  and 
thus  embedded  between  cubic  phase  regions  in  the  top  surface. 
As  mentioned  above,  such  nano-cracks  are  filled  by  continued 
growth. 

The  surface  planarization  observed  in  Fig.  5(a)  will  be  very 
important  for  the  subsequent  growth  of  a  device  structure  on  top 
of  the  GaN  grown  on  a  nanofaceted  Si  surface.  Unlike  the  GaN 
shown  in  Fig.  1,  the  planarized  GaN  surface  of  Fig.  5  does  not 
include  GaN(llOl).  The  absence  of  GaN(llOl)  facets  in  the 
thick  GaN  layer  is  due  to  the  coalescence  of  GaN  over  a  Si(001) 
stripe  facet.  In  our  sample  structure,  GaN(3304)  is  the  closest 
orientation  which  is  parallel  to  GaN(001).  Since  a  (0001)  plane 
of  hexagonal  phase  GaN  is  parallel  to  a  Si(l  1 1)  sidewall  of  the 
V-groove,  there  is  an  angle  of  54.7  °  between  GaN(001)  and 
GaN(0001).  Then,  the  angle  between  GaN(3304)  and  GaN(001) 
is  only  0.12°.  Thus,  the  crystal  structure  of  the  resulting  GaN 
surface  of  Fig.  5(a)  is  modulated  by  cubic  [(001)]  and  hexagonal 
[(3304)]  phases  with  the  period  of  355  nm  perpendicular  to  the 
groove  direction.  As  seen  in  the  PL  of  Fig.  9,  this  orientational 
modulation  also  implies  a  nanoscale  bandgap  modulation  at  the 
top  surface  due  to  the  electronic  differences  between  hexagonal 
and  cubic  phase  GaN. 

D.  Stress  Measurements 

The  PL  spectrum  of  Fig.  10  supports  the  TEM  result  of  the 
coexistence  of  two  different  phases  of  GaN  in  a  single  layer. 
Together  with  the  Raman  scattering  results  shown  in  Fig.  11 
and  the  XRD  studies  of  Fig.  12,  PL  provides  information  on  the 
strain  effects  for  the  phase-modulated  GaN-Si  heterostructure. 
The  PL  peak  energy  at  3.389  eV  is  very  close  to  bandgap  of 
hexagonal  phase  GaN  but  is  approximately  40  and  15  meV 
lower  than  the  reported  room-temperature  band  edge  transi¬ 
tions  of  homoepitaxially  grown  hexagonal  phase  GaN  and  GaN 
grown  on  wide-area  Si(lll),  respectively  [16],  [17].  For  the 
comparison  of  our  experimental  results  with  other  reported 
data,  several  factors  such  as  substrate  materials,  buffer  layers, 
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and  the  thickness  of  the  GaN  epilayer  which  affect  PL  peak 
energy  by  strain  effect  must  be  considered. 

In  the  Raman  spectra  of  Fig.  1 1,  the  position  of  the  high-fre¬ 
quency  E2  peak  from  hexagonal  phase  GaN  is  at  565.8  cm-1 
which  shows  approximately  a  negative  frequency  shift  of  2  - 
3  cm-1  from  that  measured  in  bulk  GaN,  including  an  instru¬ 
mental  error  of  ±0.5  cm-1  [36],  [37].  Several  results  for  hexag¬ 
onal  phase  GaN  on  Si(l  1 1)  also  have  reported  a  2-3  cm-1  neg¬ 
ative  shift  of  the  E2  peak,  similar  to  our  results  [25]-[27].  As¬ 
suming  a  linear  strain-Raman  shift  relationship 

A  uj  =  Ka  ||  (1) 

where  A  uj,  K ,  and  cry  are  the  Raman  frequency 
shift  for  a  given  phonon  mode  in  cm-1,  the  linear 
stress  coefficient  (=  6.2  cm-1 /GPa)  and  the  in-plane  biaxial 
stress  in  GPa,  respectively  [38],  the  negative  frequency  shift  of 
the  E2  peak  implies  a  tensile  stress  of  about  0.35-0.5  GPa  in 
the  hexagonal  phase  GaN,  as  expected  from  the  difference  of 
lattice  constant  between  GaN  and  Si. 

This  finite  biaxial  tensile  stress  of  the  hexagonal  phase  GaN 
extracted  from  the  Raman  scattering  suggests  that  the  red  shift 
of  the  PL  peak  energy  at  3.389  eV  may  be  partly  due  to  strain 
effects.  From  the  experimental  linear  approximation  between 
PL  peak  energy  of  hexagonal  phase  GaN  E pl  versus  biaxial 
stress  measured  as 

«  20  meV/GPa  (2) 

da\\ 

the  red  shift  of  the  PL  peak  energy  at  0.35-0.5  GPa  is  estimated 
as  7-10  meV  [17],  [39].  The  difference  in  PL  peak  energy  of 
Fig.  9  (~40  meV)  from  that  of  homoepitaxially  grown  GaN  and 
its  comparison  with  the  reported  PL  peak  energy  of  hexagonal 
phase  GaN  on  wide-area  Si(lll)  (~15meV  )  implies  that  the 
PL  peak  at  3.389  eV  might  be  due  to  the  transition  of  excitons 
bound  to  some  defects  or  acceptor  impurities  such  as  carbon 
which  is  a  typical  impurity  in  epitaxial  growth.  In  considera¬ 
tion  of  the  nanofaceting  fabrication  process,  residual  contami¬ 
nation  by  carbon  and  other  impurities  with  a  level  higher  than 
for  growth  of  GaN  on  a  unprocessed  substrate  is  not  unexpected. 
Also,  the  hexagonal  phase  GaN  near  the  Si  V-grooves  is  de¬ 
graded  by  a  large  number  of  stacking  faults.  Thus,  the  PL  peak 
shift  can  be  qualitatively  understood  as  related  to  the  presence 
of  crystalline  defects  and  impurities. 

In  case  of  cubic  phase  GaN,  homoepitaxy  is  not  available 
and  cubic  phase  GaN  on  GaAs(001)  is  the  closest  to  that  ob¬ 
served  in  Fig.  5(a).  In  contrast  to  the  experimental  results  for  the 
higher  energy  PL  peak,  the  peak  at  3.200  eV  in  Fig.  10  is  about 
10-20  meV  lower  than  the  room-temperature  PL  peak  energy 
of  cubic  phase  GaN  on  GaAs(001)  [18],  [19].  In  Raman  scat¬ 
tering,  cubic  phase  GaN  on  GaAs(001)  exhibits  peaks  from  TO 
and  LO  phonon  modes  which  appear  near  555  and  740  cm-1 ,  re¬ 
spectively  [20]-[23].  The  TO  phonon  mode  of  cubic  phase  GaN 
is  forbidden  under  the  given  backscattering  but  may  appear  by 
multiple  scattering  in  our  sample.  As  mentioned  in  the  previous 
section,  the  weak  and  broad  peak  around  550  cm-1  is  very  close 
to  the  TO  phonon  mode  of  cubic  phase  GaN  on  GaAs(001).  This 


is  a  —5  cm-1  shift  and  implies  the  same  shift  for  the  longitu¬ 
dinal  optical  (LO)  phonon  mode  at  ~  735  cm-1  if  it  is  assumed 
that  both  TO  and  LO  phonon  modes  show  the  similar  frequency 
shift  under  a  given  stress  [40],  [41].  However,  the  peak  posi¬ 
tion  ~  735  cm-1  also  coincides  with  that  of  the  Ai-LO  phonon 
mode  of  hexagonal  phase  GaN  on  Si(l  1 1)  [26],  [27].  Thus,  there 
might  be  an  accidental  overlap  of  the  two  phonon  modes,  one 
from  the  cubic  phase  and  the  other  from  the  hexagonal  phase 
GaN  at  ~  735  cm-1  in  the  Raman  spectra  of  Fig.  10. 

From  (1),  the  negative  frequency  shift  of  ~  5  cm-1  roughly 
corresponds  to  a  tensile  stress  of  ~  1  GPa  for  cubic  phase  GaN 
on  GaAs(001).  For  a  more  accurate  comparison,  the  bulk  elastic 
properties  of  cubic  GaN  must  be  considered.  Nonetheless,  it  is 
evident  that  the  negative  frequency  shift  implies  tensile  stress  of 
cubic-phase  GaN  on  a  nanofaceted  Si  surface  which  is  greater 
than  that  on  GaAs(001).  The  vertical  lattice  constant  calculated 
from  the  XRD  pattern  of  Fig.  12  and  its  comparison  with  that  of 
cubic  phase  GaN  on  GaAs(001)  also  supports  the  presence  of  a 
tensile  stress  for  the  cubic  phase  GaN  of  Fig.  5(a)  higher  than 
that  on  GaAs(001). 

There  is  a  controversy  about  a  type  of  stress  in  cubic  phase 
GaN  on  GaAs(001)  [30].  Thus,  it  should  be  emphasized  that 
the  stress  mentioned  above  is  a  result  of  relative  comparison. 
The  tensile  stress  of  ~  1  GPa  can  explain  the  red  shift  of  about 
10-20  meV  in  the  PL  peak  at  3.200  eV  of  Fig.  10  from  that  of 
cubic  phase  GaN  on  GaAs(001)  using  (2)  as  a  rough  approxima¬ 
tion.  There  is  about  a  100-nm-thick  hexagonal  phase  GaN  layer 
that  forms  a  buffer  layer  between  the  Si  substrate  and  the  cubic 
phase  GaN  as  seen  in  Fig.  5(a).  Thus,  the  contamination  and  Si 
out-diffusion  issues  are  less  severe  in  the  cubic  phase  than  the 
hexagonal  phase  GaN.  Also,  the  cubic  phase  GaN  has  a  rela¬ 
tively  low  density  of  crystal  defects  in  TEM  compared  with  the 
hexagonal  phase  that  is  defected  with  stacking  faults.  Then,  the 
PL  peak  at  3.200  eV  in  Fig.  10  is  very  close  to  the  transition 
energy  of  donor  bound  excitons  of  cubic  phase  GaN. 

Understanding  of  strain  effect  of  a  cubic  phase  formed  on  a 
nanofaceted  Si  surface  is  very  important  for  its  device  applica¬ 
tions.  According  to  Figs.  1 1  and  12,  the  cubic  phase  GaN  formed 
on  Si  nanoscale  V-grooves  may  be  in  higher  tensile  stress  than 
that  on  wide-area  GaAs(001).  This,  however,  contradicts  the 
simplistic  prediction  based  on  lattice  mismatch  (e.g.,  GaAs  has 
a  larger  mismatch  with  GaN  than  does  Si).  Although  the  cubic 
phase  GaN  of  Fig.  5(a)  has  the  same  crystal  orientation  as  the  Si 
substrate,  it  is  not  equivalent  to  cubic  phase  GaN  on  wide-area 
Si(001)  or  GaAs(001)  since  there  is  a  misaligned  hexagonal 
phase  GaN  between  them.  Also,  thermal  stress  on  GaN  due  to 
the  difference  of  thermal  expansion  coefficient  between  GaAs 
and  Si  must  be  considered  [42] .  As  previously  mentioned,  how¬ 
ever,  all  physical  parameters  necessary  for  the  analysis  of  cubic 
phase  GaN  are  not  available  yet.  Further  research  is  required  to 
elucidate  the  overall  stress  effects  for  the  phase-modulated  GaN 
shown  in  Fig.  5(a). 

E.  Application  to  Monolithic  Integration  on  Si(001 ) 

Fig.  5  is,  to  our  knowledge,  the  first  observation  of  periodic 
spatial  separation  of  cubic  phase  from  hexagonal  phase  GaN 
ever  grown  on  Si(001)  and  GaAs(001).  The  phase  map  of 
Fig.  5(b)  suggest  that  the  phase  modulation  or  separation  of 
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GaN  on  a  Si(001)  substrate  can  be  controlled  by  the  facet 
geometry  on  the  substrate  surface. 

Growth  of  a  large-area  single  phase  region  of  cubic  phase 
GaN  on  Si(001)  would  have  important  device  possibilities. 
While  the  two  different  phases  are  periodically  separated  at 
the  nanoscale  range,  as  seen  in  Fig.  5,  this  may  not  be  useful 
in  practical  applications  which  require  a  single  phase  GaN 
over  a  large  area  since  both  phases  are  still  mixed  together  on 
micron  and  larger  scales.  Variation  of  the  duty  cycle  and  pattern 
period  of  V-grooves  provides  a  technique  for  controlling  the 
phase  distribution.  It  has  been  confirmed  that  the  cubic  phase  is 
formed  in  each  groove  only  for  75 -nm  deposition  where  coales¬ 
cence  does  not  completely  proceed.  By  increasing  the  pattern 
period  to  micron  range  while  keeping  the  width  of  a  V-groove 
in  nanometer  range  or  by  fabricating  a  single  V-groove  on  a 
desired  area,  coalescence  over  the  Si(001)  stripe  facets  can  be 
delayed  or  avoided  while  phase  transition  is  activated  in  the 
V-groove  at  the  initial  stage  of  growth.  Then,  as  schematically 
shown  in  Fig.  5(c)  which  is  based  on  the  results  of  Fig.  3,  a  GaN 
stripe-type  epilayer  grown  along  a  single  V-groove  dominantly 
can  have  a  cubic  phase  top  surface  with  a  micron-order  width 
that  is  enough  for  the  fabrication  of  devices. 

As  seen  in  Fig.  5(c),  a  single  hexagonal  phase  GaN  epilayer 
is  also  available  on  a  limited  area  of  Si(001)  but  is  not  suit¬ 
able  for  device  application  because  its  top  surface  is  not  parallel 
to  Si(001).  For  compatibility  with  conventional  device  tech¬ 
nology,  crystal  growth  techniques  must  provide  planar  surface 
epilayers.  A  planar  top  surface  of  the  cubic  phase  GaN  par¬ 
allel  to  Si(001)  thus  highly  favorable  for  device  fabrication  on 
nanofaceted  Si  surface.  It  has  been  reported  that  cubic  phase 
AlGaN  (or  AIN)  and  InGaN  are  also  available  on  (001)  type  ori¬ 
entation  [43]— [45].  On  the  planar  surface  of  cubic  phase  GaN  on 
a  single  V-groove  shown  in  Fig.  5(c),  therefore,  it  is  possible  to 
grow  various  hetero structures  composed  of  GaN  and  its  related 
compounds  in  cubic  phase,  for  monolithic  integration  of  micro- 
and  optoelectronic  GaN-based  devices  on  Si(001). 

In  this  work,  we  have  investigated  spatial  phase  modulation 
of  GaN  on  a  nanofaceted  Si(100)  substrate  with  low-cost  but 
highly  reliable  conventional  process  techniques  such  as  large- 
area  i- line  IL,  AWE,  and  MOVPE.  Further  research  is  required 
to  understand  the  relation  between  facet  geometry/physical  di¬ 
mension  and  the  relative  size  of  the  cubic  phase  regions  and 
associated  strain  effects  of  GaN  on  a  nanofaceted  Si  surface. 
Also,  the  optical/electronic  properties  of  spatially  phase-mod¬ 
ulated  GaN  need  more  investigation  for  future  device  applica¬ 
tions. 

V.  Summary  and  Conclusions 

Spatial  phase  modulation  of  GaN  grown  on  a  nanofaceted  Si 
surface  has  been  investigated.  On  a  355-nm  period,  1-D  array 
of  V-grooves  fabricated  in  a  Si(001)  substrate,  selectivity  of 
the  nucleation  and  incorporation  of  hexagonal  phase  GaN  on 
Si(lll)  has  been  observed  in  MOVPE  growth.  Orientation-de¬ 
pendent  nucleation  results  in  lateral  growth  from  Si{  1 1 1 }  facets 
toward  neighboring  top  Si(001)  facets  with  the  formation  of 
nanochannels  between  them,  and  eventually  coalescence.  For 
the  75 -nm  deposition,  the  entire  GaN  film  is  almost  planarized 
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to  a  smooth  surface  but  with  a  complex,  spatially  separated 
phase  structure  resulting  from  the  periodically  misaligned 
c-axes  across  the  groove  direction.  This  structural  instability 
induces  a  phase  transition  from  hexagonal  to  cubic  phase  on 
the  nanofaceted  Si  surface,  which  results  in  spatial  phase  sepa¬ 
ration  of  GaN.  Based  on  these  results,  highly  predictable  phase 
modulation  of  GaN  has  been  demonstrated  on  a  V-grooved  Si 
surface  with  the  600-nm  deposition.  PL,  Raman  scattering,  and 
XRD  show  that  both  the  hexagonal  and  cubic  phase  GaN  are 
in  tensile  stress.  Our  experimental  results  therefore  suggest  that 
nanofaceting  can  be  used  for  the  formation  of  cubic  phase  GaN 
on  limited  areas  of  a  Si  (001)  substrate  through  phase  transition 
for  monolithic  integration  of  GaN  devices  on  Si(001). 
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Abstract — A  unified  analytical  treatment  of  the  radiation 
from  an  electric  dipole  of  arbitrary  orientation  embedded  at  an 
arbitrary  location  within  a  symmetrically  clad  dielectric  slab 
is  presented.  Both  the  emission  into  three-dimensional  (3-D) 
radiation  modes,  corresponding  to  emission  within  the  critical 
angle  escape  cone  within  the  dielectric  slab,  and  into  the  two-di¬ 
mensional  (2-D)  waveguide  modes  are  evaluated  from  a  single 
calculation.  The  model  is  valid  for  arbitrary  dielectric  contrast 
between  the  slab  and  the  cladding.  The  mathematical  approach 
uses  well-known  complex  analysis  techniques:  the  3-D  radiation  is 
described  by  a  steepest  descents  integration  around  branch  cuts 
while  the  2-D  waveguide  modes  correspond  to  simple  poles.  The 
division  of  the  radiated  power  between  the  3-D  and  2-D  modes  is 
evaluated  across  the  entire  range  from  small  dielectric  contrast 
appropriate  to  diode  lasers  (<1.1)  to  the  very  large  dielectric 
contrast  of  free-standing  semiconductor  slabs  (~  12-19).  Both 
enhancement  and  suppression,  depending  on  position,  slab  width, 
dielectric  contrast,  and  wavelength,  of  the  total  radiated  power 
in  comparison  with  that  in  an  unbounded  dielectric  medium 
are  found  for  slab  widths  on  the  order  of  a  wavelength  with  a 
maximum  enhancement  of  ~30%  for  these  one-dimensional 
Fabry-Perot  structures.  For  thicker  slabs,  the  total  radiation  is 
almost  constant  and  equal  to  that  in  the  unbounded  medium  for 
low  dielectric  contrast  while  still  exhibiting  some  modulation  as 
increasing  thickness  allows  additional  waveguide  modes. 

Index  Terms — Dielectric  waveguides,  dipole  radiation,  micro¬ 
cavity. 


I.  Introduction 

THE  DISTRIBUTION  of  radiation  from  a  dipole  near  a 
planar  interface  is  a  classic  antenna  problem  [1]— [3].  As 
is  very  well  known  [4],  [5],  the  addition  of  a  second  dielectric 
interface,  e.g.,  a  dipole  embedded  in  a  dielectric  slab,  adds  sig¬ 
nificant  complexity  to  the  problem,  largely  because  the  radia¬ 
tion  is  now  distributed  between  three-dimensional  (3-D)  radia¬ 
tion  modes  as  in  the  single  interface  case  and  two-dimensional 
(2-D)  waveguide  modes  supported  by  the  dielectric  slab. 

Despite  this  long  history,  the  problem  is  increasingly  impor¬ 
tant  today  with  developments  such  as  sophisticated  diode  laser 
structures — both  edge  emitters  and  vertical  microcavity  lasers 
[6],  enhanced  LEDs  [7],  and  2-D  photonic  bandgap  lasers  [8]. 
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Two  general  classes  of  treatment  have  appeared  in  the  literature. 
The  first  class  [9]— [12]  provides  an  analytic  treatment  while  the 
second  provides  a  complete  numerical  solution  of  Maxwell’s 
equations  in  an  arbitrary  geometry  using  finite  element  or  other 
numerical  techniques  [13],  [14].  The  analytic  results  have  relied 
largely  on  a  somewhat  ad  hoc  combination  of  results  from  3-D 
radiation  and  2-D  waveguide  mode  calculations.  Ho  et  al.  in  a 
very  complete  chapter  [15]  obtained  results  very  similar  to  those 
presented  here;  they  do  not  present  closed  form  analytic  results 
for  all  of  the  radiation  components  nor  allow  the  dipole  to  be 
arbitrarily  located  within  the  dielectric  slab.  The  numerical  cal¬ 
culations,  while  applicable  to  arbitrary  structures,  are  difficult  to 
generalize  to  structures  not  presented  in  detail  without  recoding 
the  problem  and  do  not  serve  to  build  up  physical  intuition  as 
quickly  as  is  possible  with  analytic  results. 

The  purpose  of  this  paper  is  to  present  simple,  closed-form, 
analytic  expressions  for  the  emission  into  both  3-D  continuum 
radiation  and  2-D  waveguide  modes  for  an  electric  dipole  of 
arbitrary  location  and  orientation  within  a  symmetrically  clad 
dielectric  slab.  These  expressions  are  derived  from  a  common 
calculation  based  on  mathematical  apparatus  extended  from 
the  seminal  work  of  Sommerfeld  [1].  In  this  formulation, 
the  radiation  from  a  dipole  is  expanded  into  plane  waves 
and  the  angularly-dependent  Fresnel  boundary  conditions 
for  reflection/refraction  at  a  dielectric  interface  are  applied 
to  each  component  plane  wave.  The  various  emission  modes 
appear  as  singularities  in  the  inverse  transform  in  the  complex 
plane:  the  3-D  modes  as  branch  cuts  that  are  evaluated  by  a 
steepest  descents  method;  the  2-D  modes  as  poles  that  are 
evaluated  by  the  calculus  of  residues.  The  advantage  of  these 
analytic  solutions  is  that  they  are  readily  evaluated  for  arbitrary 
dielectric  constants  ranging  from  the  very  large  dielectric 
discontinuities  of  microcavity  lasers  to  the  much  more  modest 
discontinuities  of,  e.g.,  double-heterostructure  edge-emitting 
lasers.  The  model  is  straightforwardly  extensible  to  more  com¬ 
plex  layered  structures  (e.g.,  asymmetric  slabs,  vertical-cavity 
lasers,  periodic  grating  structures),  although  this  extension  is 
not  pursued  in  this  initial  report.  Examples  of  the  results  for  a 
variety  of  dielectric  slabs  are  discussed  with  particular  attention 
to  the  distribution  of  radiated  energy  between  the  3-D  and  2-D 
modes  and  to  the  comparison  of  the  total  radiated  power  with 
that  of  a  dipole  in  an  unbounded  dielectric  material. 

For  a  dipole  in  a  dielectric  slab,  both  macroscopic  electro¬ 
magnetic  effects  arising  from  the  modified  density  of  states  of 
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the  electromagnetic  field  and  microscopic  Clausius-Mossotti 
local-field  effects  need  to  be  considered.  The  former  are  the  sub¬ 
ject  of  this  paper,  the  latter  are  consistently  normalized  out  of 
the  results  by  comparing  the  radiation  patterns  and  intensities 
for  the  slab  to  those  in  an  unbounded  medium  of  the  same  di¬ 
electric  constant. 

As  is  well  known,  for  very  large  dielectric  discontinuities  [for 
example,  a  Si  (ei/eo  =  r±  ~  18)  or  GaAs  (r±  ~  12)  slab  sur¬ 
rounded  by  air  (^2  ~  1)]  and  extremely  thin  slabs  (~A/tt^2) 
where  A  is  the  free-space  wavelength  and  ei  the  slab  dielectric 
constant)  the  total  radiated  energy  can  be  either  larger  or  smaller 
than  that  in  bulk  depending  on  the  precise  relationship  of  slab 
width  L,  A,  and  ei  and  the  location  of  the  dipole  within  the  slab. 
When  considered  in  a  cavity  quantum  electrodynamic  context, 
this  corresponds  to  cavity  enhancement  or  suppression  of  the 
spontaneous  radiative  lifetime  [5],  [9]— [13] .  We  show  that  the 
maximum  enhancement  for  a  simple  slab  geometry  saturates  as 
a  function  of  dielectric  contrast  at  about  15%  for  dipoles  ori¬ 
ented  perpendicular  and  30%  for  dipoles  oriented  parallel  to 
the  slab  with  the  optimal  transverse  location  being  the  center 
of  the  slab  for  slab  widths  of  order  A / .  Our  motivation  for 
attacking  this  problem  was  unexplained,  strongly  enhanced  op¬ 
tical-phonon  Raman  scattering  results  for  nanoscale  Si  gratings 
[16];  therefore,  many  of  the  high  dielectric  constant  numerical 
examples  are  given  for  parameters  appropriate  to  these  visible 
experiments.  The  application  of  this  theory  to  the  Raman  exper¬ 
iments  will  be  discussed  in  a  subsequent  publication. 


P  = 


fcjUtbl2 

327r277o^o^2 


sin2  9 


J  sRR2<m  = 


fcpRe(nj)b|2 

12-ivqoel 


(4) 


where 

Sr  radial  Poynting  vector; 

P  total  radiated  power  from  the  dipole; 

R  —  koR  dimensionless  distance  scaled  to  the  free-space 
wavelength. 

To  contain  the  complexity  of  the  equations,  only  the  radiation 
fields  (oc  1  /R)  are  presented.  All  near-field  effects  are,  nonethe¬ 
less,  included  in  the  analysis. 

It  is  useful  to  investigate  the  3-D  Fourier  transform  of  the 
impulse  response,  given  by 
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where  K  is  the  transform  coordinate  and  K  —  \K\.  This  ex¬ 
pression  can  be  put  into  a  more  suitable  form  for  evaluating  the 
boundary  conditions  by  carrying  out  the  inverse  transform  in  the 
z  direction  (perpendicular  to  the  plane  surfaces)  by  the  calculus 
of  residues  and  expressing  the  remaining  integral  of  the  trans¬ 
form  in  cylindrical  coordinates  [2] 


II.  Hertz  Vector  Formalism 

Following  Sommerfeld  [1],  we  formulate  the  problem  in 
terms  of  the  Hertz  vector  (II)  from  which  the  electric  and 
magnetic  fields  follow: 

E  —  V(V  •  n)  +  k^RiU.  (1) 

H  =  -  i—  Vx  n=  -^-X7xE  (2) 

Vo  rjoko 

where 

kl  =  uj2R0£o 

K>i  =  nf  =  £i/£0 

and 

Vo  =  \J  Mo/^o- 

The  impulse  response  for  the  Hertz  vector  in  an  unbounded 
medium  for  a  dipole  located  at  the  origin  and  oriented  along  the 
z  axis  is  given  by 


1  r  °°  k 

MR)  =  -  /  —  e-^J0(\p)\d\ 

Ki  Jo  li 

1  r°° 

=  5-  /  -e-^Hl(\p)\d\  (6) 

2  Ri  J —  00  Vi 

where  A  is  the  radial  transform  coordinate,  p  =  kop  and  z  —  k^z 
are  the  dimensionless  radial  and  vertical  spatial  coordinates,  and 

7 i  =  a/A2  -  Ki  — >  -ini  (7) 

A — ^0 

and  the  sign  of  the  square  root  is  chosen  to  ensure  Re  (7^)  >  0 
for  convergence  of  (6).  Jo  is  the  zeroth-order  Bessel  function 
and  iTo  is  the  Hankel  function  of  the  first  kind  of  zeroth  order. 
The  second  form  of  this  expression  will  be  useful  for  evaluation 
of  the  complex  integrations  since  the  integration  extends  across 
the  entire  A  axis. 

III.  Vertical  Dipole 
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where  p  is  the  dipole  moment.  Applying  (1)  and  (2)  gives  the 
familiar  dipole  radiation  fields 
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Initially,  the  analysis  will  be  restricted  to  a  vertical  dipole 
because  the  mathematics  is  slightly  simpler  since  a  one-com¬ 
ponent  Hertz  vector  suffices  whereas  a  two-component  Hertz 
vector  is  required  for  a  horizontal  dipole.  The  results  for  a  hor¬ 
izontal  dipole  will  be  presented  below.  An  arbitrary  orientation 
can  always  be  reduced  to  a  superposition  of  the  appropriate  ver¬ 
tical  and  horizontal  components.  The  geometry  for  a  vertical 
dipole  (relative  to  the  horizontal  slab)  is  shown  in  Fig.  1.  A 
dipole  is  located  at  a  vertical  distance  a  above  the  origin  within 
an  infinitely  long  slab  of  width  L  and  dielectric  constant  £]_. 


BRUECK:  RADIATION  FROM  A  DIPOLE  EMBEDDED  IN  A  DIELECTRIC  SLAB 


901 


II  fC2 


r  a; 


v  n 

A* 


L/2 

a 

*x  =  0 


-L/2 


IIIat2 


Fig.  1.  Geometry  of  the  problem:  a  dipole  oriented  either  vertically  or 
horizontally  is  located  at  a  height  a  from  the  center  of  a  slab  of  width  L  and 
relative  dielectric  constant  i<  L  clad  on  both  sides  with  semi-infinite  materials  of 
relative  dielectric  constant  k  2  . 


The  slab  is  adjacent  to  two  semi-infinite  half  planes  of  dielec¬ 
tric  constant  £2.  In  this  situation,  the  Hertz  vector  has  only  a 
z  component  [2],  [17] 

n*  =  </>% 


with  the  boundary  conditions 


—  K2  4>2z 
dz  dz  ' 


(8) 


In  the  semi-infinite  cladding  media  above  and  below  the  slab, 
the  trial  solutions  are  homogeneous  while  inside  the  slab  there 
are  both  inhomogeneous  and  homogenous  contributions  given 
by 
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Applying  the  boundary  conditions  (8)  at  the  top  and  bottom 
edges  of  the  slab  in  the  usual  way  results  in  (10)  as  shown  at  the 
bottom  of  the  page  where 


RP  - 


1  —  a 
1  -\-  ex 


with  a  — 


*772 

«27l* 


(ID 


These  expressions  have  very  simple  physical  interpretations.  Rp 
is  the  magnetic  field  reflectivity  for  a  TM-polarized  plane  wave 


incident  from  region  I  onto  region  II.  The  prefactor  of  (1  —  Rp) 
accounts  for  the  transmission  into  region  II.  The  two  terms  in  the 
numerator  arise  from  1)  emission  toward  the  top  interface  and 
2)  emission  toward  the  bottom  interface  that  is  reflected  into  the 
upward  direction  at  the  bottom  interface.  The  denominator  ac¬ 
counts  for  multiple  reflections  in  this  planar  cavity.  It  is  straight¬ 
forward  to  show  that  the  poles  of  the  integrand  (zeros  of  the  de¬ 
nominator)  correspond  to  the  (even  and  odd)  bound  TM-wave- 
guide  modes.  The  coupling  to  these  modes  will  be  evaluated 
by  the  residues  at  the  poles  when  the  integration  is  carried  out. 
There  are  branch  points  in  the  A  plane  as  a  result  of  the  multi¬ 
valued  7*’s.  The  integration  around  the  branch  cuts  corresponds 
to  the  3-D  radiation  modes.  As  noted  above,  a  significant  virtue 
of  this  formulation  is  that  both  the  2-D  waveguide  modes  and 
the  3-D  radiation  fields  are  obtained  consistently  from  a  single 
calculation.  Before  considering  the  evaluation  of  the  integrals  in 
(10),  the  equivalent  results  for  a  horizontal  dipole  will  be  pre¬ 
sented. 


IV.  Horizontal  Dipole 


A  two-component  Hertz  vector  is  required  [1],  [2]  to  describe 
the  fields  of  a  horizontal  dipole: 


IF  =cPl.ex  +  <-Pl£ 


with  the  boundary  conditions 
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_  dLl 
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(12) 


The  x  and  z  components  are  coupled  by  the  last  boundary  con¬ 
dition,  so  the  approach  is  to  solve  first  for  the  x  components 
using  the  first  two  boundary  conditions  and  then  use  that  solu¬ 
tion  to  evaluate  the  z  components.  The  inhomogeneous  driving 
term  appears  only  in  the  x  component  inside  the  slab.  The  result 
for  the  x  components  is  shown  in  (13)  at  the  bottom  of  the  next 
page  where 
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These  expressions  are  very  similar  to  the  vertical  dipole  case. 
Rp  is  replaced  with  Rs  throughout  where  Rs  is  the  electric  field 
reflectivity  for  a  TE-polarized  plane  wave  incident  on  the  dielec¬ 
tric  interface  from  medium  I.  £2  now  appears  in  the  expressions 
for  the  Hertz  vector  outside  of  the  slab  rather  than  gq  as  was  the 
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case  for  the  vertical  dipole.  The  poles  in  these  integrands  now 
correspond  to  the  TE  waveguide  modes.  These  results  can  now 
be  used  to  evaluate  the  z  components  of  the  Hertz  vectors  as 
shown  in  (15)  at  the  bottom  of  the  page. 

These  expressions  now  involve  both  the  TE  and  TM  poles. 
Fortunately,  as  is  shown  below,  much  of  the  algebraic  com¬ 
plexity  of  these  expressions  is  simplified  when  the  residues  are 
evaluated. 

V.  Evaluation  of  the  Inverse  Transform 

The  formidable  apparatus  of  complex  analysis  can  now  be 
used  on  these  integral  solutions.  The  manipulations  are  similar 
for  both  vertical  and  horizontal  dipole  cases  and  are  presented  in 
great  detail  for  a  single  interface  in  [2] .  Only  a  very  abbreviated 
presentation  for  the  vertical  dipole  is  given.  To  avoid  undue  rep¬ 
etition,  the  results  for  the  horizontal  dipole  are  presented  without 
detailed  derivation. 

The  first  step  is  to  extend  the  path  of  integration  along  the 
entire  Re(A)  axis  by  shifting  to  the  Hankel  function  form  [cf. 
(6)]  of  the  integrals.  The  singularities  of  the  integrand  in  the 
A  plane  are  illustrated  in  Fig.  2.  As  noted  above,  there  is  a  pole 
of  the  integrand  corresponding  to  the  lowest  order  bound  mode 
(TM  for  a  vertical  dipole).  For  simplicity  of  illustration,  it  is 
assumed  that  the  slab  is  sufficiently  thin  (L  <  A/|ni|)  so  that 
only  the  zeroth-order  mode  is  supported.  The  analytic  results 
are  presented  for  an  arbitrary  slab  thickness.  As  a  result  of  the 
multivalued  square  root  functions,  there  are  branch  points  at  A  = 
zbni  and  dm2.  The  branch  cuts  are  chosen  to  extend  parallel  to 
the  Im(A)  axis  to  ±00.  There  is  also  a  branch  point  at  A  =  0 
corresponding  to  the  divergence  of  the  Hankel  function  at  the 
origin;  the  corresponding  branch  cut  extends  to  —00  along  the 
negative  Im(A)  axis  and  plays  no  role  in  the  evaluation  of  the 
radiation  fields. 


Fig.  2.  Singularities  and  paths  of  integration  in  the  complex  A  plane  for  the 
inverse  transform.  The  branch  points  correspond  to  the  square  roots  in  n± 
and  71 2  and  to  the  properties  of  the  Hankel  function.  The  poles  correspond  to 
the  waveguide  modes.  The  integration  path  labeled  C0  is  the  original  inverse 
transform  path,  the  curves  labeled  C±  —  C3  are  the  steepest  descents  paths  for 
evaluation  of  the  inverse  transform  for  external  angles  9  of  0°  (perpendicular 
to  the  slab),  45°,  and  90°  (along  the  slab). 

A.  Radiation  Modes  for  Vertical  Dipole 

Also  shown  in  Fig.  2  are  several  integration  paths  that  are 
convenient  for  the  evaluation  of  the  inverse  transform.  The  solid 
path,  Co,  is  the  undeformed  path  that  corresponds  to  the  deriva¬ 
tion  of  the  inverse  transform.  It  passes  below  the  branch  points 
at  A  =  ni  and  712,  above  the  branch  points  at  A  =  —  m,  —  712 
as  well  as  above  the  branch  point  at  the  origin.  A  small  imag¬ 
inary  part  to  71 2  has  been  assumed  in  drawing  this  figure;  this 
imaginary  part  is  set  to  zero  in  the  final  result.  A  steepest  de¬ 
scents  evaluation,  valid  for  large  distances  from  the  source,  pro¬ 
vides  an  analytic  approach  to  integration  along  the  branch  cuts. 
This  approach  allows  evaluation  of  the  radiation  fields  of  the 
dipole  a  distances  greater  than  several  wavelengths  from  the 
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source;  however,  the  formulation  of  the  problem  correctly  in¬ 
cludes  all  near-field  effects  at  the  slab  boundaries  which  can  be 
much  less  than  a  wavelength  from  the  dipole.  The  appropriate 
integration  path  for  this  evaluation  depends  on  the  external  polar 
angle,  9 ,  for  which  the  integration  is  performed.  These  paths  are 
shown  for  three  particular  values  of  9  in  the  figure  and  for  eval¬ 
uation  of  the  integral  around  the  n2  branch  cut,  corresponding 
to  the  radiation  fields  in  medium  II.  The  path  for  evaluation  of 
the  radiation  fields  normal  to  the  slab,  9  —  0,  is  given  by  the 
(dashed)  path  C± .  The  path  for  an  intermediate  angle  9  —  7r/4 
is  denoted  (dash-dot)  C2.  Finally,  the  path  for  evaluation  of  the 
fields  very  close  to  the  slab  9  —  tt/2  is  denoted  (dotted)  C3. 
In  each  case,  the  slopes  of  the  paths  asymptotically  approach 
Im(A)/Re(A)  =  tan  9  for  large  A. 

For  9  —  0,  deformation  of  the  path  of  integration  from  Co 
to  Ci  does  not  pass  any  singularities  in  the  complex  plane,  so 
the  steepest  descents  integration  gives  the  entire  response.  As 
9  increases,  the  contributions  from  crossing  the  branch  point  at 
ui  and  the  pole  P  must  be  included  in  the  response.  The  inte¬ 
gration  around  the  n±  branch  cut  contributes  only  to  near  fields 
in  medium  II  and  can  be  neglected  since  we  are  interested  only 
in  the  radiation  fields  many  wavelengths  away  from  the  slab. 
The  roles  of  the  integrals  around  the  branch  cuts  are  reversed 
for  evaluation  of  the  radiation  fields  in  medium  I.  The  residue  at 
the  pole  gives  the  radiation  coupled  into  the  bound  waveguide 
mode  (including  the  evanescent  fields  in  the  cladding  regions). 

To  make  this  more  concrete,  evaluate  the  radiation  fields  for 
arbitrary  angles  in  region  II.  The  starting  point  is  (10),  modified 
by  the  transition  to  the  Hankel  function  as  described  above: 
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Substitute  an  integral  expression  for  the  Hankel  function  [2], 
viz. 
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The  next  step  is  to  transform  the  last  exponential  into  a  suitable 
expression  for  a  steepest  descents  evaluation  (i.e.,  to  find  a  path 
through  a  saddle  point  with  a  constant  imaginary  argument  to 
eliminate  rapid  oscillations  of  the  exponential).  Transform  to 
spherical  coordinates  (p  =  R  sin  9;  z  =  R  cos  9)  and  make 
the  conformal  mapping: 


A  =  n2  sin  r 
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m  =n2/ni. 

(20) 

Since  we  are  only  interested  in  the  lowest  order  radiation  terms 
(scaling  as  1  /R),  it  is  permissible  to  neglect  the  y2  in  the  rad¬ 
ical  and  carry  out  the  simple  Gaussian  integral  over  y.  Including 
these  transformations  and  approximations  gives  (21)  as  shown 
at  the  bottom  of  the  page  where  the  path  of  integration  is  ini¬ 
tially  over  the  transformed  Re(A)  axis.  The  deformation  of  this 
integration  path  to  the  steepest  descents  path  will  be  carried  out 
after  the  next  step.  Now  define 

w  —  t  —  9  (22) 

to  express  the  exponential  part  of  the  integrand  as  a  simple  co¬ 
sine  function 
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The  saddle  point  for  the  integration  is  clearly  at  w  —  0  and 
the  steepest  descents  path  of  integration  is  chosen  to  keep 
Re(cos  w)  —  1  and  expressing  w  —  u  +  it |  gives  for  the 
integration  path  cosh(n)  cos(v)  =  1.  Tracing  this  equation 
back  through  the  transform  gives  the  contours  already  discussed 
in  the  A  plane. 
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One  final  transformation  is  needed  to  complete  the  integra¬ 
tion,  viz. 
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which  displays  the  usual  steepest  descents  form  with  w  now  a 
function  of  x.  The  dominant  contributions  to  the  integral  occur 
for  small  x  and  the  remaining  terms  need  only  be  expanded  in 
a  power  series  around  x  =  0.  Since  we  are  interested  in  the 
radiation  terms  only,  it  is  sufficient  to  evaluate  all  of  the  terms 
in  the  integrand  except  for  the  Gaussian  exponential  at  x  —  0, 
giving  the  final  result 
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After  all  of  this  complex  manipulation,  the  final  result  has  a 
very  intuitive  form.  Remember  that  9  is  the  external  angle  in  the 
cladding  material  above  the  dielectric  slab.  Then  from  Snell’s 
law,  '9  is  the  internal  angle  within  the  slab.  Rp o  is  the  angu¬ 
larly  dependent  magnetic  field  reflectivity  and  since  by  defini¬ 
tion  m  <  1,  Rpo  <  0  at  9  —  0.  The  first  factor  in  the  result 
is  the  unbounded  medium  I  contribution  to  the  Hertz  vector  (5), 
the  second  factor  is  an  almost  pure  phase  shift  reflecting  the  di¬ 
vision  of  the  propagation  between  medium  I  and  medium  II,  and 
the  third  factor  describes  the  angularly  dependent  transmission 
into  medium  II  in  terms  of  the  familiar  TM-polarized  Fresnel 
reflectivity  modified  by  multiple  reflections  in  the  slab  geom¬ 
etry.  This  term  has  a  series  of  maxima  at  2n\L  cos  d  —  2ln 
(l  an  integer)  as  a  result  of  the  Fabry-Perot  modes  of  the  slab 
geometry.  The  two  terms  in  the  numerator  come  from  the  up¬ 
ward  and  the  downward- and-reflec ted  emitted  waves  in  the  2-D 
Fourier  transform  of  the  dipole  radiation,  at  a  —  0  dipole  in  the 
middle  of  the  slab,  these  two  terms  combine  to  eliminate  every 
other  slab  resonance.  As  a  result  of  the  sign  of  Rp o,  the  even 
resonances  are  absent  in  the  response. 

In  order  to  proceed  further,  it  is  convenient  to  shift  from  the 
Hertz  vector  to  the  corresponding  electric  and  magnetic  fields 


using  (1)  and  (2).  Since  again  we  are  only  interested  in  the  far 
fields,  this  is  a  relatively  simple  operation  with  the  result 


Gin2R 

Ee  (xk$m2  sin  9F J  (0,  Z,  a)- 

Hv  oc  TO2  sin  9FY  {9,  L, 
v  Vo 

with 


R 

pin2R 

®) — —  (28) 


Fy(9,  L,  a) 


_  ni  cos  i)—n2  cos  &)Lf  2 


(1  -  Rpo) 


-ini a  cos  $ 


+  Rpoe 


ini  (L+o)  cos  V 


1  -  Rp0e2iniL  cos  ^ 


•  (29) 


For  a  single  interface, 

F]r  (9,  L,  a) 

t  gini(L/2-a)  cos  i) ^—in2L  cos  0/2^  _  R  q) 


single  interface 


(30) 


Finally  integrating  across  the  entire  upper  hemisphere  and  nor¬ 
malizing  to  the  power  radiated  by  the  dipole  in  an  unbounded 
medium  of  dielectric  constant  ei  gives 

Py  —  f  | Fj{9,  Z,  a) |2  sin3  9d9.  (31) 

4  7o 

Here  the  “hat”  is  a  reminder  that  this  result  is  normalized  to 
the  unbounded  medium  radiated  power.  The  expression  for  the 
power  radiated  into  the  lower  half- space  is  identical  with  a  — > 
—a.  Note  the  very  strong  dependence  of  the  radiated  power  on 
the  dielectric  discontinuity  oc  ra° ;  for  a  GaAs  slab  in  air,  m  ~ 
1/3.6  and  m°  ~4x  10-3.  As  is  well  known,  very  little  of  the 
radiated  power  escapes  from  a  high-dielectric  constant  slab.  For 
a  single  interface,  this  radiation  distribution  has  been  derived 
[18]  previously  with  a  very  different  approach. 

As  is  shown  below,  for  a  horizontal  dipole,  a  larger  fraction  of 
the  dipole  power  is  radiated  for  the  same  dielectric  discontinuity. 
Detailed  evaluation  of  the  radiated  power  is  deferred  to  a  later 
section  after  the  equivalent  result  for  the  horizontal  dipole  is 
presented.  The  question  we  next  address  is  the  power  radiated 
into  the  waveguide  slab  modes,  since  so  little  power  escapes 
from  the  slab,  we  expect  that  much  of  the  radiation  power  is 
contained  in  these  slab  modes. 


B.  2-D  Waveguide  Modes  for  Vertical  Dipole 

As  has  been  noted  already,  the  waveguide  modes  correspond 
to  the  residues  at  the  poles  of  the  denominator  that  correspond  to 
the  slab  waveguide  modes.  These  correspond  to  Rpe~llL  —  ±1 
with  the  plus  sign  for  even  modes  and  the  minus  sign  for  the  odd 
modes.  For  a  sufficiently  thin  slab  ( L  <  A/ni),  there  is  only  a 
single  even  mode.  We  start  the  evaluation  for  this  case  and  gen¬ 
eralize  later  to  arbitrary  L.  For  distances  of  several  wavelengths 
from  the  source,  it  is  appropriate  to  use  the  asymptotic  limit  of 
the  Hankel  function 


(32) 


^  large 
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Applying  these  results  to  the  Hertz  vector  contribution  inside 
the  slab,  and  neglecting  the  inhomogeneous  solution  that  does 
not  have  the  pole  associated  with  the  cavity  resonances,  gives 


<&{R)  =  - 


s  r°°  \d\  lXP 

Vtt  vve 

cosh(7i^)  cosh(7id) 

7i(l  -  RPe~^'L){l  +  RPe-^'L) 


The  next  step  is  to  expand  the  denominator  around  the  pole  at 
A  —  kp  where  kp  is  the  solution  of  the  transcendental  dispersion 
relationship  for  the  mode.  After  some  algebra,  the  result  is 


The  evaluation  of  the  power  radiated  into  the  mode  requires  an 
integration  across  the  mode  profile  that  scales  as  cosh(7ipz)  in¬ 
side  the  slab  and  as  cosh(7ipZ/2)e=F72p^=FL/2^1  in  the  cladding 
regions  above  (—  signs)  and  below  (+  signs)  the  slab.  This  in¬ 
tegration  gives 


Otm  — 


/  J-L/2 


|  cosh(7ipA:o^)|2  dz 


!  2|cosh(7iPfc0JL/2)|2  f°°  ei{l2p-llv)k 0sdz 
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=  I  ^  f  sinll^pfc|)£)  ,  sin(7ipfco^)\ 


2  \ki 


J/  kp(X  kp )  - 


x  1  + 


12PL[(ki  +  K2)kl  -  «i«2] 


where  7 ip  denotes  7 \  evaluated  at  A  —  kp.  Using  this  expression, 
the  equivalent  integral  to  (16)  for  medium  I  can  be  evaluated  as 
27r i  x  (residue  at  the  pole),  giving  the  simple  result 


|cosh(7ip/c0i/2)|2 


where  the  real  and  imaginary  parts  of  the  7 i  ’  s  are  denoted  by  7  • 
and  7 ",  respectively.  The  final  result  for  the  power  radiated  into 
the  TM  mode  by  a  vertical  dipole,  integrated  across  all  angles 
and  normalized  to  the  unbounded  medium  radiated  power,  is 


4>l(R)=k0 


pV  _ 
1  TM  — 


37r  |cosh(7ipa)|  \kp\2OfM 


Re(ni)Z2  1  + 


cosh(7i pz)  cosh(7ipa) 


72 pL  [(«!  +  n2)k2  -  K1K2] 


n±L  1  + 


nPL[{ni  +  n2)k2  -  kik2\ 


Generalizing  this  result  to  an  arbitrarily  slab  thickness  involves 
summation  over  all  allowed  even  and  odd  modes,  viz. 


Detailed  discussion  will  be  deferred  until  later;  at  this  point,  it 
is  appropriate  to  note  a  few  simple  features  of  this  result.  First, 
this  represents  a  cylindrical  wave  propagating  along  the  slab  and 
decaying  as  1  /y/p,  in  contrast  to  the  3-D  result  from  the  branch 
cut  integration,  (26),  that  gave  a  spherical  wave  propagating  in 
the  radial  direction  decaying  as  1  /R.  The  cosh(7ipi)  represents 
the  mode  variation  across  the  slab,  while  the  cosh(7ipa)  is  the 
overlap  of  the  dipole  with  the  mode  profile  which  is  strongest 
in  the  middle  of  the  slab.  For  small  Z,  >  0  since  kp  ^  n  2 
and,  therefore,  72  — >  0;  while,  for  large  Z,  fl  scales  as  1/Z. 

As  was  the  case  for  the  3-D  modes,  the  next  step  is  to  convert 
from  the  Hertz  vector  to  the  corresponding  electric  and  magnetic 
fields 

E1  =  J I7  etkpP  if  sinh(7ipi)ep  +  cosh(7ipi)ez 


Am  —  A3 


37 r  |cosh(7ipa)|2  \kp\2OeTl 


Re(ni)Z2  1  +  - 


2K\K2 


'2 pL  [{.Ki+K2)(k™)2-KiK2] 


37T  |sinh(7ipq)|2  \kp^Oju 


Re(ni)Z2  1  +  - 


'2 pL  [(Kl+K2)(k™)2  -  KiK2]  I 

(39) 


where 


1  /  1  \  /  smh(^lpk0L)  sm(^pk0L) 

yTM  —  o  ^  I  —  I  77  77 7 


Z  1  + 


cosh(7ipa) 

2k\K2 

12PL[{ki  +  K2)k2  -  kik2] 
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Vo  V  P 


osh(7ip7)e¥ 


cosh(7ipa) 


L  1  + 


72p-£[(«l  +  «2 )k2  -  KiK2\ 


1  sinh(7ipfc0L/2)| 


is  defined  analogously  with  OfM . 

C.  3-D  Radiation  Modes  for  Horizontal  Dipole 

The  calculation  proceeds  in  much  the  same  way  as  that  for  the 
vertical  dipole,  with  the  added  complexity  of  a  two-component 
Hertz  vector.  For  the  <fix  component,  the  result  is  available  from 
that  of  the  vertical  dipole  with  the  substitutions  noted  above  for 
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the  Hertz  vector 


4JI 


(Ji) 


~i(ni  cos  t) — Tt-2  cos  $)L/2 


3D  K2R 
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where 


In  the  single  interface  limit,  these  results  reduce  to 

Ff  (e,  L,  a)  - - ■ - ► 

\  /  single  interface 

x  ^mi (L/2— d)  cos  1?^— m2L  cos  #/2^  R  o') 


7H 


(0,  L ,  a)  - 

V  /  si 


single  interface 
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(45) 


cos 


$  =  \/l  —  m2  sin2 


-^sO  — 
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(42) 


Here  Rsq  is  the  TE-polarized  electric  field  Fresnel  reflectivity 
across  the  boundary  from  region  I  into  region  II.  The  interpreta¬ 
tion  of  this  result  is  very  similar  to  that  discussed  above;  we  will 
show  below  that,  while  the  interference  of  the  emission  compo¬ 
nents  arising  from  the  up  and  down  directions  again  cancels  half 
of  the  cavity  resonances  in  the  denominator,  because  Rs q  is  pos¬ 
itive  for  small  angles,  while  Rp o  is  negative,  the  alternate  set  of 
resonances  is  eliminated. 

As  expected  from  the  Hertz  vector,  the  z  component  is  more 
algebraically  complex  involving  both  the  TE-  and  TM-Fresnel 
reflectivities  as  shown  in  (43)  at  the  bottom  of  the  page.  As 
above,  the  next  step  is  to  obtain  the  electric  and  magnetic  ra¬ 
diation  fields  in  the  cladding  half-plane  from  the  Hertz  vector 
components. 


Detailed  discussion  of  this  result  and  the  comparison  between 
the  horizontal  and  vertical  dipoles  in  various  dielectric  struc¬ 
tures  will  be  deferred  until  after  the  waveguide  mode  radiation 
results  for  the  horizontal  dipole  are  presented. 

D.  2-D  Waveguide  Modes  for  Horizontal  Dipole 

The  horizontal  dipole  can  couple  to  TE  (s)  waveguide  modes 
through  the  component  of  the  mode  electric  field  and  to  TM 
(p)  waveguide  modes  through  the  ep  component.  Both  sets  of 
poles  occur  in  the  (/^response.  The  procedure  for  evaluating 
the  radiated  power  is  similar  to  that  outlined  above  and  only  the 
final  results  are  given  here: 
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Fig.  4.  Power  into  the  lowest,  zeroth-order,  even  waveguide  mode  (normalized 
to  an  unbounded  dielectric  medium)  for  dielectric  constants  corresponding  to 
Si  and  Si02 .  The  peak  in  the  radiated  power  occurs  as  the  effective  mode  index 
shifts  from  the  index  of  the  cladding  to  that  of  the  slab.  The  maximum  radiated 
power  for  a  vertical  dipole  for  the  large  Si  dielectric  constant  is  greater  than  that 
in  an  unbounded  medium  corresponding  to  an  enhanced  radiative  efficiency. 


Fig.  3.  Total  power  (above  and  below  the  slab)  into  the  3-D  radiation  modes 
(normalized  to  an  unbounded  dielectric  medium)  for  horizontal  and  vertical 
dipoles  as  a  function  of  normalized  slab  width  for  three  dielectric  constants.  For 
large  dielectric  contrasts,  the  radiation  clearly  shows  alternating  Fabry-Perot 
slab  resonances.  Radiation  is  strongly  suppressed  for  the  vertical  dipole  in 
comparison  with  the  horizontal  dipole.  For  the  lowest  dielectric  contrast,  the 
resonances  are  less  pronounced  and  correspond  to  the  onset  of  waveguide 
modes. 


and 


where  the  definitions  of  the  mode  integrals  O^m  are  given  in 
(37)  and  (40).  The  coupling  for  a  dipole  in  the  center  of  the  slab 
(a  =  0)  is  to  the  even  TE  and  the  odd  TM  modes. 

E.  3-D  Radiation  Coupling 

The  total  3-D  radiation  (integrated  over  both  cladding  regions 
above  and  below  the  slab)  is  plotted  in  Fig.  3  as  a  function  of 
the  normalized  slab  width  for  vertical  and  horizontal  dipoles 
in  the  center  of  the  slab  for  three  different  slab  dielectric  con¬ 
stant  values  corresponding  to  SiC>2  [ki  —  2.25],  Si  [at  a  visible 
wavelength  =  18.25  (+0.5i)],  and  a  fictitious  high  dielec¬ 
tric  constant  material  [n±  =  50]  to  demonstrate  the  limiting  be¬ 
havior  of  the  radiation.  The  cladding  dielectric  constant  is  taken 
as  unity  in  all  cases.  A  varying  scale  factor  has  been  applied  to 
the  (smaller)  vertical  dipole  results  so  that  both  the  vertical  and 


horizontal  dipole  3-D  radiation  powers  are  visible  on  the  same 
scale.  The  low  efficiency  of  coupling  into  the  radiation  modes 
for  the  vertical  dipole  is  expected;  because  of  the  small  critical 
angle  at  high  dielectric  contrast,  only  radiation  emitted  very  near 
vertical,  where  the  dipolar  radiation  vanishes,  can  exit  the  slab. 

For  the  two  larger  slab  dielectric  constants,  the  periodic  be¬ 
havior  resulting  from  the  Fabry-Perot  denominators  in  (10)  and 
(13)  is  clearly  evident.  Because  of  the  differing  signs  of  Rp o  and 
Rs  o,  the  resonances  alternate  with  peaks  in  the  horizontal  dipole 
radiation  at  even  numbers  of  half- wavelengths  and  in  the  ver¬ 
tical  dipole  radiation  at  odd  half- wavelength  slab  thicknesses. 
The  horizontal  dipole  resonances  are  narrower  than  the  corre¬ 
sponding  resonances  for  the  vertical  dipole.  The  effects  of  ab¬ 
sorption  are  indicated  as  dotted  lines  in  the  middle  panel  for  the 
Si  case.  For  thicker  slabs,  the  resonances  broaden  out  as  a  result 
of  the  angular  integrations.  It  is  straightforward  to  show  from 
the  analytic  results  that  the  radiated  power  away  from  the  reso¬ 
nances  scales  as  |m|°  for  the  vertical  dipole  and  as  |m|3  for  the 
horizontal  dipole,  while  at  the  peaks  the  scaling  is  |m|3  (vertical) 
and  \m\  (horizontal).  For  the  lower  dielectric  constant,  the  reso¬ 
nances  are  less  pronounced  with  the  cusps  of  the  curves  scaling 
in  thickness  as  y/ni  —  rather  than  the  limiting  result  of 
ni/X  for  high  dielectric  contrast.  These  features  in  the  3-D  ra¬ 
diation  pattern  all  correspond  to  the  onset  of  2-D  waveguide 
modes  with  increasing  slab  thickness  as  is  discussed  below. 

F.  2-D  Waveguide  Mode  Coupling 

The  coupling  into  the  lowest  order  bound  mode  (TE  for  the 
horizontal  dipole,  TM  for  the  vertical  dipole)  for  a  dipole  in  the 
center  of  the  slab  is  shown  in  Fig.  4  as  a  function  of  the  normal¬ 
ized  slab  thickness  for  two  slab  dielectric  constants,  again  cor¬ 
responding  to  SiC>2  and  to  Si  at  visible  wavelengths.  As  before, 
these  results  are  normalized  to  the  unbounded  dielectric  radiated 
power.  For  very  thin  slabs,  <CA/ni,  the  radiative  coupling  van¬ 
ishes  in  both  orientations  because  the  modes  are  very  spread  out 
(72  — >  0)  and,  therefore,  their  intensity  at  the  slab  is  very  low. 
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Fig.  5.  Same  as  Fig.  4  except  the  dipole  is  at  the  edge  of  the  slab.  There  are 
now  two  peaks  for  the  horizontal  dipole  corresponding  to  both  the  zero-order 
TE  and  TM  modes.  These  are  clearly  distinct  for  the  higher  dielectric  constant, 
but  are  merged  for  the  lower  k  x  .  The  radiation  for  the  vertical  dipole  at  high 
k  i  is  almost  completely  suppressed  in  this  configuration  as  a  result  of  the  very 
small  value  of  the  modal  electric  field  at  the  edge  of  the  dielectric. 

For  very  wide  slabs,  again  the  mode  is  spread  out  and  of  low  in¬ 
tensity,  from  the  analytic  results  the  radiation  power  into  a  spe¬ 
cific  mode  decays  as  1  /L2.  The  peak  radiative  coupling  in  each 
case  occurs  for  slab  thicknesses  where  the  geometrical  waveg- 
uiding  effects  are  strongest  ( kP}  s  midway  between  n 2  and  ni). 
For  the  Si  dielectric  constant,  the  normalized  vertical  mode  radi¬ 
ated  power  is  larger  than  unity,  implying  a  larger  radiative  cross 
section  for  the  dipole  than  in  an  unbounded  medium.  This  is  by 
now  a  familiar  cavity  electrodynamic  result  [5]— [7],  [9]— [15] . 
For  arbitrarily  large  dielectric  constants,  this  emission  enhance¬ 
ment  saturates  at  about  15%  above  the  unbounded  medium  re¬ 
sult.  When  the  contributions  of  higher  order  modes  are  added, 
the  horizontal  dipole  emission  becomes  slightly  larger  than  that 
of  the  vertical  dipole  and  saturates  at  about  30%  above  the  un¬ 
bounded  medium  result.  Of  course,  larger  enhancements  of  the 
emission  efficiency  are  achieved  with  more  dimensions  of  con¬ 
finement,  e.g.,  wires  and  boxes.  For  this  symmetric  position  in 
the  center  of  the  slab,  there  is  no  coupling  to  the  lowest  order 
(even)  TM  mode  [oc  |  sinh(7ia)  |2].  If  the  dipole  is  moved  away 
from  the  center,  this  coupling  is  allowed.  An  example  is  shown 
in  Fig.  5  for  the  dipole  just  at  the  edge  of  the  slab  for  the  same 
dielectric  constant  values.  For  the  Si  dielectric  constant,  the 
two  peaks  corresponding  to  the  TE  and  TM  modes  are  clearly 
evident,  at  the  lower  SiC>2  dielectric  constant  they  merge  into 
a  single,  broader  peak.  The  radiative  coupling  for  the  vertical 
dipole  is  so  low  as  to  be  barely  visible  for  the  Si  dielectric  con¬ 
stant  as  a  result  of  the  small  amplitude  of  the  z -directed  TM 
mode  field  at  the  edge  of  the  slab  for  large  dielectric  contrast. 

As  the  slab  thickness  is  increased,  radiation  is  possible  into 
other  modes  and  the  total  radiated  power  is  a  sum  over  all  avail¬ 
able  channels  including  both  2-D  waveguide-  and  3-D  radiation 
modes.  Both  the  2-D  waveguide  radiation  and  the  total  radiated 
power  as  a  function  of  slab  thickness  is  shown  in  Fig.  6  for  the 
Si  dielectric  constant  and  in  Fig.  7  for  the  SiC>2  dielectric  con¬ 
stant.  In  both  cases,  the  dipole  is  taken  to  be  in  the  center  of 
the  slab.  The  additional  modes  smooth  out  the  radiation  power 
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Fig.  6.  The  distribution  of  radiated  power  into  all  modes  for  a  dipole  located 
in  the  center  of  a  Si  slab.  For  a  vertical  dipole,  the  only  contribution  is  from  the 
even  TM  modes.  Except  for  very  thin  slabs,  the  radiation  is  roughly  constant  and 
equal  to  that  in  an  unbounded  dielectric  medium.  For  a  horizontal  dipole,  there 
are  contributions  from  both  the  even  TE  and  odd  TM  modes.  At  the  peak  of 
the  radiative  coupling,  where  coupling  to  TE0 ,  TE2  and  TMi  all  contribute,  the 
radiation  peaks  at  about  1 .3  times  that  in  bulk.  The  addition  of  the  3-D  radiation 
results  in  the  dotted  curve  for  the  horizontal  dipole,  filling  in  radiative  energy 
just  at  the  onset  of  each  mode.  The  3-D  radiation  contributions  are  too  small  to 
be  visible  on  this  scale  for  the  vertical  dipole. 
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Fig.  7.  The  distribution  of  radiated  power  into  all  modes  for  a  dipole  located 
in  the  center  of  an  Si02  slab.  The  larger  contribution  of  the  3-D  radiation  to  the 
total  power  is  evident  for  both  dipole  orientations. 

as  a  function  of  slab  width  although  the  onset  of  each  mode  is 
clearly  visible  for  both  dielectric  constants.  For  the  higher  di¬ 
electric  constant  (Fig.  6),  the  addition  of  the  radiation  into  both 
the  l  —  1  odd  TM  mode  (shoulder)  and  the  l  —  2  even  TE 
mode  (peak)  increases  the  maximum  radiated  power  for  the  hor¬ 
izontal  dipole  slightly  above  that  of  the  vertical  dipole.  There 
are  two  curves  presented  in  Fig.  6  for  the  horizontal  dipole:  the 
lower  curve  is  the  radiated  power  into  the  waveguide  modes, 
and  the  higher  curve  (dotted),  which  is  filled  in  just  before  the 
onset  of  each  mode,  includes  the  contributions  of  the  power  into 
the  3-D  radiation  modes  out  the  sides  of  the  slab.  This  3-D  ra¬ 
diation  contribution  is  too  small  to  be  visible  for  the  vertical 
dipole.  As  noted  above  in  the  discussion  of  the  3-D  radiation, 
the  peaks  in  the  3-D  radiation  occur  at  the  valleys  of  the  2-D 
modal  emission  and  tend  to  lead  to  a  more  uniform  emission  as 
a  function  of  slab  width.  Mathematically,  both  of  these  effects 
are  described  by  the  multiple-reflection,  Fabry-Perot  denomi¬ 
nators  [cf.  (10),  (13),  and  (15)]  that  play  a  role  in  both  the  3-D 
and  2-D  calculations.  At  the  lower  dielectric  constant  (Fig.  7) 
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Fig.  8.  The  distribution  of  radiated  power  into  all  modes  for  /V  i  ’s  of  1.3  and 
1.03  for  a  dipole  located  in  the  center  of  the  slab.  As  the  dielectric  constants 
are  decreased,  an  increasing  fraction  of  the  power  goes  into  the  3-D  radiation 
modes. 

the  contributions  of  the  3-D  radiation  are  significant  for  both 
dipole  orientations.  The  peak  total  radiation  (sum  of  2-D  wave- 
guide-  and  3-D  radiation-modes)  is  shifted  away  from  the  2-D 
waveguide  peak  and  is  higher  for  the  horizontal  than  for  the  ver¬ 
tical  dipole.  The  contributions  of  the  l  —  1  odd  TM  mode  for 
the  horizontal  dipole  are  clearly  evident  at  half-integral  normal¬ 
ized  slab  widths.  These  trends  are  continued  for  lower  dielectric 
constants  as  shown  in  Fig.  8  for  dielectric  constants  of  1.3  and 
1.03,  spanning  the  typical  dielectric  contrasts  for  heterostruc¬ 
ture  diode  lasers.  The  lower  the  dielectric  contrast,  the  larger 
the  fraction  of  power  radiated  into  the  3-D  modes.  For  the  di¬ 
electric  constant  of  1.3,  the  total  radiated  power  for  a  vertical 
dipole  is  almost  uniform  at  unity,  i.e.,  equal  to  that  in  an  un¬ 
bounded  medium,  except  for  very  thin  slabs.  There  is  somewhat 
more  structure  for  a  horizontal  dipole  and  the  total  power  radi¬ 
ated  both  peaks  at  a  higher  level  and  saturates  at  the  slightly 
lower  level  than  for  the  vertical  dipole.  For  the  1.03  dielectric 
constant  case,  all  of  the  structure  is  essentially  eliminated.  These 
curves  were  calculated  for  a  unity  cladding  dielectric  constant, 
identical  curves  are  obtained  for  the  actual  semiconductor  laser 
case  of  much  larger  cladding  dielectric  constants,  but  the  same 
percentage  difference  between  the  slab  and  cladding  dielectric 
constants. 

Useful  insight  is  gained  by  plotting  the  radiated  power  and 
its  division  between  3-D  radiation  and  2-D  waveguide  modes 
as  a  function  of  the  dielectric  discontinuity.  Fig.  9  shows  this 
dependence  for  lossless  dielectrics.  In  each  case,  the  peak  radi¬ 
ated  power  is  plotted  for  both  horizontal  and  vertically  oriented 
dipoles  located  at  the  center  of  a  slab.  The  slab  width  is  a  free 
parameter  that  is  allowed  to  vary  for  different  dielectric  discon¬ 
tinuities.  The  curves  labeled  vertical  (horizontal)  dipole — TM 
(TE)  mode  correspond  to  the  largest  values  in  the  plots  of  2-D 
radiated  power  versus  slab  width  as  shown  in  Figs.  5-8.  As 
noted  above,  these  values  saturate  for  large  dielectric  discon¬ 
tinuities.  The  dashed  curve  labeled  P, ^  +  P^m  corresponds  to 
the  peak  emission  power  for  the  horizontal  dipole  at  large  di¬ 
electric  contrast  when  the  sum  of  powers  at  for  slab  widths  of 
about  a  normalized  wavelength  is  larger  than  the  zeroth  order 
mode  peak.  The  two  dashed  curves  labeled  uncorrelated  sur¬ 
faces  are  the  power  emitted  into  the  3-D  radiation  modes  out 
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Fig.  9.  Summary  of  the  dielectric  contrast  dependence  of  the  power  radiated 
into  both  the  3-D  radiation  and  2-D  waveguide  modes.  The  slab  width  is  allowed 
to  vary  as  the  dielectric  constants  are  varied.  The  peak  values  of  the  3-D  radiation 
power  at  the  first  peak  (~\n2/2n1  for  the  vertical  dipole  and  ~An2/ni  for 
the  horizontal  dipole  at  high  contrast)  are  displayed.  The  2-D  waveguide  mode 
power  is  the  peak  of  the  zeroth-order  mode  (TM0  for  a  vertical  and  TE0  for 
a  horizontal  dipole).  The  maximum  radiation  for  the  horizontal  dipole  at  high 
contrast  shifts  to  the  sum  of  the  powers  into  TE0 ,  TE2  and  TMi  as  indicated  by 
the  dashed  line  giving  a  maximum  enhancement  in  the  radiated  power  of  ~30% 
over  a  dipole  in  an  unbounded  medium. 

the  two  sides  of  the  slab  neglecting  any  Fabry-Perot  cavity  ef¬ 
fects,  e.g.,  just  twice  the  single  surface  result.  As  required,  in  the 
limit  of  small  dielectric  discontinuities,  these  powers  approach 
unity.  In  the  opposite  limit  of  large  dielectric  discontinuity,  these 
curves  scale  as  ra°  for  the  vertical  dipole  and  m3  for  the  hor¬ 
izontal  dipole.  Finally,  the  two  solid  curves  labeled  resonance 
peak  are  the  total  emitted  radiation  at  the  first  Fabry-Perot  peak 
of  the  emission  out  both  sides  of  the  slab  and  include  the  effects 
of  the  slab  resonances  that  are  important  in  vertical  cavity  lasers. 
For  low  discontinuity,  these  approach  unity,  for  high  discontinu¬ 
ities  they  scale  as  m3  (vertical)  and  m  (horizontal).  The  slight 
bend  in  the  curve  for  the  3-D  radiation  of  the  vertical  dipole 
at  (a?i/^2  —  1)  ~  3  corresponds  to  the  shift  of  the  peak  from 
even  integral  reduced  wavelengths  at  small  dielectric  contrast 
to  odd  integral  half  wavelengths  at  high  dielectric  contrast  (cf. 
Fig.  3).  The  dielectric  discontinuities  corresponding  to  impor¬ 
tant  classes  of  semiconductor  and  glass  dielectric  constants  are 
marked  on  the  figure.  These  curves  were  calculated  for  ^2  =  1 
(e.g.,  air  or  vacuum).  As  noted  above,  for  the  case  of  a  diode 
laser  with  semiconductor  claddings  equivalent  results  are  ob¬ 
tained  as  long  as  the  x  axis  is  normalized  as  y/ni  —  K2L/X;  all 
of  these  results  are  universal  curves  as  a  function  of  relative  di¬ 
electric  discontinuity. 

VI.  Summary 

An  analysis  of  the  radiation  emitted  from  a  dipole  of  arbi¬ 
trary  location  and  orientation  in  a  symmetrically  clad  dielec¬ 
tric  slab  has  been  presented.  The  results  for  arbitrary  dielectric 
contrasts  are  consistent  with  previous  work  [15]  and  are  pre¬ 
sented  in  terms  of  simple,  closed-form  equations  for  both  the 
3-D  free-space  emission  into  plane  waves  and  the  2-D  emis¬ 
sion  into  waveguide  modes  of  the  slab.  These  results  arise  from 
a  unified  analysis  based  on  the  early  work  of  Sommerfeld  [1] 
for  a  dipole  near  a  dielectric  interface.  It  is  interesting  to  find 
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that  work  that  was  motivated  by  the  discovery  of  long-distance 
radiotelegraphy  has  fresh  relevance  to  current  microcavity  and 
diode  laser  research. 

The  analytic  results  are  evaluated  for  a  wide  range  of 
dielectric  contrasts  ranging  from  those  of  free-standing  semi¬ 
conductor  films  to  those  of  double-heterostructure  diode  lasers. 
Over  this  wide  range,  the  distribution  of  power  into  the  3-D  and 

2- D  modes  varies  widely.  For  large  dielectric  contrast,  almost 
all  of  the  radiated  power  is  into  the  waveguide  modes  with 
only  a  small  amount  of  power  into  the  radiation  modes.  For  a 
vertically  oriented  dipole  at  the  center  of  the  slab,  the  power 
into  the  radiated  modes  scales  as  ra°  for  a  single  interface 
and  as  m3  at  the  peak  of  the  Fabry-Perot  resonances.  This 
vertical  dipole  emission  is  weak  because  of  the  small  critical 
angle  escape  cone  and  the  null  in  emission  along  the  direction 
of  the  dipole.  The  peak  emission,  for  a  slab  thickness  close 
to  the  wavelength,  saturates  at  15%  above  that  of  a  dipole 
in  an  unbounded  dielectric  medium.  The  emission  into  the 

3- D  radiation  modes  is  much  larger  for  a  horizontally  oriented 
dipole  in  a  high  index  slab  since  the  dipole  emission  is  strongest 
in  the  direction  of  the  slab  normal,  the  power  into  the  radiated 
modes  scales  as  m3  for  a  single  interface  and  as  m  at  the  peak 
of  the  Fabry-Perot  resonances.  For  this  case,  the  peak  power 
into  the  lowest  order  mode  saturates  at  about  5%  below  the 
unbounded  medium  result;  however,  the  maximum  power  into 
all  waveguide  modes  occurs  for  slab  widths  of  about  a  reduced 
wavelength  where  power  is  radiated  into  all  three  of  the  zeroth- 
and  second-order  (even)  TE  modes  and  the  first-order  (odd) 
TM  mode.  At  this  peak,  the  radiated  power  saturates  at  a  level 
^30%  larger  than  that  in  the  unbounded  medium.  Even  for 
very  small  dielectric  discontinuities,  an  enhancement  of  the 
total  (2-D  and  3-D)  horizontal  dipole  radiated  power  persists 
for  slab  thicknesses  of  about  a  reduced  wavelength. 

These  results  have  application  to  a  wide  variety  of  topics  of 
current  interest  including: 

•  to  crystal  membrane  microresonantors  using  dielectric 
confinement  in  one  direction  and  photonic  crystal  con¬ 
finement  within  the  plane  of  the  membrane; 

•  to  enhanced  LEDs  where  the  object  is  to  couple  more  of 
the  emitted  radiation  into  the  useful  3-D  modes  and  less 
into  the  trapped  waveguide  modes; 

•  to  traditional  edge-emitting  lasers  where  the  radiation  into 
the  waveguide  modes  can  support  lasing  while  the  power 
into  the  3-D  modes  represents  lost  excitation; 

•  to  VCSELs  where  just  the  opposite  comments  apply; 

•  and  to  nanostructure  enhanced  Raman  scattering. 

For  the  Raman  experiment,  three  distinct  analyzes  are  re¬ 
quired:  1)  the  distribution  of  the  excitation  for  a  plane  wave 
incident  on  a  deep,  periodic  grating  structure;  2)  the  emission 
into  waveguide  and  radiation  modes  (this  analysis);  and  3)  the 
coupling  of  the  waveguide  modes  into  free- space  modes  at  the 
top  of  the  nanostructured  grating.  Extensions  of  this  work  to 
more  complex  dielectric  structures  such  as  VCSELs  and  peri¬ 
odic  gratings,  and  to  more  confined  structures  such  as  photonic 
wires  and  dots  will  be  discussed  in  ongoing  work. 
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behavior  of  capacitive  grids. 
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I.  Introduction 

Crossed  gratings,  of  which  metallic  grids  are  special 
cases ,  find  application  across  the  electromagnetic  spectrum. 
They  are  used  in  the  microwave  regime  as  filters  and  for 

making  efficient  antennas1.  In  the  far  infrared,  they  find 

2 

applications  as  beam  splitters  and  mirrors  .  Their  use  as 

3,4  5 

filters  and  solar  selective  surfaces  in  the  visible  and  near 
infrared  has  also  been  demonstrated.  Recently  there  has 

been  interest  in  the  behavior  of  sub -wavelength  aperture 

6 

metallic  crossed  gratings  (inductive  or  hole  grids  in  particu¬ 
lar)  in  the  visible  with  potential  applications  for  novel  opto¬ 
electronic  devices.  In  that  work,  the  important  features  of 
the  transmission  spectra  of  sub-wavelength  inductive  grids 
were  characterized  on  the  basis  of  a  wavevector  conserva¬ 
tion  model.  The  authors,  adopting  a  ‘kinematic’  approach, 
showed  that  the  salient  spectral  features  of  the  experimen¬ 
tally  measured  transmittance  correspond  to  the  coupling  of 

6 

incident  radiation  to  surface  plasma  waves  (SPW)  .  Al¬ 
though  this  method  explains  the  wavelength  position  of  the 
experimental  features,  a  rigorous  diffraction  model  is  neces¬ 
sary  to  quantitatively  describe  all  experimental  features  and 
as  a  tool  for  crossed  grating  design.  The  importance  of  this 
approach  is  underscored  by  the  recent  work  of  Heinzel  et 

al.  ,  Martin -Moreno  et  al.  and  Popov  et  al.,  where  the  be¬ 
havior  of  metallic,  crossed  gratings  was  analyzed  using  vec¬ 
tor  diffraction  models.  Reference  [7]  presented  experimental 

results  and  numerical  modeling  (based  on  rigorous  coupled 
10 

wave  analysis  (RCWA)  )  for  periodically  structured  metal 

surfaces  with  potential  applications  as  filters  in  thermo - 

8 

photovoltaic  systems.  Martin -Moreno  et  al.  and  Popov  et 

9 

al.  specifically  examined  the  behavior  of  metallic,  sub- 
wavelength  hole  arrays.  Reference  [8]  treated  the  electro¬ 
magnetic  fields  in  the  metallic  inductive  grids  using  surface 
impedance  boundary  conditions,  while  Ref.  [9]  employed 

the  Fourier  modal  method  .  Both  these  papers  analyzed  the 

6 

experimental  work  of  Ebbesen  et  al.,  and  attempted  to  ex¬ 
plain  the  mechanism  of  unusually  high  transmittance 
through  metallic,  sub  wavelength  aperture  inductive  grids. 
However,  to  the  best  of  our  knowledge,  rigorous  numerical 


modeling  of  finite  conductivity  capacitive  grids  along  with 
experimental  verification  has  not  yet  been  presented 
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(McPhedran  et  al.  and  Botten  et  al.  have  numerically 
modeled  perfectly  conducting  capacitive  grids  using  a  rigor¬ 
ous  modal  approach). 

In  this  paper  we  present  a  rigorous  diffraction  model 
validated  by  experimental  data  for  both  inductive  (hole)  and 
capacitive  (dot)  sub-wavelength,  metallic,  crossed  gratings 
under  normal  plane  wave  illumination.  We  present  the  ef¬ 
fects  of  varying  grating  parameters  on  the  mid-IR  transmit¬ 
tance  spectrum.  The  paper  is  organized  as  follows.  First,  we 
briefly  explain  the  fabrication  and  experimental  characteri¬ 
zation  of  crossed  gratings.  This  is  followed  by  a  description 
of  the  numerical  model.  Then,  we  discuss  the  coupling  of 
incident  radiation  into  SPWs  for  inductive  grids  and  show 
that  similar  coupling  also  occurs  for  capacitive  grids. 
Coupling  in  SPWs  for  both  kinds  of  gratings  are  described 
using  the  rigorous  diffraction  model.  Finally,  the  experimen¬ 
tal  verification  of  the  diffraction  model  is  demonstrated  by 
modeling  gratings  with  varying  parameters . 

n.  Fabrication  and  experimental  characterization 

The  gratings  used  to  validate  the  modeling  were  fabri¬ 
cated  using  interferometric  lithography  and  lift-off  pattern 
transfer.  A  double  polished  silicon  sample  is  used  as  a  sub¬ 
strate  material  as  it  is  transparent  in  the  2-5  jam  wavelength 
regime.  After  coating  the  substrate  with  photoresist,  inter- 
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ferometric  lithography  is  used  to  expose  the  pattern. 
Developing  the  photoresist  and  metallization  ty  e-beam 
evaporation  of  gold  follows.  The  final  step  is  an  acetone  jet 
lift-off  leaving  the  desired  pattern. 

Experimental  characterization  of  these  structures  is  done 
using  Fourier  transform  infrared  spectroscopy  (FTIR)  with 
an  unpolarized  light  source.  All  of  the  results  presented  in 
this  paper  are  normal  incidence  transmittance  spectra  nor¬ 
malized  to  the  system  response  without  any  sample. 

HI.  Modeling 

Our  numerical  model  takes  into  account  the  following 
important  aspects  of  diffraction  from  crossed  gratings. 
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IILa:  Finite  conductivity  of  the  gratings 

Crossed  gratings  are  modeled  in  the  microwave  regime 
by  taking  the  metallic  scatterers  to  have  infinite  conductiv- 

15-17 

ity  .  While  this  approximation  works  well  at  microwave 
frequencies  where  metals  have  very  high  conductivity,  its 
use  for  the  visible  and  ultraviolet  spectral  region  is  clearly 
inappropriate.  In  the  infrared  spectral  region,  the  suitability 
of  the  infinite  conductivity  approximation  is  uncertain.  We 
avoid  these  questions  by  using  a  rigorous  diffraction  model, 
which  takes  into  account  the  finite  conductivity  of  the  grat¬ 
ings.  This  approach  comes  at  a  high  comp  utational  cost. 

Since  the  late  1970s,  significant  work  has  been  done  in 
modeling  crossed  gratings  with  finite  conductivity.  Derrick 
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et  al.  and  Harris  et  al.  proposed  a  model  based  on  a 

21 

coordinate  transformation,  Vincent  used  a  finite  difference 
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method,  while  Bruno  et  al.  employed  a  variation  of 

23 

boundaries  method.  More  recently,  Kettunen  et  al.,  pre¬ 
sented  numerical  results  for  metallic  inductive  grids  based 
on  the  rigorous  coupled  wave  analysis  (RCWA).  Their 
modeling  was  done  principally  for  the  case  of  rectangular 
scatterers  and  included  the  dielectric  function  expansion 

li 

changes  suggested  by  Li  to  improve  the  convergence  of 
the  algorithm.  Here,  we  present  modeling  based  on  RCWA 
for  finite  conductivity  rectangular  as  well  as  circular  metal¬ 
lic  scatterers,  arranged  in  a  square  lattice  on  a  silicon  sub¬ 
strate. 

b:  Finite  substrate  effects 

The  results  presented  in  this  paper  are  for  the  normal  in¬ 
cidence  transmittance  spectra  of  metallic  crossed  gratings 
placed  on  a  silicon  substrate.  The  spectral  range  of  interest 
is  from  2-5  pm,  where  silicon  is  practically  transparent. 
Clearly,  in  this  regime  assuming  the  silicon  substrate  to  be 
infinite  (as  is  commonly  done  in  grating  diffraction  model¬ 
ing)  is  not  justified  due  to  multiple  reflections  taking  place 
within  the  high-index  substrate.  The  thickness  of  the  sub¬ 
strate  is  much  larger  than  the  coherence  length  of  the  inci¬ 
dent  light  for  the  chosen  experimental  conditions,  therefore 
the  appropriate  way  to  model  the  substrate  is  to  do  incoher¬ 
ent  addition  of  the  diffracted  orders  inside  the  substrate. 
Previous  work  based  on  this  approach  has  been  done  for 
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one-dimensional  gratings  by  Chateau  et  al.  and  Li  .  Here 
we  extend  the  energy  scattering  matrix  approach  used  by 
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li  to  model  diffraction  from  a  crossed  gratings  made  on  a 
thick  transparent  substrate. 

In  order  to  further  study  the  usefulness  of  incorporating 
finite  substrate  effects,  we  model  the  transmittance  spectra 
of  capacitive  grids  for  the  2-5  pm  wavelength  range  using 
both  the  infinite  substrate  model  (ISM)  and  the  finite  sub¬ 
strate  model  (FSM).  Our  numerical  simulations  show  that 
while  the  FSM  is  necessary  to  quantitatively  compare  with 
the  experimental  measurements,  the  ISM  captures  the  essen¬ 
tial  features  of  the  transmittance  spectra. 

In  this  paper,  because  of  computational  constraints,  we 
use  the  simpler  ISM  to  numerically  study  the  behavior  of 
crossed  gratings  and  use  the  FSM  whenever  we  compare  the 
numerical  modeling  with  the  experimental  measurements . 


c:  Convergence  of  the  numerical  model 

Our  numerical  model  is  based  on  RCWA,  and  we  study 
the  convergence  behavior  of  this  algorithm  for  the  ISM  and 
FSM. 

For  the  infinite  substrate  case  (ISM),  we  take  the  silicon 
substrate  to  be  infinitely  thick,  and  for  normal  plane  wave 

illumination,  take  advantage  of  the  symmetry  of  the  con- 
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figuration  to  reduce  the  numerical  size  of  the  problem. 
Specifically  for  normal  incidence,  the  equivalence  of  the 
positive  and  negative  Fourier  coefficients  allows  reduction 
of  the  final  eigenvalue  problem  from  2(2N+1)2  to  2(N+1)2, 
where  N  is  the  number  of  positive  or  negative  spatial  har¬ 
monics  for  both  the  dielectric  function  and  the  field  expan¬ 
sion.  This  reduction  allows  the  algorithm  to  be  implemented 
on  a  personal  computer  for  reasonably  high  values  of  N. 

Validating  the  performance  of  any  numerical  algorithm  is 
an  essential  part  of  establishing  the  utility  of  the  algorithm 
Unlike  the  situation  for  one-dimensional  gratings  where  the 
convergence  properties  of  RCWA  for  metallic  gratings  have 

27,28 

been  well  established  ,  little  is  known  of  the  behavior  of 
two-dimensional  RCWA  algorithms.  Initially,  we  examined 
the  numerically  calculated  diffracted  power  in  the  transmit¬ 
ted  zero  order  of  capacitive  grids  as  a  function  of  the  mode 
number  N  up  to  the  maximum  number  of  modes  as  allowed 
by  our  computational  resources.  Numerical  evaluations 
show  that  the  variation  in  diffracted  power  as  a  function  of 
mode  number  N  is  at  the  most  ±5%  after  N  =15.  In  the  re¬ 
sults  shown  in  this  paper  we  have  used  N  =  15  for  numerical 
modeling  and  expect  the  convergence  error  to  behave  simi¬ 
larly  for  all  structures .  However,  the  final  proof  of  the  valid¬ 
ity  of  the  two-dimensional  RCWA  lies  in  the  experimental 
verification  presented  later  in  this  paper. 

For  the  FSM,  we  include  the  finite  thickness  of  the  sili¬ 
con  substrate.  An  efficient  way  to  implement  this  is  to  use  a 

scattering  matrix  algorithm?^.  Here  we  use  the  full  scatter¬ 
ing  matrix  recursion^  to  account  for  all  propagating  orders 
incident  on  the  grating  from  the  substrate  side.  Depending 
on  the  grating  parameters,  the  diffraction  orders  propagating 
in  the  substrate  are  obliquely  incident  on  the  grating  from 
the  substrate  side.  The  grating  configuration  in  this  case  is 
no  longer  symmetric  and  unlike  the  ISM ,  we  cannot  reduce 
the  numerical  size  of  the  eigenvalue  problem.  Therefore  for 
the  FSM  we  restrict  the  mode  number  to  N  =10,  and  expect 
the  convergence  error  to  be  within  ±7%. 

IV.  Surface  plasma  waves  and  crossed  gratings 
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It  has  been  experimentally  demonstrated  that  inductive 
grids,  or  metal  films  perforated  with  sub  wavelength  hole 
arrays,  have  a  higher  transmittance  than  that  predicted  by 
single  aperture  theory31.  Transmittance  efficiencies  showing 
a  1000-fold  increase31  are  observed  in  subwavelength  induc¬ 
tive  grids  when  compared  with  the  results  of  single  aperture 
theory.  This  phenomenon  is  attributed  to  the  resonant  cou¬ 
pling  of  the  incident  light  to  the  SPW.  The  wavelength  at 
which  SPW  coupling  occurs  is  described  by  wavevector 
conservation,  which  is  essentially  a  ‘kinematic’  model.  This 
model,  which  gives  the  in -plane  momentum  necessary  to 
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couple  to  SPWs,  explains  the  wavelength  dependent  fea¬ 
tures  of  the  experimental  data.  However,  a  detailed  analysis 
of  the  magnitude  and  variation  of  the  coupling  requires  a 
rigorous  diffraction  model.  We  apply  both  the  ‘kinematic’ 
model  and  RCWA  to  study  metallic  crossed  gratings. 

To  illustrate,  consider  first  the  surface  plasma  wave  equa¬ 
tion  that  gives  the  momentum  conservation  condition  neces¬ 
sary  to  couple  normally  incident  radiation  to  an  infinite 
metal  sheet32 


diets  coupling  to  the  (0,  ±1)  or  (±1,  0)  substrate-metal  SPW 
at  X  =  4.16  pm.  The  results  of  the  RCWA  model  for  induc¬ 
tive  grids  are  presented  in  .  1,  which  shows  the  transmit¬ 
tance  and  absorptance  of  a  grid  with  a  thickness  of  0.1  jam 
and  a  square  air  aperture  of  0.36  pm  .  The  absorptance  A  is 
simply  given  by 

A  =  l -LK-Lt*.  (4) 
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Here  kn  is  the  projection  of  the  incident  wave  vector  par¬ 
allel  to  the  dielectric -metal  interface,  A,  is  wavelength  of 
incident  light  and  8m  and  8d  are  the  complex  permittivities  of 
the  metal  and  the  incident  medium  respectively.  A  metal 
film  periodically  perforated  with  holes  can  be  considered  as 
a  perturbation  of  smooth  metal  film,  and  Eq.  (1)  may  be 
used  to  determine  the  coupling  to  surface  plasma  waves  for 
inductive  grids.  For  the  special  case  of  a  square  lattice  and 
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normally  incident  radiation,  /cM  is  given  as  - ^ l  +  J  , 

A 

where  A  is  the  pitch  of  the  grating  and  i,  j  are  integers  (at 
least  one  of  which  is  nonzero).  Substituting  into  Eq.  (1),  we 


get 
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The  dielectric  medium  in  this  equation  is  either  the  super¬ 
state  (assumed  to  be  air)  or  the  substrate  (silicon).  The  air- 
metal  interface  and  the  substrate-metal  interface  will  each 
provide  a  set  of  (i,  j)  for  Eq.  (2)  corresponding  to  air-metal 
SPW  coupling  and  substrate-metal  SPW  coupling  . 

A  phenomenon  closely  associated  with  the  appearance  of 
SPW  coupling  in  metallic  gratings  is  known  as  Wood’s 

anomaly  34  This  is  the  condition  where  the  wavevector  of  a 
given  grating  order  becomes  tangent  to  the  grating  plane, 
and  is  marked  by  rapid  variations  in  the  intensity  of  the  re¬ 
maining  orders.  The  wavelength  condition  for  Wood’s 
anomalies  for  normally  incident  illumination  on  a  crossed 
grating  is  given  as 3 1 


hi_=  A 

a  ~W TFT 


(3) 


Here  8^  is  the  permitivity  of  the  superstrate  or  substrate, 
and  accordingly,  the  air  and  silicon  interfaces  each  give  a  set 
of  Wood’s  anomalies.  From  Eqs.  (2)  and  (3),  it  may  be 
noted  that  for  metallic  gratings  where  lej  »  8^,  the  wave¬ 
length  conditions  for  SPW  coupling  and  Wood’s  anomalies 
are  very  close. 

Now,  let  us  consider  the  case  of  a  patterned  gold  film  on 
a  silicon  substrate  with  a  pitch  of  1.2  pm.  Equation  (2)  pre¬ 


where  and  T^  are  the  diffraction  efficiencies  for  order 
‘d’  in  the  incident  and  transmitted  media  respectively.  The 
modeling  presented  in  this  figure  takes  the  substrate  to  be 
infinite,  includes  the  dispersion  of  dielectric  function  of 
gold35  and  takes  the  index  of  silicon  to  be  3.44  for  2-5  pm 
wavelength  range.  Also,  the  transmittance  is  defined  to  be 
the  sum  of  diffraction  efficiencies  in  all  the  transmitted  or¬ 
ders.  Figure  1  also  shows  the  absorptance  of  a  homogeneous 
0.1  jam  thick  gold  film  on  a  silicon  substrate;  the  difference 
between  the  absorptance  spectra  of  the  inductive  grid  and  a 
gold  film  is  indicative  of  the  power  carried  by  SPWs.  From 
Fig.  1  it  may  be  seen  that  the  SPW  wavelength  as  calculated 
from  Eq.  (2)  matches  the  rigorous  diffraction  model  very 
well  although  the  exact  wavelengths  of  SPW  coupling  are 
offset  slightly  from  those  given  by  Eq.  (2),  (e.g.,  for  the  (0, 
1)  substrate-metal  coupling  the  transmittance  reaches  a 
maximum  at  X  =  4.28  pm  instead  of  A  =  4.16  jam).  Note  that 
we  take  the  transmittance  maximum  as  the  SPW  coupling 
wavelength33.  A  possible  reason  for  this  wavelength  differ¬ 
ence  as  obtained  from  wavevector  conservation  (Eq.  (2)) 
and  RCWA  is  the  fact  that  Eq.  (2)  is  a  perturbation  on  the 
SPW  dispersion  relation  for  homogenous  metal  films  and 
therefore  can  only  describe  the  SPW  coupling  wavelength 
approximately.  Also  shown  in  Fig.  1  are  examples  of  the 


Figure  1:  Transmittance  and  SPW  coupling  for  an  inductive 
grid  with  square  air  holes  in  gold  on  a  silicon  substrate.  Refer¬ 
ring  to  the  inset,  A  =  1.2  pm,  d  =  0.6  pm,  and  h-  0.1  pm. 
Wavelength  resolution  for  the  numerical  modeling  is  0.02  pm 
and  index  of  silicon  is  taken  as  3.44,  independent  of  wave¬ 
length.  The  locations  of  SPW  coupling  for  air-metal  and  sili¬ 
con-metal  are  obtained  from  Eq.  (2)  and  are  labeled  by  solid 
vertical  line  segments.  The  locations  of  Wood’s  anomalies  are 
obtained  from  Eq.  (3)  and  are  labeled  as  dashed  vertical  line 
segments.  Also  shown  in  the  figure  is  the  absorptance  of  a  0.1 

pm  thick  gold  film  on  a  silicon  substrate. 
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air-metal  SPW  (denoted  by  solid  vertical  lines),  substrate- 
metal  SPW  (shown  by  solid  vertical  lines)  and  Wood’s 
anomalies  for  the  incident  and  transmitted  media  (denoted 
by  dashed  vertical  lines).  For  the  case  of  inductive  grids,  we 
generally  observe  well-defined  transmittance  minima  and 
maxima  corresponding  to  the  Wood’s  anomalies  and  SPW 
coupling,  respectively33. 

We  now  study  the  applicability  of  the  ‘kinematic’  model 
to  capacitive  grids.  Clearly  some  additional  evidence  is  nec¬ 
essary  to  apply  Eq.  (2),  as  unlike  the  inductive  grids  where  a 
continuous  metal  film  periodically  perforated  with  holes 
allows  for  extended  conduction,  the  capacitive  grids  are 
metal  islands  having  no  conduction  between  them.  It  re¬ 
mains  to  be  shown  that  SPWs  are  established  in  capacitive 
grids. 

One  of  the  important  features  of  SPW  s  is  the  energy  car¬ 
ried  by  these  surface  waves  near  the  coupling  wavelengths. 
This  is  evident  for  inductive  grids  in  Fig.  1,  where  we  ob¬ 
serve  maxima  in  the  absorptance  spectrum  near  the  location 
of  SPW  coupling.  Fig.  1  also  shows  that  the  energy  carried 
by  the  SPW  occurs  in  a  narrow  spectral  band.  Furthermore, 
the  features  associated  with  Wood’s  anomalies  and  SPWs 
coupling  affect  the  transmittance  spectrum  differently; 
Wood’s  anomalies  occur  as  minima  in  transmittance  while 
SPW  coupling  occur  as  maxima  in  transmittance.  Also  note 
that  the  wavelengths  of  the  Wood’s  anomalies  (as  defined 
by  Eq.  (3))  and  of  the  SPW  coupling  (as  defined  by  Eq.  (2)) 
are  almost  superimposed.  We  now  examine  the  absorptance 
spectrum  of  capacitive  grids.  Figure  2  shows  the  transmit¬ 
tance  and  absorptance  for  a  capacitive  grid  having  a  pitch  of 
A  =  1.2  qm,  a  grating  thickness  of  0.1  jam  with  square  gold 
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Figure2:  Transmittance  and  SPW  coupling,  for  a  capacitive  grid 
with  square  gold  scatterers,  on  a  silicon  substrate.  Referring  to 
the  inset,  A  =  1.2  pm,  d  =  0.6  pm,  and  h  =  0. 1  pm.  The  wave¬ 
length  resolution  of  numerical  modeling  is  0.02  pm  and  index  of 
silicon  is  taken  as  3.44  for  all  wavelengths.  The  locations  of 
SPW  coupling  for  air-metal  and  silicon-metal  are  obtained  from 
Eq.  (2)  and  are  labeled  by  solid  vertical  line  segments.  The  loca¬ 
tions  of  Wood’s  anomalies  are  obtained  from  Eq.  (3)  and  are 
labeled  as  dashed  vertical  line  segments.  Apart  from  the  (0,  1) 
substrate-metal  SPW,  coupling  to  higher  order  SPWs  is  very 
weak  for  this  particular  capacitive  grid  and  is  only  visible  as 
discontinuity  in  the  slope  of  transmittance  spectrum.  Also  shown 
is  the  absorptance  of  0.1  pm  thick  gold  film  on  a  silicon  substrate 
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scatterers  of  area  0.36  pm  .  From  the  absorptance  curve  in 
Fig.  2  we  observe  SPW -like  features  near  the  (0,  1)  sub¬ 
strate-metal  coupling;  also  in  contrast  to  the  previous  induc¬ 
tive  grid,  the  energy  carried  by  these  plasma -like  waves  is 
spread  over  a  wider  spectral  band,  which  may  be  indicative 
of  high  damping  of  these  plasma -like  waves.  The 
transmittance  spectrum  of  this  capacitive  grid,  unlike  that  of 
the  inductive  grid,  does  not  show  well-defined  minima  and 
maxima  near  the  locations  of  the  Wood’s  anomalies  and  of 
SPWs  coupling,  respectively.  Instead,  the  features  associ¬ 
ated  with  Wood’s  anomalies  and  SPWs  influence  the  trans¬ 
mittance  spectrum  in  a  similar  manner;  they  are  character¬ 
ized  either  by  a  minimum  in  the  transmittance  spectrum  (as 
shown  in  Fig.  2  for  (0,1)  substrate -metal  coupling)  or  by 
points  of  inflection  in  the  transmittance  spectrum  (as  shown 
in  Fig.  2  for  higher-order  substrate -metal  coupling).  The 
absorptance  spectrum  of  the  capacitive  grid  shows  a 
maximum  that  suggests  its  behavior  may  be  described  in 
terms  of  SPW  coupling. 

V.  Capacitive  and  inductive  grids 

From  Figs.  1  and  2,  we  observe  that  surface  plasma  wave 
features  occur  for  both  capacitive  and  inductive  grids.  We 
experimentally  verify  our  modeling  by  fabricating  inductive 
and  capacitive  grids  of  the  same  pitch  and  grating  thickness. 
The  transmittance  spectra  for  these  structures  are  shown  in 
Figs.  3  and  4  along  with  the  results  of  numerical  computa¬ 
tions.  Here  in  the  numerical  model  we  are  taking  into  ac¬ 
count  the  finite  thickness  of  the  silicon  substrate  (FSM).  The 
FSM  has  a  larger  convergence  error  than  the  ISM;  further¬ 
more  the  size  of  the  circular  scatterers  is  determined  by 
scanning  electron  microscopy  (SEM),  which  also  introduces 
some  measurement  error.  In  order  to  account  for  these  fac¬ 
tors,  we  carry  out  a  mean  square  fit  between  the  experimen¬ 
tal  and  numerical  data.  While  the  numerical  model  assumes 
illumination  with  a  normally  incident,  linearly  polarized 
plane  wave,  experimental  characterization  of  inductive  and 
capacitive  grids  is  done  using  an  unpolarized  source  in  the 
FTIR.  The  transmittance  or  reflectance  spectra  of  square 
lattices  under  normal  plane  wave  illumination  are  polariza¬ 
tion  independent2. 

Figures  3  and  4  show  that  the  experiment  and  numerical 
model  are  in  reasonably  good  agreement.  Furthermore,  the 
inductive  and  capacitive  grids  display  surface  plasma  wave 
features  at  approximately  the  same  wavelengths,  but  the 
strength  and  shape  of  surface  plasma  wave  coupling  is  very 
different  for  each  of  them.  For  inductive  grids  the  amount 
of  light  transmitted  away  from  the  SPW  resonance  is  small 
and  changes  in  transmittance  spectrum  are  sharp  and  narrow 
at  the  coupling  wavelengths,  whereas  in  capacitive  grids,  we 
note  a  gradual  change  in  transmittance  near  the  wavelengths 
of  SPW  coupling.  One  of  the  reasons  is  a  dependence  on  the 
grid  fill  factor  for  capacitive  and  inductive  grids,  FFC  and 
FF:,  defined  as 


for  circular  scatterers  in  capacitive  grids  and  as 


4 


W  a  velength(  jam) 


Figure  3:  A  comparison  between  experimental  and  rigorous  model¬ 
ing  results  for  transmission  through  an  inductive  grid  of  circular  air 
holes  in  gold  on  a  silicon  substrate.  Here  A  =  1.24  jam,  h  =  0.1  Jim 
and  the  air  hole  diameter  obtained  from  both  SEM  measurement 
and  mean  square  fitting  of  the  numerical  model  to  the  data  is  0.40 
jam.  The  wavelength  resolution  of  the  numerical  modeling  is  0.025 
jam. 
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for  circular  air  holes  in  an  inductive  grid.  Here  d  is  the 
diameter  of  circular  scatterers  for  capacitive  grids  or  of  the 
air  hole  for  inductive  grids.  The  structures  shown  in  Figs.  3 
and  4  have  a  fill  factor  of  91.8%  for  the  inductive  grid  and 
29.5%  for  the  capacitive  grid.  Also,  for  this  particular  induc¬ 
tive  grid  (shown  in  Fig.  3),  the  absolute  efficiency  of  trans¬ 
mittance  (obtained  by  dividing  the  fraction  of  light  transmit¬ 
ted  to  fraction  of  the  surface  area  occupied  by  holes6)  at 
positions  of  maxima  is  -0.25.  This  is  much  less  than  that 
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Figure  4:  Experimental  and  rigorous  modeling  results  for  trans¬ 
mittance  through  a  capacitive  grid  of  circular  gold  scatterers  on  a 
silicon  substrate.  Here  A  =  1.24  jam,  h  =  0.1  jam  and  diameter  of 
the  gold  scatterers  obtained  from  SEM  measurement  is  0.76  jam 
and  from  mean  square  fitting  of  the  numerical  model  to  experi¬ 
mental  data  is  0.75  jam.  The  wavelength  resolution  for  numerical 
model  is  0.025 jam. 
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Figure  5:  The  experimental  and  numerically  calculated  dependence 
of  transmittance  through  a  capacitive  grid  with  fill  factor.  The 
transmittance  is  shown  for  the  case  of  (0,  1)  substrate-metal  SPW 
coupling  at  X  =  4.3  jam.  Referring  to  the  inset  in  Fig.  2,  the  fill  fac¬ 
tor  for  a  capacitive  grid  is  given  as  (n/4)(d/A)2.  The  grid  consisted 
of  circular  gold  scatterers  on  a  silicon  substrate,  had  a  pitch  of  1.24 
jam  and  a  thickness  of  0.1  jam.  Note  the  ‘noise’  in  numerically  calcu¬ 
lated  transmittance  due  to  convergence  error  in  the  FSM.  The  insets 
in  the  figure  also  show  SEM  images  of  capacitive  grids  with  fill 

factors  of  a)  10.8%,  b)  19.6%  and  c)  34.3%. 

observed  by  Ebbesen  et.  al.6,  for  the  case  of  subwavelength 
hole  arrays  in  silver  films  made  on  a  quartz  substrate.  This 
apparent  discrepancy  is  a  candidate  for  further  study,  al¬ 
though  one  of  the  reasons  may  be  the  high-index  silicon 
substrate  used  in  this  study.  For  the  capacitive  grid  (shown 
in  Fig.  4),  it  may  be  noted  the  wavelength  location  of  SPW 
coupling  is  offset  between  the  experimental  data  and  m- 
merical  modeling;  this  difference  is  attributed  to 
convergence  error  in  the  FSM. 

VI.  Fill  factor  study 

We  experimentally  and  numerically  investigated  the  de¬ 
pendence  of  the  (0,  1)  substrate-metal  SPW  coupling  as  a 
function  of  the  fill  factor  for  capacitive  grids.  All  samples 
were  0.1  jam  thick  circular  gold  scatters  on  a  silicon  sub¬ 
strate  and  with  a  pitch  of  1.24  jam  Figure  5  shows  the  de¬ 
pendence  of  the  experimental  and  numerical  transmittance 
at  X=  4.3  jam  on  fill  factor.  For  the  modeling,  the  fill  factor 
is  varied  by  changing  the  diameter  of  circular  gold  scatterers 
from  0.4  jam  to  0.9  jam  in  steps  of  0.05  jam.  Numerical 
modeling  for  capacitive  grids  of  different  fill  factors  is  done 
using  the  FSM  and  a  mean  square  fit  was  not  performed  for 
the  modeled  and  experimental  measurements.  SEM  meas¬ 
urements  are  done  to  determine  the  size  of  the  circular  gold 
scatterers  and  the  transmittance  spectra  are  obtained  using 
FTIR.  We  note  from  Fig.  5  that  the  convergence  error  in  the 
FSM  appears  as  discontinuous  variations  in  the  slope  of  the 
numerically  calculated  transmittance.  Nevertheless,  the 
model  and  experiment  display  the  same  trend,  specifically, 
transmittance  through  the  grid  at  the  wavelength  of  the  (0, 
1)  substrate-metal  SPW  coupling  =  4.3  jam  in  this  case) 
decreases  monotonically  with  fill  factor. 
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VII.  Conclusion 

We  have  experimentally  validated  numerical  results 
based  on  RCWA  for  metallic  inductive  and  capacitive  grids 
in  the  mid-IR.  We  find  that  the  features  in  the  transmittance 
spectrum  of  both  inductive  and  capacitive  grids  are  defined 
by  coupling  to  surface  plasma  waves  and  the  RCWA  algo¬ 
rithm  adequately  describes  both  the  position  and  magnitude 
of  these  features.  A  systematic  study  of  the  behavior  of 
capacitive  grids  as  a  function  of  fill  factor  showed  that 
transmittance  decreases  monotonically  with  fill  factor  at  the 
wavelength  of  SPW  coupling. 
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Abstract — We  report  an  experimental  evaluation  of  the 
performance  of  silicon  (Si)  photodetectors  incorporating  one-di¬ 
mensional  (1-D)  arrays  of  rectangular  and  triangular-shaped 
nanoscale  structures  within  their  active  regions.  A  significant 
2  x )  enhancement  in  photoresponse  is  achieved  in  these  devices 
across  the  400-  to  900-nm  spectral  region  due  to  the  modification 
of  optical  absorption  properties  that  results  from  structuring 
the  Si  surface  on  physical  optics  scales  smaller  than  the  wave¬ 
length,  which  both  reduces  the  reflectivity  and  concentrates  the 
optical  field  closer  to  the  surface.  Both  patterned  (triangular  and 
rectangular  lineshape)  and  planar  Ni-Si  back-to-back  Schottky 
barrier  metal-semiconductor-metal  photodetectors  on  n-type 
(~  5  x  1014  cm-3)  bulk  Si  were  studied.  1-D  ~50-250-nm 
linewidth,  ^1000-nm  depth,  grating  structures  were  fabricated  by 
a  combination  of  interferometric  lithography  and  dry  etching.  The 
nanoscale  grating  structures  significantly  modify  the  absorption, 
reflectance,  and  transmission  characteristics  of  the  semicon¬ 
ductor:  air  interface.  These  changes  result  in  improved  electrical 
response  leading  to  increased  external  quantum  efficiency  (from 
^  44%  for  planar  to  ~  81%  for  structured  devices  at  A  =700 
nm).  In  addition,  a  faster  time  constant  (^  1700  ps  for  planar  to 
~  600  ps  for  structured  at  A  =900  nm)  is  achieved  by  increasing 
the  absorption  near  the  surface  where  the  carriers  can  be  rapidly 
collected.  Experimental  quantum  efficiency  and  photocurrents 
results  are  compared  with  a  theoretical  photocurrent  model  based 
on  rigorous  coupled-wave  analysis  of  nanostructured  gratings. 

Index  Terms — Interferometric  lithography,  metal-semicon¬ 
ductor-metal  (MSM),  nanostructures,  photodetectors,  rigourous 
coupled-wave  analysis  (RCWA),  silicon. 


I.  Introduction 

METAL-semiconductor-metal  (MSM)  photodetectors  are 
attractive  for  many  applications  because  of  their  high 
sensitivity-bandwidth  product.  [1]  MSM  photodetectors  are 
generally  planar  devices  with  Schottky  barriers  on  either  side 
of  an  exposed  semiconductor  absorption  region.  The  electrodes 
are  often  interdigitated  to  increase  the  active  region  area  while 
optimizing  the  electric  fields  in  the  carrier  collection  region.  In 
the  ultraviolet  and  blue  spectral  regions,  Si  is  a  very  attractive 
material  [2]  for  MSMs  because  of  its  ready  integration  with 
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Si  electronics  and  comparable  carrier  saturation  velocities 
with  III-V  semiconductors.  However,  at  longer  wavelengths, 
the  relatively  weak  absorption,  resulting  from  the  Si  indirect 
bandstructure,  results  in  loss  of  both  sensitivity  and  speed  as 
carriers  generated  in  field-free  regions  below  the  depletion 
region  are  subject  to  recombination  losses  as  they  slowly 
diffuse  into  the  collection  region  of  the  MSM  detector. 

There  is  considerable  interest  in  photodetectors  for  near-IR 
wavelengths  (A  =  0.77  —  0.85  ji m)  for  data  communications 
applications  due,  in  part,  to  the  advantages  of  GaAs-based  diode 
lasers.  Both  Si  and  GaAs  are  possible  candidates  for  fabrication 
of  optical  receivers  [3].  GaAs  is  attractive  due  to  its  short  ab¬ 
sorption  length  (~  1.0  ji m  at  A  =  850  nm),  which  makes  it 
possible  to  have  large  bandwidth  combined  with  good  respon- 
sivity  [4],  but  integration  with  Si  electronics  remains  a  signifi¬ 
cant  issue.  Silicon  offers  the  potential  of  lower  fabrication  cost 
and  direct  integration  with  complementary  metal-oxide- semi¬ 
conductor  (CMOS)  technology;  the  difficulty  is  in  the  long  ab¬ 
sorption  length  (~  15  jim  at  A  =  850  nm)  that  leads  to  difficult 
tradeoffs  between  efficiency  and  speed  of  MSM  detectors.  Si 
optoelectronic  circuits  with  good  sensitivity,  but  poor  response 
times  have  been  reported.  [5]  Si  MSM  lateral  interdigitated  pho¬ 
todetectors  combine  high  speed  with  CMOS  process  compati¬ 
bility.  These  detectors  operate  at  very  high  frequencies  at  ultra¬ 
violet  [2]  and  visible  [6]  wavelengths. 

Some  attempts  have  been  reported  aimed  at  improving  the  Si 
MSM  detector  quantum  efficiency  at  visible  and  near  IR  wave¬ 
lengths  by  etching  deep  vertical  [7]  and  U-shaped  [8]  trench 
electrodes.  These  devices  exhibit  improvement  in  the  visible 
wavelength  range  in  comparison  to  planar  contacts.  However 
in  order  for  these  schemes  to  be  effective  for  the  near  IR  re¬ 
gion  the  trench  depths  would  need  to  be  several  tens  of  mi¬ 
crometers  making  the  fabrication  process  complicated  and  ex¬ 
pensive,  and  dramatically  increasing  the  device  capacitance.  Liu 
et  al.  [9]  reported  improved  response  speed  by  fabrication  of 
MSM  photodetectors  on  silicon-on-insulator  (SOI)  substrates. 
The  key  feature  in  speed  enhancement  is  the  buried  oxide  layer 
that  limits  the  active  Si  thickness.  By  reducing  the  Si  film  thick¬ 
ness  to  ~  100  nm,  a  photodetector  bandwidth  of  140  GHz  at  a 
780-nm  wavelength  was  achieved,  however,  at  the  cost  of  a  very 
low  quantum  efficiency  ~  1%.  Lee  et  al.  [10]  have  proposed  a 
MSM  detector  configuration  on  5-^m-thick  Si  membrane  using 
a  geometrical-texturing  scheme  for  enhanced  light  trapping  on 
the  back  surface.  Light  trapping  in  the  thin  membrane  results  in 
minimal  reduction  in  responsivity,  while  reducing  carrier  transit 
times;  however,  the  fabrication  is  complex  and  incompatible 
with  standard  Si  device  processing.  Other  attempts  to  improve 
the  absorption  by  using  hydrogenated  amorphous  Si  (<r-Si:H) 
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have  focused  on  modifying  the  long-range  structural  symmetry 
of  crystalline  Si  thereby  relaxing  the  /c-vector  selection  rule  for 
optical  transitions  [11].  However,  device  efficiency  is  reduced 
as  a  result  of  the  very  short  carrier  lifetimes  (~  1  ps)  in  amor¬ 
phous  Si,  and  again  integration  with  standard  Si  processing  is 
problematic. 

In  this  paper,  we  introduce  a  new  Si  MSM  device  concept 
that  integrates  a  1000-nm-deep  sub  wavelength  grating  on  the 
top  of  the  active  region.  As  we  demonstrate  below,  the  grating 
both  serves  as  an  effective  anti-reflection  coating,  reducing  the 
Si  reflectivity  over  a  broad  wavelength  range  and  at  the  same 
time  enhances  both  the  efficiency  and  the  speed  of  the  device  by 
coupling  energy  into  higher  diffraction  orders  that  are  absorbed 
closer  to  the  Si  surface.  These  gains  are  achieved  while  retaining 
the  capability  of  integration  with  standard  Si  CMOS  process 
flows.  In  this  paper,  we  report  on  one-dimensional  (1-D)  grating 
structures,  which  leads  to  polarization  anisotropy  in  the  detector 
response.  For  applications  that  require  polarization-independent 
response,  a  2-D  grating  structure  can  be  used. 

In  Section  II,  we  present  the  rigorous  coupled-wave  analysis 
(RCWA)  used  in  the  analysis  of  the  absorption,  reflection  and 
transmission  properties  of  1-D  Si  nanostructures.  The  fabrica¬ 
tion  details  of  the  nanometer  period,  1-D  Si  gratings  and  the 
MSM  photodetectors  are  presented  in  Sections  III  and  IV.  Sec¬ 
tions  V  and  VI  present  the  experimental  dc  dark-current  and 
spectral  photoresponse  results  respectively  and  introduce  a  de¬ 
tailed  model  to  explain  the  mechanism  behind  the  enhanced 
photoresponse  signal.  In  Section  VII,  we  present  the  experi¬ 
mental  time  response  results  and  provide  a  qualitative  interpre¬ 
tation  of  the  results  based  on  the  model  given  in  Section  VI. 
Section  VIII  concludes  with  an  analysis  and  discussion. 

II.  Reflection,  Transmission,  and  Absorption 
Properties  of  1-D  Nanostructure  Gratings 

The  electromagnetic  wave  interaction  with  nanoscale  grating 
structures  is  a  complex  function  of  both  optical  (wavelength, 
polarization,  angle  of  incidence)  and  structural  (period,  profile, 
line: space  ratio,  depth)  parameters  [12].  Analysis  of  these  ef¬ 
fects  starts  from  a  Floquet  expansion  of  the  scattered  and  trans¬ 
mitted  fields  in  terms  of  the  grating  wavevector 

km  =  ko  ^sin((9)  +  m^j  ;  k0  =  ^  (1) 

where  0  is  the  angle  of  incidence,  d  is  the  grating  period,  A  is  the 
wavelength,  and  m  is  an  integer.  A  well-developed  approach  is 
the  RCWA  [13],  where  the  inhomogeneous  dielectric  constant 
e(x)  in  the  grating  region  is  expanded  into  a  Fourier  series  that 
couples  the  equations  for  various  values  of  m.  A  RCWA-based 
code  was  used  to  model  the  reflectance  and  transmission  prop¬ 
erties  of  the  1-D  structured  gratings  presented  in  this  paper.  The 
absorption  ( Ag )  in  the  nano  structured  grating  region  can  be  cal¬ 
culated  by  energy  conservation  from 

Ag  =  l-J2Tm-J2Rm'  (2) 

m  m' 

where  to,  to'  =  0,  ±1,  ±2,  ±3  ...  are  the  diffraction  orders  in 
the  Si  and  the  air  space,  respectively.  The  wavelength  depen- 
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Fig.  1.  Reflection  and  transmission  diffraction  orders  for  normal  incident 
irradiance  for  silicon  nano  structured  gratings. 

dence  of  the  refractive  index  coefficients  used  in  the  RCWA 
analysis  here  was  taken  from  Palik  et  al.  [14].  The  reflection 
and  transmission  for  a  planar  polished  Si  surface  as  a  function 
of  wavelength  are  necessarily  restricted  to  the  zero  order.  On 
the  other  hand,  for  nano  structured  surfaces  as  a  result  of  the 
high  Si  refractive  index  ( n  ~  3.77  at  700  nm),  there  are  nu¬ 
merous  orders  propagating  in  the  Si  even  at  the  longest  wave¬ 
lengths  studied.  For  an  850-nm  period  grating,  there  are  four 
propagating  diffracted  orders  in  the  Si  at  a  wavelength  of  800 
nm  and  nine  at  400  nm,  while  in  the  air  there  are  two  and  three, 
respectively. 

Fig.  1  shows  a  schematic  of  the  reflection  and  transmission 
diffraction  orders  as  well  as  the  region  of  the  grating  resonant 
modes.  The  optical  power  in  the  reflected  zeroth  to  second 
diffraction  orders  and  transmitted  zeroth  to  fifth  diffraction 
orders  as  a  function  of  wavelength  are  shown  Fig.  2.  These 
RCWA  plots  are  for  transverse  magnetic  (TM)  polarization 
(defined  relative  to  the  grating  wavevector),  normal-inci¬ 
dence  irradiance  on  a  triangular  shaped  structure  with  period 
~  850  nm,  linewidths:  top  ~  50  nm,  middle  ~  300  nm, 
bottom  ~  830  nm,  and  depth  ~  1000  nm.  The  behavior 
is  quite  complex,  exhibiting  numerous  resonances  and  the 
intensity  shifts  between  the  various  diffracted  orders  as  the 
wavelength  is  varied.  These  resonances  are  due,  at  least  in  part, 
to  Fabry-Perot  resonances  of  the  grating  modes  in  the  vertical 
direction  in  these  1000-nm-deep  gratings. 

Since  the  grating  is  not  blazed,  the  corresponding  negative 
order  powers  are  equivalent.  The  cutoff  of  the  fifth  order  around 
675  nm  is  evident.  There  are  higher  orders  at  shorter  wave¬ 
lengths  that  are  not  shown,  but  are  included  in  the  data  anal¬ 
ysis.  For  nanostructured  grating  surfaces,  the  higher  diffracted 
orders  contain  a  significant  amount  of  energy  and  cannot  be  ne¬ 
glected.  Fig.  3(a)  shows  the  sum  of  all  reflection  orders,  sum 
of  all  transmission  orders,  and  the  absorption  in  the  triangular 
grating  region,  as  a  function  of  wavelength  for  TM  polarization. 
The  overall  trend  to  larger  grating  absorption  at  shorter  wave¬ 
lengths  is  due  to  the  increasing  Si  absorption  coefficient.  For 
comparison,  the  reflection  and  transmission  power  as  a  func¬ 
tion  of  wavelength  of  a  planar  polished  surface  is  plotted  in 
Fig.  3(b).  Note  the  dramatic  reduction  in  the  reflection  for  the 
triangular- structured  surface  compared  to  the  planar  surface. 
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Fig.  2.  RCWA  power  distribution  profiles  for  triangular  gratings  as  a  function  of  wavelength  for  normal  incidence  TM  polarization,  (a)  Zeroth,  first,  and  second 
reflection  diffraction  orders,  (b)  Zeroth,  first,  and  second  transmitted  diffraction  orders,  (c)  Third,  fourth,  and  fifth  transmited  diffraction  orders. 
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Fig .  3 .  (a)  Sum  of  all  reflected  and  all  transmitted  diffraction-order  powers  and 

the  absorption  in  the  nano  structured  grating  region  as  a  function  of  wavelength 
for  a  triangular  grating  for  normal-incidence,  TM-polarization  irradiation,  (b) 
Reflected  and  transmitted  power  for  a  planar  surface  as  a  function  of  wavelength. 

Fig.  4  shows  comparable  results  for  the  TM-incidence  reflec¬ 
tion,  transmission  and  grating  absorption  for  the  rectangular 
structures.  The  overall  behavior  is  quite  similar  to  the  triangular 
profile  structures  with  some  additional  structure  resulting  from  the 
more  well-defined  mode  structure  in  the  grating  region.  Similarly, 
TE-polarization  reflection,  transmission,  and  absorption  profiles 
have  been  evaluated  for  the  triangular  and  rectangular  shaped 
nanostructured  grating  using  the  RCWA  method.  In  Section  VI, 
we  will  use  the  diffractive-order  power  distribution  of  the  struc¬ 
tures  to  evaluate  the  photocurrents  of  the  structured  devices . 


-^-Grating  Reflection 
Grating  Transmission 


Fig.  4.  The  sum  of  all  reflected  and  all  transmitted  diffraction-order  powers 
and  the  absorption  in  nanostructured  grating  region  as  a  function  of  wavelength 
for  a  rectangular  grating. 

III.  Nanostructure  Grating  Fabrication 

Interferometric  lithography  (IL)  is  ideally  suited  to  inexpen¬ 
sive,  large-area  nanopatterning  and  was  used  in  the  nanostruc¬ 
ture  fabrication  process.  [15]  A  typical  IL  configuration  con¬ 
sists  of  a  collimated  laser  beam  incident  on  a  Fresnel  mirror 
(FM)  arrangement  mounted  on  a  rotation  stage  for  period  vari¬ 
ation.  [16]  The  interference  between  two  coherent  laser  beams 
results  in  a  periodic  pattern  at  a  period  of  A/2  sin  6  where  A 
is  the  optical  wavelength  and  20  is  the  included  angle  between 
the  interfering  beams.  The  nanoscale  grating  patterns  were  first 
formed  in  photoresist  followed  by  pattern  transfer  onto  the  un¬ 
derlying  substrate  using  reactive-ion-etching  (RIE)  in  a  parallel 
plate  reactor  using  CHF3/O2  plasma  chemistry  [17].  By  con¬ 
trolling  the  plasma  parameters,  the  profile  can  be  varied  from 
vertical  to  triangular.  Fig.  5(a)  and  (b)  shows  rectangular  and 
triangular-shaped  Si  gratings  a  period  of  ~  850  nm  and  depth 
of  1000  nm. 

IV.  MSM  Detector  Fabrication 

Both  rectangular  and  triangular  shaped  grating  structures 
were  fabricated  in  localized  active  areas  of  the  MSM  devices 
using  IL,  along  with  conventional  I-line  contact  mask  printing 
[18]  for  defining  the  device  active  areas  and  metal  contact 
regions.  For  comparison  purposes,  we  fabricated  identical 
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(b) 


Fig.  5.  Scanning  electron  microscope  (SEM)  images  of  the  nanostructured 
gratings  in  the  active  regions  of  the  MSM  photodetectors:  (a)  Top  view  of 
rectangular  shaped  gratings  (b)  Side  view  of  triangular  shaped  gratings. 

devices  with  planar  active  regions  side-by-side  with  the  nanos- 
tractured  devices. 

Fig.  6  shows  a  process  flow  sequence.  A  bare  Si  wafer  was 
coated  with  photoresist  (PR)  and  exposed  to  an  interference  pat¬ 
tern,  as  described  above.  A  30-nm-thick  blanket  layer  of  Cr  was 
then  deposited  by  e-beam  evaporation  and  a  lift-off  step  was  used 
to  create  an  array  of  Cr  lines  that  act  as  an  effective  RIE  etch  mask. 
The  wafer  was  again  coated  with  PR  layer  and  a  mask  was  then 
used  to  selectively  pattern  the  PR  to  expose  the  Cr  lines  in  lo¬ 
calized  regions  where  the  nanostructures  were  etched  into  the  Si 
substrate  by  RIE,  thus  defining  the  nanostructured  active  areas. 
After  the  RIE  step,  the  PR  and  the  Cr  etch  mask  lines  are  removed. 
There  is  considerable  damage  left  on  the  nanostructured  region 
by  the  RIE  step.  Two  rapid  thermal  anneal  (RTA)  steps  were  per¬ 
formed  next.  The  first  RTA  was  performed  at  900  °C  for  5  min 
in  a  nitrogen  environment  to  anneal  the  damaged  surface  [19], 
followed  by  a  second  RTA  step  for  3  min  at  450  °C  in  a  hydrogen 
environment  to  passivate  the  Si  surface. 

A  low-temperature,  evaporated  SiC>2  isolation  layer 
100-nm  thick)  was  e-beam  evaporated  on  the  sample  as 
an  insulating  layer  [Fig.  6(b)].  A  second  mask  was  used  to 
define  active  area  windows  in  the  oxide.  Windows  (ranging 
from  50  x  50  /mi2  to  150  x  150  /mi2)  were  opened  in  the 
oxide  using  a  buffered  oxide  etch.  Using  a  third  mask,  an 
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Fig.  6.  Process  flow  sequence  for  fabrication  of  nanostructured  and  planar 
MSM  photodetectors. 

interdigitated  electrode  pattern  with  contact  pads  extending 
onto  the  oxide  was  defined  and  50-nm-thick  nickel  Schottky 
barrier  electrodes  were  deposited  by  e-beam  evaporation  and 
lift-off  with  gaps  ranging  from  10  to  20  /mi  and  aspect  ratios  of 
5:1.  Thicker  50:300  nm  Cr:Au  bonding  metallization  was  then 
defined  and  evaporated  on  the  contact  pads  above  the  oxide 
using  a  fourth  mask  and  deposited  by  evaporation  and  lift-off. 
Fig.  7(a)  and  (b)  shows  optical  micrographs  of  fully  fabricated 
structured  MSM  and  planar  surface  MSM  devices,  respectively. 
The  active  areas  of  the  devices  shown  are  50  x  50  /mi2.  The 
interdigitated  electrode  gaps  are  10  /mi,  with  the  grating  lines 
running  perpendicular  to  the  interdigitated  contact  fingers. 
Note  that  the  active  area  of  the  structured  device  is  much  darker 
than  the  planar  device  due  to  the  reduced  grating  reflection. 

V.  Dark  Currents 

Dark  currents  as  a  function  of  applied  bias  are  shown  in  Fig.  8 
for  nanostructured  and  planar  surface  devices.  Typical  devices 
had  150  x  150  /mi2  active  areas  and  10-/mi  interdigitated  elec¬ 
trode  gaps.  For  both  devices,  the  dark  currents  were  compa¬ 
rable  up  to  27  V.  At  higher  biases,  the  structured  devices  typ¬ 
ically  exhibit  lower  leakage  than  the  planar  devices.  This  differ¬ 
ence  is  not  fully  understood  at  this  time.  However,  it  is  worth 
mentioning  the  lower  dark  current  in  the  textured  devices  sug¬ 
gests  the  possibility  of  higher  sheet  resistance  in  the  nanos¬ 
tructured  active  regions  compared  to  the  planar  surfaces  due  to 
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Fig.  7.  Optical  micrographs  of  fully  fabricated  MSM  photodetectors  with 
50  x  50-jum2  active  regions  and  10- (im  interelectrode  gaps,  (a)  Planar  surfaced 
active  area  device,  (b)  Nanostructured  (triangular)  active  area  device. 

the  smaller  cross-sectional  area  of  the  former.  In  addition,  re- 
active-ion-etching  (RIE)  of  the  Si  active  areas  introduces  deep 
energy  levels  (surface  states)  which  act  as  efficient  energy  re¬ 
combination  centers  that  can  lead  to  further  reduction  in  the  dark 
current  of  the  structured  devices  [20]-[22].  However,  the  pho¬ 
toresponse  results  suggest  that  surface  recombination  does  not 
play  a  major  role  in  the  response. 

VI.  Spectral  Response  Measurements,  Modeling, 
and  Analysis 

The  photoresponse  as  a  function  of  wavelength  was  measured 
using  a  continuous- wave  (CW)  xenon  lamp  source,  1/4-m 
monochrometer,  polarizer,  chopper,  transimpedance  amplifier 
(TIA),  detector  biasing-tee,  and  a  lock-in  amplifier.  The  lamp 
beam  was  focused  onto  the  monochrometer  entrance  slit.  The 
spectrally  filtered  output  from  the  monochrometer  was  focused 
at  normal  incidence  onto  the  active  region  of  the  MSM  devices. 
The  focused  optical  beam  spot  overfilled  the  active  area  of  the 
MSM  detector.  Both  structured  and  planar  150  x  150  ^m2  active 
area  devices  with  interdigitated  10- (im  electrode  gaps  were 
characterized  at  a  40- V  bias,  depletion  width  ^depletion  ~  10  fi m 
(for  n  rsj  5  x  1014  cm-3)  extending  fully  across  the  active  area. 

The  photoresponse  signal  for  both  the  rectangular  and  trian¬ 
gular  grating  MSM  devices  was  normalized  to  an  adjacent  planar 
device  photoresponse  signal.  Since  the  structured  and  planar 
devices  were  identical  in  all  aspects  except  for  the  nanostructures 
in  the  active  region,  we  expect  the  normalized  signal  to  reflect 
the  photocurrent  ratio  profile  between  structured/planar  devices, 
although  device-to-device  variations  in  the  photocurrent  of  about 
A  <  ±15%  were  observed  within  a  nominally  uniform  ~  1  cm2 
processed  area,  presumably  due  to  contaminants/defects  in 


Fig.  8.  Log  scale-plot  of  dark  current  for  structured  and  planar  devices  as  a 
function  of  bias  voltage. 
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Fig.  9.  Experimental  and  theoretical  plots  of  normalized 
(triangular-structured/planar)  MSM  device  photocurrent  responses  for  normal 
incidence:  (a)  TM  polarization  and  (b)  TE  polarization  irradiation.  The 
lower  theoretical  curve  includes  the  anti-reflection  properties  of  the  grating. 
The  upper  theoretical  curve  (closer  to  the  experiment)  shows  the  results  of 
a  calculation  accounting  for  the  smaller  absorption  depth  of  the  diffracted 
radiation  (see  text  for  details). 

the  metal/semiconductor  interface  junction.  These  variations 
contribute  a  small  dc  shift  to  the  normalized  curves.  We  sampled 
six  devices  of  each  kind  and  report  best  results. 

The  grating  structure  (rectangular  and  triangular)  parameters 
were  given  in  Section  II  for  the  characterized  device  results  pre¬ 
sented  in  this  section.  Figs.  9  and  10  show  experimental  re¬ 
sults  of  normalized  spectral  responses  as  a  function  of  wave- 
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Fig.  10.  Experimental  and  theoretical  plots  of  normalized 
(rectangular-structured/planar)  MSM  device  photocurrent  responses  for  normal 
incidence:  (a)  TM  and  (b)  TE  polarizations.  Theoretical  curves  as  in  Fig.  9. 


length  for  the  (triangular  and  rectangular)  shaped  nanostruc- 
tured  MSM  devices  for  both  TM  and  TE  polarized  normal  in¬ 
cident  irradiance.  As  seen  from  Fig.  9(a),  the  experimental  TM 
normalized  spectral  response  of  the  triangular  structured  device 
shows  a  broadband  enhancement  of  ~  2  x  over  the  entire  400- 
to  800-nm  range.  The  rectangular  profile  grating  devices  show  a 
somewhat  lower  overall  enhancement,  but  with  larger  enhance¬ 
ments  at  both  the  shortest  and  longest  wavelengths  investigated. 
The  noise  at  the  long-wavelength  end  results  from  the  low  signal 
levels  in  this  experimental  arrangement. 

Some,  but  not  all,  of  the  enhancement  is  due  to  the  anti-re- 
flection  properties  of  the  nano  structured  surfaces.  For  example, 
at  a  wavelength  of  700  nm,  the  reflectivity  of  a  planar  Si  sur¬ 
face  is  33%  compared  with  ~  3%  (triangular)  and  ~  7%  (rect¬ 
angular)  for  the  structured  devices.  In  order  to  evaluate  the  ab¬ 
solute  quantum  efficiency  (77)  at  this  wavelength,  a  CW  mode- 
locked  Ti:  AI2O3  laser  was  used  and  the  beam  was  focused  using 
a  10  x  lens  to  a  spot  size  smaller  than  the  10-/xm  interelec¬ 
trode  gap.  The  calculated  enhancement  for  the  triangular  struc¬ 
ture  is  1.44.  From  the  experiment,  the  photocurrent  density  of 
~  0.25  A/cm2  (planar)  and  ~  0.46  A/cm2  (triangular)  was 
measured  for  an  incident  optical  power  of  0.24  mW  or  (1.067 
W/cm2),  giving  an  enhancement  of  ~  1.85.  This  results  in 
an  external  quantum  efficiency  ??ext  of  ~  44%  (planar)  and 


TABLE  I 

Experimental  Results  of  Surface  Reflectance,  Responsivity,  External 
and  Internal  Q.E.  for  Planar  and  Nanostructured  Devices  at 
A  =  700  nm  for  TM  Polarization  at  Normal  Incidence 


Device 

Surface 

Reflection 

Responsivity 

(AAV) 

External 

Q.E. 

Internal 

Q.E. 

Planar 

-33% 

-0.23 

-44% 

-66% 

Structured 

-3% 

-0.43 

-81% 

-84% 

81%  (triangular)  and  an  internal  quantum  efficiency  [23]  ?/nt  = 
?7ext/ (1 — R)  of  66%  (planar)  and  84%  (triangular)  (see  Table  I). 

Simply  correcting  the  photoresponse  for  the  anti-reflection 
properties  of  the  gratings  gives  the  lower  theoretical  curves 
shown  along  with  the  experimental  curves  in  Figs.  9  and  10.  As 
can  be  seen  from  Fig.  9,  at  the  shorter  wavelengths,  the  results 
are  close  to  the  experimental  results,  however  as  the  wavelength 
increases  the  gap  between  the  theory  and  experiment  increases 
for  both  the  TE  and  TM  polarizations.  A  similar  trend  is  found 
for  the  rectangular  grating  case  in  Fig.  10.  The  RCWA  calcula¬ 
tions  are  sensitive  to  the  details  of  the  grating  lineshape  for  both 
the  rectangular  and  triangular  cases.  The  triangular  structures 
[Fig.  5(b)]  are  closer  to  ideal  then  the  rectangular  structures, 
which  have  curved  rather  than  square  bottoms.  Therefore,  the 
data  analysis  will  be  concentrated  on  the  triangular  gratings. 

We  postulate  that  the  additional  enhancement  in  the  quantum 
efficiency,  beyond  the  antireflection  properties  of  the  grating,  is 
due  to  carriers  generated  in  the  grating  region  and  to  redirec¬ 
tion  of  a  significant  part  of  the  power  transmitted  into  the  Si 
into  higher  diffraction  orders  that  propagate  at  an  angle  to  the 
normal  and  therefore  are  absorbed  closer  to  the  surface  where 
the  probability  of  collection  is  higher.  Note  from  the  above  anal¬ 
ysis  that  ?7int  of  the  planar  device  is  only  66%,  i.e.,  that  34%  of 
the  photoexcited  carriers  are  not  collected.  We  are  not  aware 
of  any  models  for  the  dependence  of  the  internal  quantum  effi¬ 
ciency  on  depth  within  the  semiconductor,  but  it  is  reasonable 
to  assume  that  this  efficiency  is  higher  for  carriers  excited  at  the 
surface  where  the  bias  fields  of  the  MSM  are  large  and  that  it 
decreases  as  the  carriers  are  generated  deeper  within  the  semi¬ 
conductor.  To  allow  a  calculation,  we  make  the  simple  ansatz 

r]  =  e_Q!c*  (3) 


where  z  is  the  depth  into  the  semiconductor  and  ac  is  a  scaling 
parameter,  determined  from  the  planar  device  experimental  re¬ 
sults,  that  reflects  the  assumption  that  carriers  generated  near 
the  surface  to  contribute  a  higher  quantum  efficiency  than  those 
generated  deeper  into  the  material. 

Then  the  total  external  quantum  efficiency  is  given  by 


^total  —^grating 
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where  Pm  and  are  the  power  diffracted  into  and  the  propaga¬ 
tion  angle  of  the  mth  order,  respectively,  P,  is  the  incident  power, 
and  a  is  the  Si  absorption  coefficient.  In  the  limit  of  ac  — ►  0,  i.e,. 
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TABLE  II 

Theoretical  Calculations  of  Component  and  Total  External  QE  at  A  =  700  nm  for 
TM  POLARIZATION  AT  NORMAL  INCIDENCE  BASED  ON  RCWA  PLOTS  OF  FlG.  2 


Contributing 
factor  TM 
polarization 
A=700nm 

Fraction 
of  Power 

Angle  of 
propagation 

(0t) 

Photocurrent 
density  per 
contributor 
factor 

Component 

Q.E. 

Partial 

Q.E. 

Surface  reflection 

-0.03 

-- 

Nanostructured 
absorption  (a) 

-0.10 

-- 

-  0.06A/cm2 

-  100% 

-  10% 

0th  order 

-0.04 

-0° 

-  0.0 16 A/cm2 

-63% 

-  2.5% 

±lst  order 

-0.05 

-13° 

-  0.02A/cm2 

-  63.6% 

-  3.2% 

±2nd  order 

-0.31 

-26° 

-  0.12A/cm2 

-  65.4% 

-  20.3% 

±3rd  order 

-0.33 

-41° 

-  0.1 4 A/cm2 

-  69.2% 

-23% 

±4th  order 

-0.14 

-61° 

-  0.07  A/cm2 

-  77.8% 

-  11% 

Total  Power  1.00 

Total  Photocurrent  density 
-  0.43A/cm2 

Total  Z(partial  Q.E) 
next -70% 

in  the  absence  of  any  dependence  of  the  collection  efficiency  on 
the  absorption  depth,  this  reduces  to  the  usual  result. 

Recall  that  the  planar  and  structured  devices  are  identical  in 
all  aspects  except  for  the  grating  structures;  therefore,  the  carrier 
collection  mechanisms  in  the  two  devices  should  be  the  same. 
If  we  estimate  the  fitting  parameter  ac  from  the  planar  device 
results,  it  should  be  directly  applicable  to  the  structured  de¬ 
vice.  For  normal  incident  irradiance  on  a  planar  surfaced  device, 
where  there  are  no  high-order  modes,  all  of  the  energy  is  con¬ 
tained  in  the  zeroth-diffraction  order  mode,  which  propagates 
at  #o  =  0.  As  shown  in  Fig.  3(b),  at  700  nm,  a  planar  silicon 
surface  reflects  ~  33%  of  the  incident  light,  or  Po/Pi  ~  0.67. 
The  absorption  coefficient  a  =  1.8  x  103  cm-1  at  this  wave¬ 
length,  and  therefore  from  (3),  ac  ~  8.8  x  102  cm-1.  We  can 
now  estimate  the  total  external  efficiency  for  the  structured  de¬ 
vices  by  using  the  RCWA  modeling  results.  From  Figs.  2  and  3, 
at  700  nm,  the  modeled  reflection  loss  is  ~  3%,  and  ~  10%  of 
the  incident  irradiance  energy  is  absorbed  in  the  nanostructured 
grating  region.  The  rest  of  the  incident  irradiance  energy  is  dis¬ 
tributed  as  follows:  1)  ~  4%  is  coupled  into  the  zeroth  order;  2) 
~  2  x  2.5%  is  coupled  into  the  =blst  order  with  a  propagating 
transmitted  angle  of  (9dbj  ~  ±13°  with  respect  to  the  normal; 
3)  ~  2  x  15.5%  is  coupled  into  the  ±2nd  orders  propagating  at 
~  ±26° ;  4)  ~  2  x  16.5%  is  coupled  into  the  ±3rd  orders  propa¬ 
gating  at  ~  ±41° ;  and  5)  ~  2  x  7%  of  the  energy  is  coupled  into 
the  ±4th  order  propagating  at  ~  ±61°.  Using  (4),  we  can  calcu¬ 
late  the  partial  quantum  efficiencies  for  each  order,  with  the  re¬ 
sults  shown  in  Table  II.  The  ratios  of  photocurrents  between  the 
structured  and  planar  devices  for  this  case  are:  ~  1.7  x  (~  70% 
total  external  quantum  efficiency)  for  TM  polarization,  which 
is  in  the  correct  direction,  but  insufficient  to  fully  explain  the 
experimental  results.  The  curves  in  Figs.  9  and  10  labeled  theo¬ 
retical  (RCWA  calculation)  show  similar  calculations  across  the 
spectral  region  for  both  TE  and  TM  polarizations  for  the  trian¬ 
gular  and  rectangular  gratings. 


TABLE  III 

Depletion  Depth  for  n  ~  5  x  1014  cm-3  Doped  Silicon  as  a  Function 
of  Applied  Bias  Voltage  Across  a  10-jum  Electrode  Gap 


Bias  (V) 

2.5 

5 

10 

15 

20 

25 

30 

35 

40 

Depletion 
Depth  (pm) 

-2.5 

-3.6 

-5.1 

-6.2 

-7.2 

-8.0 

-8.8 

-9.5 

-10.1 

VII.  Time  Response 

In  the  previous  section,  we  analyzed  the  impact  of  nanos¬ 
tructuring  the  Si  surface  on  the  CW  quantum  efficiency  of  the 
photodetectors.  Another  beneficial  aspect  of  generating  carriers 
closer  to  the  surface  is  faster  carrier  collection. 

Pulsed  response  measurements  are  taken  using  ~  150-fs 
duration  excitations  at  two  different  wavelengths  (900  and 
400  nm)  from  a  CW  mode-locked  Ti:Al203  laser  (doubled 
for  the  short  wavelength,  0.24-mW  average  power  for  both 
wavelengths,  7 7 -MHz  repetition  rate).  Photodetectors  were 
probe  tested  using  an  18  GHz  probe  and  a  high-speed  sampling 
oscilloscope.  The  laser  spot  size  was  ~  50  pm  in  diameter.  Al¬ 
though  the  spot  size  was  larger  than  the  interdigitated  electrode 
gap  (10  pm)  the  total  focal  spot  was  well  within  the  150  x  150 
pm2  active  region  of  the  devices  tested.  Normal-incidence  TM 
polarization  relative  to  the  grating  wavevector  excitation  was 
used  for  both  wavelengths.  At  900  nm,  the  absorption  depth 
extends  to  ~  20  pm,  while  at  400  nm  the  absorption  depth  is 
<  1  p m.  These  two  absorption  depths  provide  insight  regarding 
the  relative  contributions  of  carriers  generated  within  or  below 
the  depletion  region.  For  both  wavelengths,  the  time  response 
measurements  were  taken  for  bias  voltages  varying  from  2.5  to 
40  V.  Table  III  shows  the  depletion  width  at  each  bias  voltage. 

Before  the  experimental  data  and  analysis  is  provided,  it  is 
useful  to  review  [24]  the  three  primary  factors  that  impact  MSM 
photodetector  response  speed. 

1)  Drift  of  carriers  through  the  depletion  region.  At  high 
E  fields,  the  drift  velocities  of  both  electrons  and  holes 


1658 


IEEE  JOURNAL  OF  QUANTUM  ELECTRONICS,  VOL.  38,  NO.  12,  DECEMBER  2002 


5 

4 

> 

3 

CD 

TS 

13 

f  2 

£ 

<  1 


'  ■ '  '  i  i  1 1  '  i  1 1  i  1  1 1 1  1  i  1 1  1  1  1  i 1  1  1  1  rr~' 11  i  ■  '  ■  ' 

|  Triangular  (TM  Polarization) ; 

ji  900nm  2.5V  Bias  : 

jk  ; 

mV) 

CO 

o  o 

$  '  1  '  1 . ^ 

Triangular  (TM  Polarization) 

|  \  900nm  20V  Bias 

100 

>  80 
£ 

— r-i— , -r , — i ■  I  ■  ■  ■  ■  i  '  » ■ 

':f  T riangular  (TM  Polarization) 
:j\  900nm  40V  Bias 

:j  \  ' 

L  \ 

<13 

1 V'  • 

60 

\af\  : 

jl  V 

5  K  -^Planar  : 

■§  20 

\l!l  j 

T5 

ZS 

\\  -^-Planar 

S-  40 
£ 

|  \i  -®-  Planar  : 

|  ^  FWHM-  0.6ns  : 

Q_ 

E 

FWHM  -  0.42ns 

>  V  FWHM  ~  0.29  ns; 

--a- -Structured 

<10 

Via  -a- Structured 

< 

20 

0t 

--^--Structured 

FWHM  ~  0.5ns  : 

0i 

NX  FWHM  ~  0.37ns 

N^_FWHM  ~  0.24ns  ' 

[— i — i — i — i — i — i — i — i — i — i — i i i L_ — i i  i — P-r-i 

0  1 


2  3  4  5 

Time  (ns) 

(a) 


6  7  8 


2 

Time  (ns) 

(b) 


1 

Time  (ns) 

(C) 


Fig.  11.  Time-response  measurements  for  TM -polarization,  normal-incidence  irradiance  at  900  nm  as  a  function  of  bias  voltage,  (a)  2.5  V.  (b)  20  V.  (c)  40  V. 


in  Si  saturate  [25]  at  about  1  xlO7  cm/s.  Provided  the 
field  within  the  depletion  region  exceeds  the  saturation 
value  for  most  of  its  length,  we  assume  that  the  carriers 
move  with  a  constant  velocity  usat.  The  longest  transit 
time  t drift  =  W'Y'Udrift  result  when  carriers  are  gen¬ 
erated  near  one  edge  of  depletion  layer  width  W.  For  a 
10- pm  drift  region  in  Si,  Tdrift  ~  100  ps. 

2)  Diffusion  of  carriers  to  the  depletion  region.  The  time  it 
takes  for  carriers  to  diffuse  a  distance  d  is  raiff  =  d?  /  2D, 
where  D  is  the  carrier  diffusion  coefficient,  for  holes  in 
Si  and  d  ~  10  pm,  ~  40  ns  Thrift .  Those  carriers 
generated  within  the  depletion  region  respond  rapidly, 
while  those  generated  outside  give  rise  to  a  “slow”  tail. 

3)  Junction  and  parasitic  capacitance  effects.  A  metal-semi¬ 
conductor  junction  under  reverse  bias  exhibits  a  voltage- 
dependent  capacitance  caused  by  the  variation  in  stored 
charge  at  the  junction  represented  by  the  relation  Cj  = 
A/2{2e£sNd)1^2y~1^2 .  This  capacitance  is  usually  quite 
small  for  MSM  devices  as  a  result  of  their  planar  design 
(A  small,  depletion-width  large)  There  are  also  parasitic 
circuit  capacitances  associated  with  the  packaging  and  ca¬ 
bling  that  often  dominate  the  junction  effects,  as  well  as 
the  limiting  response  of  the  electronics.  Both  structured 
and  planar  detectors  have  an  identical  rcircuit  and  mea¬ 
surement  limitations  for  these  measurements. 

We  first  present  the  time  response  measurements  for  an  ex¬ 
citation  wavelength  of  900  nm  (absorption  length  of  ~  20  pm, 
planar  device  reflection  loss  ~  28%,  and  the  structured  device 
reflection  loss  ~  5%).  For  a  low  bias  voltage  of  2.5  V,  the  de¬ 
pletion  width  is  only  ~  2.5  pm,  extending  only  part  way  be¬ 
tween  the  electrodes.  In  this  case,  large  number  of  carriers  are 
generated  outside  the  depletion  region  and  the  device  time  is 
limited  by  carrier  diffusion.  The  time  response  signals  for  both 
structured  and  planar  devices  are  shown  in  Fig.  11.  Both  the 
peak  signal  amplitude  (planar  2.0  mV;  triangular  4.7  mV)  and 
total  area  under  the  curve  (planar  ~3.3  V-ps;  triangular  ~7.8 
V -ps)  are  significantly  smaller  for  the  planar  device.  The  ratio 
of  the  integrated  areas  is  2.4,  higher  than  the  ratio  of  the  external 
quantum  efficiencies  of  about  1.8  at  this  wavelength,  suggesting 
that  there  is  an  even  longer  response  tail  for  the  planar  devices 
due  to  carriers  generated  deep  in  the  material  that  contribute  to 
the  total  response.  The  peak  response  reflects  this  same  ratio, 
the  high  speed  response  is  somewhat  slower  for  the  textured  de¬ 


vice.  The  long  tail  for  both  devices  is  due  to  carrier  diffusion 
from  electron-hole  pairs  being  generated  outside  the  depletion 
region. 

As  the  bias  voltage  is  increased,  the  depletion  region  extends 
across  the  active  region  thus  increasing  the  drift  component 
signal  strength  as  seen  in  Fig.  1 1(b)  and  (c)  for  both  planar  and 
structured  devices.  The  peak  amplitude  of  the  time  response 
versus  applied  voltage  is  plotted  in  Fig.  12(a),  as  the  bias  is 
increased  more  carriers  are  collected  and  the  signal  becomes 
larger.  At  this  long  wavelength  of  900  nm,  no  saturation  in  car¬ 
rier  collection  is  observed  up  to  the  highest  applied  bias  where 
the  depletion  region  extends  fully  across  the  device,  but  only  to 
a  depth  of  ~  10  pm  (fringing  fields  from  the  surface  electrodes), 
whereas  the  absorption  depth  is  >  20  pm. 

To  further  analyze  these  results,  fast  and  slow  components 
of  the  time  constants  were  evaluated  from  the  slopes  of  the  log 
(time  response),  as  plotted  in  Fig.  12(b)  and  (c).  The  structured 
device  has  a  shorter  fast  response  time  at  low  voltages;  the  time 
responses  are  similar  at  high  applied  biases. 

For  a  400-nm  excitation,  the  absorption  occurs  within  the 
first  micrometer  from  the  surface,  e.g.,  almost  totally  within  the 
grating  for  the  textured  device.  At  this  wavelength,  the  planar 
device  reflects  about  ~  49%  of  the  incident  light  and  the  struc¬ 
tured  device  reflects  about  ~  7%.  Fig.  13  shows  the  time  re¬ 
sponse  for  the  structured  and  planar  device  at  biases  ranging 
from  2.5  to  40  V.  While  the  absorption  depth  is  short,  the  deple¬ 
tion  width  extends  only  part  way  across  the  transverse  dimen¬ 
sion  of  the  devices  at  low  biases  and  there  is  still  a  significant 
diffusion  contribution  to  the  response.  At  the  highest  bias,  the 
depletion  region  extends  across  the  active  region  of  the  device 
and  there  is  no  diffusion  contribution.  The  time  responses  are 
significantly  faster  than  at  the  longer  wavelength,  corresponding 
to  the  shortened  transit  times  for  this  short  wavelength  absorp¬ 
tion.  The  structured  device  has  a  larger  area  near  the  upper  part 
of  the  signal  (fast  component)  indicating  a  larger  total  power 
collection  due  to  the  improved  light  coupling  and  low  reflec¬ 
tion  losses  compared  to  the  planar  device.  The  integrated  time 
responses  [Fig.  14(a)]  are  roughly  equal  at  low  bias,  but  show 
the  expected  factor  of  ~  2  (from  the  CW  results)  at  high  bi¬ 
ases.  As  the  bias  is  increased,  more  carriers  are  collected  until 
saturation  is  reached,  in  contrast  to  the  long-wavelength  results 
where  saturation  was  not  observed  [Fig.  13(a)].  Fig.  14(b)  shows 
the  carrier  time  constant  curve  as  a  function  of  bias  voltage;  at 
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(a)  (b)  (c) 

Fig.  12.  Comparative  analysis  of  the  time  response  of  triangular- structured  and  planar  devices  for  normal  incident  irradiance.  (a)  Signal  amplitude  as  a  function 
of  bias  voltage,  (b)  Fast  time-decay  constant  as  a  function  of  bias  voltage,  (c)  Slow  time-decay  constant  as  a  function  of  bias  voltage. 
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Fig.  13.  Time-response  measurements  for  TM  polarization,  normal-incidence  irradiance  at  400  nm  as  a  function  of  bias  voltage,  (a)  2.5  V.  (b)  20  V.  (c)  40  V. 
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Fig.  14.  Comparative  analysis  of  the  time  response  of  triangular- structured 
and  planar  devices  for  normal  incident  irradiance.  (a)  Signal  amplitude  as  a 
function  of  bias  voltage,  (b)  Time-decay  constant  as  a  function  of  bias  voltage. 


higher  biases  the  response  time  reaches  an  instrumental  limit  of 
~  100-150  ps  for  both  planar  and  structured  devices.  The  evi¬ 
dence  of  undershoot  seen  in  Fig.  13  also  confirms  that  the  re¬ 
sponse  is  instrumentally  limited. 

VIII.  Analysis  and  Conclusion 

We  employed  the  optical  reflection,  transmission,  and  absorp¬ 
tion  properties  of  Si  nanoscale  structures  to  improve  the  per¬ 
formance  of  silicon  MSM  photodetectors.  MSM  photodetectors 
were  fabricated  with  and  without  nanostructures  in  the  active 
area  for  comparison  purposes.  Interferometric  lithography  was 
used  for  the  nanopatteming  the  structures  and  conventional  op¬ 
tical  lithography  was  used  for  defining  the  device  active  and 
metal  contact  areas.  Devices  characterized  had  a  150  x  150  fim2 
area  with  10-/xm  interdigitated  electrode  gaps. 

RCWA  was  used  to  model  the  reflection  and  transmission 
diffraction  order  modes  as  a  function  of  wavelength.  The  power 
and  angle  of  transmission  of  the  transmitted  diffraction  higher 
orders  as  a  function  of  nanostructure  geometry  is  the  key  mech¬ 
anism  that  improved  the  device  performance  by  generating  car¬ 
riers  near  the  surface  of  the  active  region.  The  internal  quantum 
efficiency  increased  from  ~  66%  to  ~  84%  at  A  =  700  nm.  By 
increasing  the  absorption  near  the  surface,  we  not  only  increase 
the  total  carrier  collection  efficiency,  but  also  improve  the  time 
response  of  the  signal.  This  was  shown  by  the  time  response  data 
signals  at  A  =  900  nm  that  show  the  fast  part  of  the  signal  orig¬ 
inating  from  higher  diffraction  order  photo  generated  carriers 
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that  are  collected  near  the  surface  have  a  narrower  width.  How¬ 
ever,  the  slower  component  of  the  time  response  signal,  origi¬ 
nating  from  the  zeroth-order  mode,  is  comparable  in  both  planar 
and  structured  devices,  due  to  the  same  absorption  depth  from 
which  the  photogenerated  carriers  are  diffusing  from  into  the  de¬ 
pletion  region.  In  order  to  improve  the  complete  time  response 
signal,  we  need  to  investigate  a  structured  design  that  eliminates 
the  deep  carriers.  For  future  work,  we  propose  to  fabricate  sim¬ 
ilar  devices  in  silicon-on-insulator  materials  where  the  absorp¬ 
tion  depth  is  restricted  by  the  device  structure  rather  than  the 
intrinsic  Si  absorption. 
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Radiation  from  a  dipole  embedded  in  a  multilayer  slab 
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An  analytical  solution  for  the  radiation  emitted  from  a  dipole  embedded  in  an  arbitrary,  planar  dielectric  film 
stack  is  presented.  The  calculation  uses  a  rigorous  Hertz-vector  formalism  to  treat  the  electromagnetic  bound¬ 
ary  conditions.  The  radiation  fields  are  then  evaluated  in  a  far-field  approximation  to  get  the  radiated  fields  far 
from  the  dipole.  Both  two-dimensional  (2D)  emission  into  bound  modes  of  the  dielectric  stack  and  three- 
dimensional  (3D)  emission  into  radiation  fields  above  and  below  the  stack  are  evaluated.  These  solutions  are 
explored  for  two  simple  cases:  a  InGaAs  slab  symmetrically  clad  with  up  to  four  high-contrast  (A1203  /GaAs) 
Bragg  mirror  pairs  and  semi-infinite  air  spaces,  and  a  similar  asymmetric  structure  with  a  GaAs  substrate  on 
one  side.  The  symmetric  structure  supports  both  2D  bound  and  3D  radiation  fields.  The  asymmetric  structure 
only  supports  3D  radiation  fields  since  there  are  no  strictly  bound  modes,  but  “leaky”  modes  appear  that  are 
very  similar  to  the  bound  modes  in  the  symmetric  structure  except  that  the  radiated  power  ultimately  is 
transmitted  into  the  substrate  in  a  very  highly  directional  beam.  This  calculation  is  applicable  to  a  wide  range 
of  solid-state  photonic  devices,  including  vertical-cavity  and  edge-emitting  lasers,  spontaneous  light-emitting 
diodes,  and  photodetectors. 
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I.  INTRODUCTION 

The  coupling  of  an  electric  dipole  with  electromagnetic 
fields  in  solids  is  fundamental  to  solid-state  light  emitting 
and  detecting  devices  (LEDs,  lasers,  displays,  photodetec¬ 
tors,  etc.).  In  a  uniform  infinite  medium  the  dipole  radiation 
fields  are  very  well  known.  The  study  of  these  radiation 
fields  in  the  presence  of  metallic  and  dielectric  interfaces  has 
been  a  recurring  topic  in  electromagnetism.  Sommerfeld 
[1,2]  made  seminal  contributions  to  the  solution  that  provide 
the  basis  of  the  work  presented  here.  He  was  concerned  with 
issues  of  long- wavelength  radio  wave  propagation  over  the 
surface  of  the  earth  and  addressed  the  single  interface  prob¬ 
lem  of  a  dipole  above  a  lossy  dielectric.  As  is  very  well 
known  [3,4],  the  addition  of  a  second  interface,  e.g.,  a  dipole 
embedded  within  a  dielectric  slab,  adds  significant  complex¬ 
ity  to  the  problem,  since  the  radiation  is  now  distributed 
between  three-dimensional  (3D)  radiation  out  the  sides  of  the 
slab  and  the  two-dimensional  (2D)  bound  modes  supported 
by  the  slab.  With  the  development  of  multilayer  structures 
with  very  high,  and  precisely  tailorable,  reflectivities,  such  as 
vertical-cavity  lasers,  resonant  cavity  LEDs,  and  photodetec¬ 
tors,  there  is  a  need  for  a  more  detailed  understanding  of  the 
distribution  of  the  radiation  between  these  components  and 
the  corresponding  angular  and  thickness/wavelength  depen¬ 
dencies.  Both  approximate  analytic  treatments  [5,6]  and  nu¬ 
merical  solutions  [7]  have  appeared  in  the  literature.  Re¬ 
cently,  two  equivalent  detailed  analytical  treatments  for  the 
simple  slab  case  have  been  reported  [4,8].  Both  of  these 
treatments  presented  fully  analytic  results  for  the  radiation 
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into  both  the  2D  bound  modes  and  the  3D  radiation  modes 
and  investigated  in  detail  the  dependence  on  dielectric  con¬ 
trast  and  slab  thickness. 

The  purpose  of  this  paper  is  to  extend  these  results  [4]  to 
arbitrary  multilayer  dielectric  structures  and  to  investigate  in 
particular  the  impact  of  high-reflectivity  Bragg  mirrors  on 
the  dipole  radiation.  The  analysis  is  extended  to  asymmetric 
cases,  including  mode-free  cases  that  support  “leaky”  modes 
that  are  not  bound  in  the  strict  sense,  but  rather  build  up 
significant  intensity  within  the  multilayer  structure  and  ulti¬ 
mately  radiate  their  power  into  3D  radiation  within  the  sub¬ 
strate. 

Section  II  presents  the  Hertz  vector  formulation  and  intro¬ 
duces  a  significant  algebraic  simplification  by  introducing 
both  electric  and  magnetic  Hertz  vectors.  Section  III  presents 
the  closed  form  analytic  results  for  arbitrary  multilayer  struc¬ 
tures.  Detailed  investigation  of  the  results  for  a  symmetric 
structure  with  high-reflectivity  Bragg  reflectors  is  presented 
in  Sec.  IV  A.  A  related  asymmetric  structure  with  a  high- 
index  substrate  is  analyzed  in  Sec.  IV  B.  Finally,  conclusions 
are  presented  in  Sec.  V. 


II.  HERTZ  VECTOR  FORMULATION 
A.  Geometry  of  the  problem 

The  geometry  of  the  problem  is  shown  in  Fig.  1.  A  dipole 
is  embedded  in  an  arbitrary  film  stack  taken  as  infinite  in  the 
(v,y)  directions.  The  z  direction  is  perpendicular  to  the 
films.  The  dipole  is  oriented  at  an  angle  0d  from  the  z  direc¬ 
tion  in  the  (x9z)  plane.  The  layer  stacks  above  and  below  the 
dipole  have  arbitrary  numbers  of  layers  with  arbitrary  rela¬ 
tive  dielectric  constants  (K^n?)  and  thicknesses  (L*).  The 
total  physical  thicknesses  of  the  top  and  bottom  cladding 
layers  from  the  position  of  the  dipole  to  the  edge  of  the 
outermost  layer  are  Lt_tot  and  Lb_tot .  For  the  sample  calcu¬ 
lations  presented  below,  the  inner  cladding  layers  (shaded) 


1063-65  lX/2003/68(3)/036608(13)/$20.00 


68  036608-1 


©2003  The  American  Physical  Society 


BRUECK,  SMAGLEY,  AND  ELISEEV 


PHYSICAL  REVIEW  E  68,  036608  (2003) 


*9 


FIG.  1.  Geometry  of  the  problem.  A  radiating  dipole  oriented  at 
an  angle  0d  in  the  (jc,z)  plane  is  embedded  in  the  center  of  a  infinite 
slab  of  material  of  relative  dielectric  constant  k0  and  thickness  L0 . 
The  slab  is  bound  with  arbitrary  film  stacks  ending  in  top  and  bot¬ 
tom  semi-infinite  media  of  relative  dielectric  constant  Kt  and  Kb . 
For  the  example  calculations,  the  shaded  inner  cladding  thicknesses 
L  are  identical  and  variable.  The  total  physical  thicknesses  of  the 
upper  and  lower  claddings  from  the  position  of  the  dipole  to  the 
edge  of  the  semi-infinite  media  are  indicated  by  Lt_tot  and  Lb_tot. 

have  identical  dielectric  constants,  Kn  =  Kbl  =  K  =  n2 ,  and  an 
equal  thickness,  Ltl  =  Lbl  =  L  that  is  varied  in  the  calcula¬ 
tion.  The  thickness  of  the  layer  containing  the  dipole  is  de¬ 
noted  as  L0 . 

B.  Hertz  vector  formalism 

In  our  previous  paper  [4]  we  used  the  Hertz  vector  for¬ 
malism  introduced  by  Sommerfeld  [1]  to  evaluate  the  radia¬ 
tion  from  a  simple  dielectric  slab  structure.  Lukosz  [9,10] 
has  developed  an  alternate  formulation  of  the  Hertz  vector 
source  terms  that  has  the  significant  advantage  that  the 
boundary  conditions  are  uncoupled  for  the  TE  and  TM  radia¬ 
tion  from  a  horizontal  dipole.  This  dramatically  simplifies 
the  algebra  of  the  calculation;  the  final  results  are  identical. 
Lukosz ’s  formulation  is  briefly  described  here  and  is  fol¬ 
lowed  in  the  evaluations  of  the  radiated  power. 

Lukosz  [9,10]  has  shown  that  the  electromagnetic  field  for 
any  orientation  of  the  dipole  can  be  represented  by  two  scalar 
functions,  <fi(x)  and  if/(x),  which  are  the  z  components  (per¬ 
pendicular  to  the  layers)  of  an  electric  and  a  magnetic  Hertz 
vector,  respectively, 

n(£)(f)= (o,o, </>(*)), 

riw(x)  =  (0,0,^(x)).  (i) 

Both  scalar  functions  satisfy  the  homogeneous  Hemholtz 
equation  in  source-free  regions,  viz., 

(V2- Kie0kl)(f>(x)  =  (V2- Kis0kl)tp(x)  =  0,  (2) 


where  s0  is  the  free-space  permittivity,  /c,  =  n2  is  the  relative 
dielectric  permittivity  of  the  medium,  and  k0  =  (x)/c  is  the 
magnitude  of  the  free-space  photon  wave  vector.  The  fields 
are  given  by 

E(x) =(o.vx  n(H\x ) + (  ke0)  - 1  v  x  v  x  ri(£)(f ), 

H(x) = - i wv x rL>(i) +w‘vxvx n(H\x).  (3) 

where  s0  and  jul0  are  the  free-space  values  of  the  electric  and 
magnetic  permeabilities,  respectively. 

From  Eqs.  (1)  and  (3)  it  is  straightforward  to  show  that 
c fi(x )  corresponds  to  a  TM  wave  and  if/(x)  describes  a  TE 
wave;  since  these  are  orthogonal  there  is  no  coupling  be¬ 
tween  them.  In  the  Sommerfeld  basis  [1],  there  is  only  an 
electric  Hertz  vector  with  components  both  in  the  z  direction 
and  in  the  x  direction;  the  boundary  conditions  at  interfaces 
are  coupled  which  algebraically  complicates  the  general  so¬ 
lution  for  an  arbitrary  film  stack. 

C.  Expansion  of  the  dipole  field  into  TE  and  TM  plane  waves 

To  begin  the  calculation,  it  is  necessary  to  express  the 
field  of  a  dipole,  located  at  the  origin  in  an  infinite  medium, 
in  a  superposition  of  TE  and  TM  plane  waves.  Then  the 
boundary  value  problem  for  each  independent  plane  wave 
can  be  directly  solved  by  standard  thin-film  electromagnetic 
methods;  finally  the  results  are  transformed  back  to  real 
space  using  the  techniques  developed  by  Sommerfeld 
[1,2,11]. 

The  infinite-medium  real-space  electric-Hertz  vector  cor¬ 
responding  to  a  dipole  is  simply 

einikor 

^x)  =  PoT—,  (sin  6^4  + cos  0dez),  (4) 

'  **  k o ' 

where  pQ  is  the  dipole  moment,  (ex,ey)  are  unit  vectors  in 
the  corresponding  directions.  The  total  radiated  power  for 
this  dipole  in  an  infinite  medium  is  given  by  the  well-known 
result  [12] 

n  _k40^(ni)\p0\2 

^inf  2  (5) 

127T77o£0 

where  y0=  V julq/sq.  In  the  following,  radiated  powers  are 
shown  normalized  to  this  value.  The  corresponding  Fourier 
transform  of  the  infinite-medium  Hertz  vector  is 

riL£)(£)=  —  2  ^2 - (sin  0dex  +  cos  0dez) 

S0Kik0  K  ~Ki 

=  ^(K)( sin  0dex  +  cos  Qde^),  (6) 

where  k0K  is  the  transform  coordinate.  This  expression  can 
be  put  in  a  more  suitable  form  for  the  planar  geometry  of  the 
problem  by  carrying  out  the  inverse  transform  in  the  z  direc¬ 
tion  and  expressing  the  remaining  integrals  in  cylindrical  co¬ 
ordinates, 
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®(p,z) 


—  f  —  e~y‘lzJ0(Xp)XdX 
k,  Jo  T« 

J  -oo  Yi 


(V) 


where  k0X  is  the  radial  transform  coordinate  \\2=^ljrr]1 
with  k0£  (k0y)  the  x(y) -directed  transform  coordinate],  p 
=  k0p  and  z  =  k0z  are  the  dimensionless  radial  and  vertical 
spatial  coordinates,  and 

yt=  yj\2  —  Ki - >  ~int,  (8) 

x^o 


It  is  a  standard  multiple-interface  thin-film  matrix-analysis 
problem  to  solve  for  the  Hertz  potential  everywhere.  The 
result  is 


,v  __ 
00  “ 


Po 
2  Kx 


Hl0(Xp)XdX 
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e-yoLoRP(i+e-yoLoRP)  _  _ 

- — - (>  yoz 

l-e~2yoLo  rPrp 


e 

+  - 


yoLoRP(l+e~yoLoRP) 
1  —  e~2y°L°RPRP 


eyo2  +  e_roUI 


(ID 


where  the  sign  of  the  square  root  is  chosen  to  ensure 
Re(y,)>0  for  convergence  of  Eq.  (7).  J0  is  the  zeroth-order 
Bessel  function  and  H q  is  the  Hankel  function  of  the  first 
kind  of  zeroth  order.  The  second  form  of  this  expression  will 
be  useful  for  evaluation  of  the  complex  integrations  since  the 
integration  extends  along  the  entire  X  axis. 

The  scalars  and  ijj  for  the  unbounded  medium  are  found 
in  terms  of  by  equality  of  the  z  components  of  the  electric 
and  magnetic  fields  from  Eqs.  (3)  and  (6),  giving 


0(£,  77;z)  =  <E(\;z)  cos  0d+i 


,€Vi  . 


sin  0, 


(9a) 


and 


for  the  slab  region  (subscript  0),  and 

<tf=  —  \  —Hl0(Xp)XdX 

2  Kt  J  -«>  yt 

TPe-y°L°,2(l+e~y°L°Rpb) 


l-e~2yoLo  RPbRP 


-yt(z-Lt-tot) 


<K  =  T^  [°  —HL0(Xp)XdX 
2KbJ-°°yb 


X 


Tpbe-yoLo,2(l+e~y°L°Rp) 


l-e-2y0^o  RPbRP 


e  +  yb(z  +  Lb  —  tot)  (12) 


^(^,77;z)  =  ‘&(X;z)l  77opsin^|,  (9b) 

where  the  (  — )  sign  in  Eq.  (9a)  is  appropriate  in  the  upper 
half  plane  (z>0)  and  the  (  +  )  sign  in  the  lower  half  plane 
(z<0).  For  a  vertical  dipole  (sin  0^=0)  there  are  only  TM 
waves  (only  a  single  scalar  function  cj)(^r]\z)  is  required); 
for  a  horizontal  dipole  (sin  9d=  1)  there  are  both  TM  and  TE 
waves  [both  scalar  functions  <p(^,rj;z)  and  if/(i;,7};z)  are 
required].  The  source  terms  for  a  horizontal  dipole  (sin  6d 
=  1)  depend  on  the  transverse  transform  coordinates  (£,77) 
which  results  in  a  dependence  of  the  radiated  fields  on  the 
polar  angle  9  where  9  =  0  is  the  radial  direction  parallel  to 
the  dipole. 


III.  EVALUATIONS  OF  THE  RADIATED  FIELDS 
AND  POWER 

A.  Boundary  value  problem  and  evaluation  of  the  Hertz 
potential;  vertical  dipole  (cos  0d=  1) 

The  boundary  conditions  for  (j)  at  an  interface  between 
media  1  and  2  are  [9] 


01  =  02> 

1  d(pi  1  d(f>2 
Kl  dz  K2  dz 


(10) 


for  the  top  [above  the  slab  and  top  film  stack  (subscript  t)] 
and  bottom  [below  the  slab  and  bottom  film  stack  (subscript 
b )]  semi-infinite  outer  cladding  regions.  In  these  expres¬ 
sions,  kQlLt_tot  and  kQlLb_tot  are  the  total  physical  thick¬ 
nesses  of  the  top  and  bottom  cladding  layers  from  the  posi¬ 
tion  of  the  dipole  to  the  edge  of  the  outer  cladding  regions  as 
shown  in  Fig.  1.  The  Hertz  potential  within  the  film  stacks  is 
readily  evaluated  from  these  results.  Here  the  R’s  and  T s 
refer  to  the  magnetic  field  reflectivity  looking  from  inside  the 
slab  from  the  multilayer  top  (bottom)  stacks  for  a  TM  wave 
propagating  upwards  (downwards)  in  the  slab  Rpt(Rpb)  and 
the  T9 s  are  the  corresponding  transmissions  through  the  en¬ 
tire  film  stacks.  In  the  limit  of  a  single  dielectric  interface  at 
the  top  of  the  slab  these  reduce  to  the  familiar  results  for  a 
TM-wave  incident  from  the  slab  towards  the  top  dielectric 
(and  similar  expressions  for  the  bottom  interface) 


*7 To"  *7)7; 
Kt7o+Koyt’ 


Kt  2Ktyt 

Tf  =  — (1  ~RP)  — - 3^—. 

k0  xtyo+K0yt 


(13) 


Comparing  this  result  with  Eq.  (10)  of  Ref.  [1]  for  a 
simple  dielectric  slab,  the  only  changes  are  (1)  the  substitu¬ 
tion  of  the  multilayer  refection/transmission  coefficients  for 
their  single  interface  values;  (2)  the  separate  identification  of 
top  and  bottom  reflection  and  transmission  coefficients  re- 
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quired  for  this  generalized  asymmetric  calculation;  and  (3) 
setting  the  dipole  position  to  the  center  of  the  slab  (a  =  0 
from  Ref.  [1])  since  any  shift  of  the  dipole  position  can  be 
accommodated  by  adding  additional  films  (with  the  same 
dielectric  constant).  The  bracketed  terms  have  simple  physi¬ 
cal  interpretations.  There  are  two  contributions  to  the  fields 
in  the  top  region,  one  direct  transmission  from  the  upward 
directed  plane  waves  of  the  dipole  and  a  second  reflected 
from  the  bottom  slab  interface.  The  resonant  denominator 
accounts  for  the  multiple  reflections  within  the  slab.  The  ze¬ 
ros  of  this  denominator  correspond  to  poles  of  the  integrand 
and,  thus,  the  radiation  into  2D  bound  waveguide  modes. 
There  are  branch  cuts  in  the  X  plane  as  a  result  of  the  mul¬ 
tivalued  %’s.  Integration  around  these  branch  cuts  gives  the 
3D  free-space  radiation.  The  inverse  transform  will  be  dis¬ 
cussed  following  the  presentation  of  the  boundary  value  so¬ 
lutions  for  the  horizontal  dipole  in  the  next  section. 


for  the  slab,  and 


1 

—  sin  (pHl(\p)dX 
7t 


r^-roV2(1+^-ro  lors) 

X  — _ - _ _  e  ~  JtG-  Lt-tot )9 

1  —  e~2y°L°RsbRst 


4>^= -  k0  cos  ipH])(\p)d\ 

2  Kt  J-°° 

Tpe ~ roLo/2( i-e~ voLorP)  _  _ 

X - — - £,  ~  tot)  ^ 

l~e~2yo  lorprp 


(16a) 


B.  Boundary  value  problem  and  evaluation  of  the  Hertz 
potential;  horizontal  dipole  (cos  0d=0) 

For  a  horizontal  dipole,  both  cp  and  p  must  be  evaluated. 
Since  the  boundary  conditions  are  uncoupled,  the  solution 
proceeds  straightforwardly  using  the  source  terms  in  Eq.  (9) 
for  6d=  it  12.  The  boundary  conditions  are  [9] 

<t>l  =  <t>2 >  *Al=</'2, 


1  d(f>i  1  d<p2  dif/ 1  dp2 
KX  dz  K2  dz  dz  dz 


(14) 


This  decoupling  of  the  boundary  conditions  for  the  two  sca¬ 
lar  functions  is  a  major  algebraic  advantage  of  the  Hertz 
vector  representation  introduced  by  Lukosz  [9].  The  solu¬ 
tions  again  are  obtained  by  straightforward  thin-film  analysis 
based  on  the  boundary  conditions  Eq.  (14)  and  the  source 
terms  Eq.  (9), 
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—  sin  <pH\{Xp)dX 
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„  Po  f00  k0  1 
fib=—y o  — sin  cp  H0(Xp)dX 
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Tshe~7^\i-^e-yoLoRs)  _  _ 
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l-e~2y°LoRsbRst 
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2  xbJ-°° 

Tphe~yoLo/2(l-e~yoLoRP)  _  _ 
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l-e~2yoLoRPRP 


(16b) 


for  the  top  and  bottom  semi-infinite  outer  cladding  regions. 
In  these  expressions,  Rstb,Tstb  are  the  electric  field  reflectivi¬ 
ties  for  TE  polarized  waves  incident  from  the  slab  on  the 
respective  interfaces.  In  the  limit  of  a  simple  dielectric  inter¬ 
face  these  reduce  for  the  top  interface  to 
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(15) 


R’= 


To  ~Yt 
To  +7t’ 


Ts=l-Rs  = 


ro+r/ 


(17) 


In  Eq.  (15),  the  minus  sign  in  the  parentheses  is  appropriate 
for  z> 0  and  the  plus  sign  for  z< 0.  As  mentioned  above,  the 
polar  angle  cp ,  defined  relative  to  the  orientation  of  the  dipole 
in  the  slab,  arises  from  the  presence  of  the  transverse  trans¬ 
form  coordinates  (£,77)  in  the  source  terms  for  the  horizontal 
dipole. 

The  physical  interpretation  of  these  equations  is  similar  to 
that  for  the  vertical  dipole.  Now  the  coupling  is  to  both  TE 
and  TM  radiation  and  bound  modes.  The  zeros  of  the  de¬ 
nominators  correspond  to  the  bound  modes  (TM  for  1 
- e~2y0L0RPbRP  =  0 ;  TE  for  \~e~2yoLoRsbRst  =  0).  In  each 
numerator  there  is  a  term  that  corresponds  to  the  radiation 
directly  incident  from  the  dipole  and  a  second  term  from  the 
radiation  reflected  from  the  opposite  interface.  For  the  poten¬ 
tial  within  the  slab  there  are  upward  and  downward  reflected 
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waves  and  the  source  terms.  For  the  potential  in  the  half¬ 
spaces  the  numerators  correspond  to  transmission  from  the 
source  at  the  center  of  the  slab.  The  denominators  account 
for  the  multiple  reflections,  and  include  all  near  field  terms; 
no  approximations  to  the  full  electromagnetic  calculation 
have  been  made  to  this  point.  These  approximations  will  be 
made  in  the  branch  cut  integrals,  which  are  evaluated  by  the 
method  of  steepest  descents  that  is  only  valid  far  (many 
wavelengths)  from  the  source  point,  and  in  the  evaluation  of 
the  contributions  from  the  poles  using  the  asymptotic  limits 
of  the  Hankel  function,  again  valid  many  wavelengths  from 
the  source  point.  There  are  branch  cuts  corresponding  to  each 
of  the  thin  film  layers.  In  the  half-spaces  above  and  below 
the  structure,  only  the  integral  around  the  branch  cut  corre¬ 
sponding  to  that  specific  medium  contributes  to  the  radiation 
fields;  the  other  branch  cuts  contribute  only  to  the  near  fields 
and  thus  are  excluded  from  the  subsequent  analysis. 


mode  index  (/)  [jjj=  7/-(X)|x=^].  The  corresponding  indi¬ 
ces  on  the  reflection  coefficients  are  suppressed  for  conve¬ 
nience. 

From  this  result,  the  next  step  is  to  evaluate  the  fields  and 
to  integrate  across  the  mode  profile  to  get  the  radiated  power 
with  the  result  for  the  central  slab  containing  the  dipole, 


A  2D 
rVJM 
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C.  Evaluation  of  the  inverse  transform 

The  apparatus  of  a  complex  analysis  can  now  be  applied 
to  the  remaining  integrals.  As  noted  above,  there  are  contri¬ 
butions  due  to  integration  around  the  branch  cuts  (corre¬ 
sponding  to  3D  radiation  into  the  semi-infinite  half-spaces 
above  and  below  the  structure)  and  due  to  poles  of  the  de¬ 
nominators  (2D  bound  modes).  The  manipulations  have  been 
presented  for  a  single  interface  in  great  detail  in  Ref.  [11]  for 
a  single  interface  and  in  Ref.  [4]  for  the  simple  slab  case. 
Only  the  final  results  will  be  presented  here. 


1.  Vertical  dipole 

2D  bound  modes.  For  the  vertical  dipole,  the  Hertz  poten¬ 
tial  within  the  slab  is  evaluated  using  the  asymptotic  limit  of 
the  Hankel  function  [Hq(z)~ >  —  ielz\l2/irz]  and  taking  the 
residues  at  the  zeros  of  the  denominator, 


Xl(l+RPyme  T0.;Lo)cOSh  (y%,z) 

+  RPasyme~yP°-lLOsinh('yP0,lZ)],  (18) 


where  the  denominator  is  expanded  around  the  roots  , 
where  the  subscript  is  the  mode  index)  of  the  TM  ( p  super¬ 
script)  modal  dispersion  relation,  viz., 


Dp,  =  l~e 


2^L0RPRr~(\-kP) 


dDp 

dk 


=  (\  —  ky  )  SDf 

k1 

(19) 


and 


where  the  summation  is  over  all  modes  (/),  7o,/=Toj 
+  i  y^l ,  and  the  coefficients  c0,z  and  J0  /  are  the  complex 
amplitudes  of  the  upward  and  downward  directed  fields  in 
each  layer  at  the  center  of  the  slab.  From  Eq.  (18) 

c0,i  =  \(l+  RPsyme  -  r<yZ°  +  RPasyme  ■  ro’'Z°) 

=  i-(l+Rfe~TwLo)) 
doj  =  \  (  1  +  R  V  r°'iZ°  -  Rasyme  ~  rwZ°) 

=  l-(\+Rpbe~^il»).  (22) 

Similar  expressions  need  to  be  evaluated  over  the  entire  film 
stack,  including  the  top  and  bottom  semi-infinite  regions,  and 
summed  to  get  the  total  2D  radiated  power. 

3D  radiated  energy.  The  calculation  proceeds  by  substi¬ 
tuting  the  integral  expression  for  the  Hankel  function 

4  roo  /  2 

Hl0(z)  =  ~elz  (4 iz-y2)~me~y  l2dy^elz  yj - — , 

7T  Jo  '  ITT Z 

(23) 

where  the  last  approximation  is  valid  in  the  radiation  zone, 
many  wavelengths  from  the  dipole.  Then  the  Hertz  potential 
in  the  semi-infinite  cladding  regions  can  be  evaluated  by 
integrating  around  the  branch  cut  by  the  method  of  steepest 
descents  with  the  result  for  the  top  half- space: 

p0k0ein> *  1  ~RP  [i+g'-Wos  WRP(0)] 

4>t  = - - - 

R  k0  l-e2lnoLomsWRP(0)RPb(O) 


RP  +  Rpb  RP-Rpb 

rp  =  _L - b_  rp  =  _L - b_  (20) 

sym  n  ’  asym  o  ’  ^  ' 


The  superscript  on  the  /  s  takes  note  of  the  fact  that  they  are 
evaluated  at  the  roots  of  the  TM  dispersion  relation  and  the 
subscripts  on  the  / s  refer  to  the  layer  index  (j)  and  the 


X  £*(wo  cos  ®~nt cos  0)Lo/2  (24) 

where  R  is  the  dimensionless  radial  distance  coordinate,  #  is 
the  propagation  angle  inside  the  slab  for  an  external  angle  of 
6  [cos2  #=1  —  (iq/ic^sin2  0\  and  the  reflection  coefficients 
are  evaluated  at  the  external  angle  6. 
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For  the  radiated  power,  normalized  to  the  power  radiated  by  the  dipole  in  an  infinite  medium,  the  final  result  is 
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(25) 
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with  a  similar  expression  for  the  power  radiated  into  the 
bottom  half-space  with  the  obvious  interchange  (tob).  As 
expected,  this  result  is  identical  to  that  for  the  simple  slab  [4] 
with  the  substitution  of  the  film-stack  reflectivities  for  those 
of  the  simple  dielectric  interface  and  the  generalization  to  an 
asymmetric  cladding  structure. 

2.  Horizontal  dipole 

A  horizontal  dipole  couples  to  both  TE  and  TM  radiation. 
The  calculations  are  very  similar  to  those  presented  above 
and  only  the  results  are  given  here. 

2D  bound  modes.  For  TE  coupling 
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For  TM  coupling 
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3D  radiation  into  top  half- space.  For  TE  coupling 
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For  TM  coupling 
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FIG.  2.  Radiated  power  from  a  dipole  embed¬ 
ded  in  a  slab  (top)  and  in  the  same  slab  with  four 
mirror  stacks  (bottom).  The  results  for  a  vertical 
dipole  are  shown  in  the  left  panels  and  for  a  hori¬ 
zontal  dipole  in  the  right  panels.  In  each  case  the 
dipole  is  embedded  in  a  thin  (10-nm-wide) 
InGaAs  (n  =  3.55)  section  clad  top  and  bottom 
with  GaAs  (n  =  3.495)  spacers  of  thickness  L. 
For  the  slab,  the  semi-infinite  outer  claddings  are 
air  (n=  1);  for  the  mirror  stacks  are  X/4  pairs  of 
A1203  (ft  =1.76)  and  GaAs,  again  with  semi¬ 
infinite  air  outer  claddings.  See  text  for  detailed 
discussion. 


with  equivalent  expressions  for  the  radiation  into  the  bottom 
half  space. 

Again,  it  is  instructive  to  compare  these  expressions  for  a 
horizontal  dipole  with  the  equivalent  expressions  derived  for 
the  symmetrically  clad  slab  with  Sommerfeld’s  Hertz  vector 
approach  [4].  For  the  TE  coupling  the  results  are  again  very 
similar  with  the  straightforward  substitutions  of  the 
multilayer  reflectivities,  the  more  careful  tracking  of  reflec¬ 
tion  coefficients  necessary  for  nonequivalent  surfaces,  and 
the  automatic  placement  of  the  dipole  in  the  center  of  the 
slab.  The  TM  results  are  significantly  simpler  in  this  formu¬ 
lation,  particularly  for  the  Hertz  potential  and  the  3D  radia¬ 
tion  contributions.  In  the  previous  treatment,  based  on  the 
Sommerfeld  formulation,  the  possible  algebraic  simplifica¬ 
tion  was  only  recognized  for  the  bound  mode  contributions. 

IV.  RESULTS 

In  the  following  two  sections,  results  using  these  expres¬ 
sions  are  presented  for  two  related  vertical-cavity  structures. 
First,  we  consider  a  slab  bound  by  two  symmetric  quarter 
wavelength  mirror  stacks  with  air  on  both  sides.  The  power 
is  radiated  into  both  bound  modes  and  3D  radiation.  In  the 
second  case,  the  same  structures  are  atop  a  GaAs  substrate, 
again  with  air  cladding  the  top  surface.  Now  there  are  no 
bound  modes,  but  there  are  “leaky”  modes  with  field  distri¬ 
butions  very  similar  to  the  bound  modes,  but  whose  energy 
leaks  into  radiation  modes  in  the  substrate.  Absorption  has 
been  set  to  zero  (all  indices  taken  as  pure  real  quantities), 
interface  roughness  has  been  neglected,  and  the  structure  is 
taken  as  infinite  within  the  plane.  Additionally,  the  dipole 
radiation  has  been  assumed  to  be  at  a  continuous  single  fre¬ 
quency  a ),  without  any  dephasing  ( T2 )  or  decay  (Tx)  events. 
The  impact  of  all  of  these  assumptions  is  to  allow  the  coher¬ 
ent  addition  of  fields  over  many  transits  of  the  multilayer 
structure,  e.g.,  to  investigate  very  high  finesse  resonances. 
This  allows  beautiful  examples  of  resonance  effects;  how¬ 
ever,  due  consideration  of  these  limiting  effects  must  be 
taken  in  applying  this  analysis  to  the  interpretation  of  experi¬ 
mental  results. 

A.  Slab  with  DBR  mirrors — bound  modes  and  radiation 

The  slab  structure  consists  of  a  thin  (10  nm)  InGaAs 
quantum  well  (n  =  3.55)  with  the  dipole  in  the  center  of  the 


well,  symmetric  GaAs  inner  cladding  layers  (n  =  3.495) 
whose  thicknesses  are  equal  and  are  varied  in  the  calculation, 
symmetric  X/4  Al203/GaAs  (n  =  1.76/3.495)  mirror  pairs 
(results  for  zero  to  four  mirror  pairs  are  presented),  and  air 
(n=  1)  upper  and  lower  outer  cladding  half-spaces.  The  spa¬ 
tial  index  profile  is  shown  as  an  inset  in  Fig.  2.  These  index 
values  are  appropriate  to  a  wavelength  of  980  nm;  results  are 
shown  normalized  to  the  wavelength  in  the  GaAs  layers. 

The  calculated  radiated  power,  normalized  to  the  radiated 
power  in  an  unbounded  InGaAs  medium  as  a  function  of  the 
GaAs  inner  cladding  thickness  for  a  simple  slab  (top,  no 
mirror  stacks)  and  for  a  slab  bound  on  each  side  by  four 
mirror  stacks  (bottom)  is  shown  in  Fig.  2  for  both  a  vertical 
dipole  source  (left)  and  a  horizontal  dipole  source  (right). 
For  the  vertical  dipole  source,  all  of  the  radiation  is  into  TM 
bound  and  radiation  modes.  For  the  air-clad  slab,  the  3D 
radiation  shows  strong  Fabry-Perot  resonances  for  slab  thick¬ 
nesses  near  2nxL~(2jJr 1)\/2,  7  =  0,1,...,  and  much 
weaker  resonances  for  even  numbers  of  half-wave  reso¬ 
nances.  This  alternation  arises  from  the  factoring  of  the  reso¬ 
nant  denominator  for  this  symmetric  case  and  the  cancella¬ 
tion  of  one  set  of  resonances  by  the  numerator  in  Eq.  (25). 
The  horizontal  dipole  has  a  similar  set  of  resonances  but  now 
at  even  numbers  of  half-waves  and  the  cancellation  for  odd 
numbers  of  half-waves,  for  both  the  TE  and  TM  radiation 
components.  These  resonance  conditions  are  a  direct  conse¬ 
quence  of  the  boundary  conditions  in  Eqs.  (10)  and  (14).  The 
TM  radiation  boundary  conditions  for  a  vertical  dipole  re¬ 
quire  that  the  Hertz  potential  be  zero  at  the  edges  of  the  inner 
cladding  as  the  magnitude  of  the  reflectivity  approaches 
unity,  whereas  the  TE  and  TM  boundary  conditions  for  a 
horizontal  dipole  force  the  derivative  of  the  potential  to  zero 
at  the  same  locations.  The  high  symmetry  of  a  dipole  located 
in  the  precise  center  of  the  slab  results  in  these  alternating 
resonance  selection  rules;  more  complex  behavior  is  found 
for  a  dipole  displaced  from  the  center  of  the  slab.  For  the  2D 
bound  modes,  the  vertical  dipole  exhibits  a  strong  suppres¬ 
sion  of  the  radiation  for  very  thin  slabs  (L<X/n)  while  the 
radiation  is  more  pronounced  for  the  horizontal  dipole  at 
these  thicknesses.  In  both  cases,  the  total  radiated  power 
rises  approximately  to  that  for  the  dipole  in  an  infinite  me¬ 
dium  as  the  inner  cladding  thickness  is  increased;  the  slightly 
lower  value  ( <  1 )  than  in  an  infinite  InGaAs  medium  is  be- 
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cause  the  normalization  is  to  the  dipole  in  the  higher  index 
InGaAs,  but  the  majority  of  the  bound  mode  radiation  is 
confined  to  the  GaAs  layers.  Each  of  the  contributions  to  the 
total  radiated  power  has  been  multiplied  by  various  factors  to 
make  them  visible  in  the  figure.  As  is  well  known,  for  the 
vertical  dipole,  only  very  weak  radiation  into  the  air  spaces  is 
observed  (3DX100).  More  interesting  is  the  result  for  the 
horizontal  dipole,  where  most  of  the  radiation  is  into  the  TE 
mode;  the  2D  TM  radiation  is  multiplied  by  10,  while  the  3D 
TM  radiation  contribution  to  the  total  power  is  multiplied  by 
2. 

The  impact  of  the  mirror  pairs  on  the  distribution  of 
power  between  the  individual  radiation  components  is  sig¬ 
nificant,  whereas  the  total  radiated  power  is  only  modestly 
impacted.  For  a  vertical  dipole,  there  is  a  noticeable  increase 
in  the  radiated  power  into  the  2D  TM  mode  for  very  thin 
slabs.  This  is  a  result  of  the  increased  mode  volume  due  to 
the  mirrors  (see  the  left  side  of  Fig.  2).  For  the  air-clad  slab, 
only  the  lo west-order  mode  is  propagating  for  2n1L^\ 
while  for  the  four-mirror-clad  slabs,  there  are  six  propagating 
modes  even  for  L  =  0.  For  both  dipole  orientations,  the  3D 
radiation  is  significantly  increased  at  the  alternating  half¬ 
wave  resonances,  and  suppressed  away  from  these  reso¬ 
nances.  For  a  horizontal  dipole  at  the  first  two  resonances, 
the  total  radiated  power  is  slightly  larger  (  —  15%)  than  that 
for  a  dipole  in  an  infinite  medium  corresponding  to  a  de¬ 
crease  in  the  radiative  lifetime.  The  3D  radiation  is  strongly 
peaked  at  the  even  integral  resonances  for  both  the  TE  and 
TM  components.  Away  from  these  resonances,  essentially  all 
of  the  radiated  power  is  in  the  2D  bound  modes.  As  the 
number  of  mirror  pairs  is  increased,  more  of  the  energy  is 
shifted  into  the  TM  radiation,  although  it  remains  a  signifi¬ 
cantly  smaller  fraction  of  the  total  power  than  that  radiated 
into  the  TE  mode.  There  is  a  more  equal  balance  of  radiated 
power  between  TE  and  TM  for  the  3D  radiation. 

The  evolution  of  the  total  radiated  power  as  the  number  of 
mirror  pairs  is  shown  more  clearly  in  Fig.  3.  Results  are 
shown  for  both  vertical  (top)  and  horizontal  (bottom)  dipoles 
for  the  slab  case  and  for  one  through  four  mirror  pairs.  The 
curves  are  offset  for  clarity.  In  each  case,  the  radiation  from 
a  dipole  in  an  infinite  medium  is  shown  as  the  associated 
dotted  line. 

For  a  vertical-cavity  laser,  the  radiated  power  for  a  hori¬ 
zontal  dipole  in  the  normal  direction,  perpendicular  to  the 
layers,  is  of  particular  interest.  The  maximum  intensity  into 
the  normal  direction  occurs  when  the  TE  and  TM  powers  are 
equal,  just  at  the  leading  edge  of  the  resonances  of  Fig.  2  (see 
also  Fig.  5B).  Figure  4  (top)  shows  the  normalized  forward 
direction  emission  for  a  horizontal  dipole  as  a  function  of  the 
slab  thickness  for  the  five  cases  of  a  slab  and  one  through 
four  mirror  pairs.  The  angular  variation  at  the  peak  is  also 
shown  in  Fig.  4  (bottom).  The  variation  with  slab  thickness 
shows  the  expected  Lorentzian  line  shape  from  the  expan¬ 
sion  of  the  resonant  denominator  at  an  increasing  cavity  fi¬ 
nesse  F  as  the  reflectivity  increases  with  the  number  of  mir¬ 
ror  pairs.  (The  calculated  normal  emission  power 
reflectivities  and  corresponding  F’s  are  |F|2  =  0.31,  F— 2.5, 
slab;  |F|2  =  0.75,  F— 11,  one  mirror  pair;  |F|2  =  0.93,  F 
—  43,  two  mirror  pairs;  |F|2  =  0.982,  F—  173,  three  mirror 
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FIG.  3.  Evolution  of  the  total  radiated  power  for  a  symmetric 
structure  as  the  number  of  mirror  pairs  is  varied.  The  curves  are 
offset  for  clarity;  in  each  case  the  dashed  line  represents  the  power 
radiated  by  the  dipole  into  an  infinite  InGaAs  medium. 


pairs;  and  1 7^ | 2  =  0.995,  F=  628,  four  mirror  pairs.)  The 
slight  shift  of  the  resonance  away  from  2 nL/X  =  1  is  due  to 
the  optical  thickness  of  the  InGaAs  quantum  well.  As  ex- 
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FIG.  4.  Vertical  emission  (normal  to  the  layer  structure)  into  3D 
radiation  modes  for  a  horizontal  dipole  embedded  in  a  symmetri¬ 
cally  clad  slab  as  a  function  of  the  inner  cladding  thickness  (top) 
and  the  angular  variation  for  the  peak  vertical  emission  (bottom), 
both  with  the  number  of  mirror  pairs  as  a  parameter.  As  expected, 
the  increasing  finesse  of  the  resonance  with  increasing  numbers  of 
mirror  pairs  is  reflected  in  both  plots. 
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FIG.  5.  Angular  variation  of  the  3D  radiation  from  a  horizontal 
dipole  in  a  symmetrically  clad  slab  with  four  mirror  pairs  as  the 
inner  cladding  thickness  is  varied.  The  top  panel  shows  an  ex¬ 
panded  view  of  the  integrated  TE  and  TM  3D  radiation  in  the  vi¬ 
cinity  of  the  first-resonance  inner  cladding  thickness.  The  angular 
traces  (A  though  G)  show  the  evolution  of  the  angular  dependence 
of  the  emission  as  the  inner  cladding  thickness  is  varied  in  this 
region. 

pected,  the  angular  variation  of  the  emission  at  the  peak  of 
each  of  these  resonances  narrows  significantly  as  F  in¬ 
creases. 

As  the  inner  cladding  thickness  L  increases  beyond  this 
resonance,  the  peak  of  the  3D  emission  shifts  to  steeper 
angles  for  both  TE  and  TM  radiation,  and  the  total  3D  radi¬ 
ated  power  increases  dramatically.  The  top  panel  of  Fig.  5 
shows  an  expanded  view  of  the  total  integrated  3D  radiated 
power  emitted  from  both  surfaces  around  the  first  peak  re¬ 
gion  (nL/\~  0.5)  for  the  four-mirror-pair  case  for  a  horizon¬ 
tal  dipole.  From  the  symmetry  of  this  geometry,  the  same 
power  is  emitted  from  both  surfaces.  The  angular  depen¬ 
dence  of  the  radiated  power  is  shown  for  various  values  of  L 
(A  through  G)  as  indicated  in  the  top  panel.  Case  A  is  before 
the  resonance;  the  radiated  power  is  quite  small  and  does  not 
show  a  pronounced  angular  dependence.  Case  B  is  the  maxi¬ 
mum  of  the  normal-direction  emitted  power  shown  previ¬ 
ously  in  Fig.  4.  Notice  that  this  peak  normal  emission  corre¬ 
sponds  to  a  very  small  fraction  of  the  power  radiated  from 
the  dipole.  Case  C  is  just  past  the  sharp  increase  in  the  3D 
emitted  power  for  both  TE  and  TM  radiation.  There  is  a  peak 
in  the  angular  dependence,  for  both  TE  and  TM  radiation,  at 
a  small  angle  away  from  normal.  This  peak  can  be  traced  to 


the  round-trip  resonance  condition  in  the  denominators  [Eqs. 
(31)  and  (33)]  that  must  move  away  from  normal  as  the 
thickness  is  increased.  The  explicit  dependence  in  Eqs.  (31) 
and  (33)  is  on  the  thickness  of  the  central  slab  medium  con¬ 
taining  the  dipole,  which  is  fixed  in  this  calculation.  The 
dependence  on  the  thickness  of  the  cladding  is  contained  in 
the  phase  of  the  reflection  coefficients.  This  lowest  order 
round  trip  resonance  condition  is  simply,  4 nGaAs  cos  ftGaASL 
+2ninGaAsC0S  ^hiGaAs +2 (f)(0)  =  X/2 77,  which  clearly  exhibits 
the  dependence  of  ^GaAs  >  the  angle  of  propagation  in  the 
inner  cladding,  on  L.  The  final  phase  term  (f)(6)  represents 
the  phase  shift  in  the  reflector  stack  beyond  the  inner  clad¬ 
ding.  The  external  angle  6  is  related  to  # G a As  by  the  Fresnel 
condition  nGaAs  sin  ftGaAs=sin  6  for  the  present  case  of  a 
semi-infinite  air  (nt=  1)  outer  cladding.  The  change  in  the 
external  angle  for  a  given  change  in  L  is  amplified  by  the 
GaAs  refractive  index.  For  case  D,  corresponding  roughly  to 
the  middle  of  the  thickness  region  with  substantial  radiation, 
the  peaks  for  TE  and  TM  radiation  occur  at  different  angles 
and  the  TE  peak  is  significantly  sharper.  This  is  because  of 
the  lower  TM  reflectivity  related  to  Brewster’s  angle  for  TM 
radiation  incident  on  a  single  interface.  The  splitting  is  due  to 
phase  shifts  as  the  angle  is  increased.  This  trend  is  continued 
for  case  E,  just  before  the  cutoff  of  the  TE  radiation.  The 
angular  peak  in  the  TE  emission  is  close  to  7t/2.  For  a  further 
increase  in  the  inner  cladding  thickness,  the  peak  shifts  be¬ 
yond  the  angle  for  total  internal  reflection  and  the  emitted 
power  shifts  from  3D  radiation  out  the  sides  of  the  structure 
to  2D  modes  propagating  within  the  structure.  This  same 
transition  is  more  gradual  for  the  TM  radiation  (case  F)  con¬ 
sistent  with  the  more  gradual  onset  of  the  2D  TM  emission 
with  inner  cladding  thickness  seen  in  Fig.  2.  Finally,  case  G 
is  for  a  thickness  beyond  the  significant  3D  emission  region 
where  the  total  3D  power  is  weak  and  again  does  not  show 
any  strong  angular  dependence. 

B.  Active  region  and  mirrors  on  substrate — No  bound  modes 

We  now  make  a  change  in  the  structure,  replacing  the 
bottom  semi-infinite  air  cladding  with  a  semi-infinite  GaAs 
cladding.  This  simple  change  greatly  modifies  the  calcula¬ 
tion,  but  of  course  the  results  should  not  be  dramatically 
different  since,  at  least  as  the  number  of  mirrors  is  increased, 
the  dipole  should  not  “see”  the  substrate  refractive  index. 
The  major  change  in  the  calculation  is  that  there  are  no 
bound  modes  in  this  structure,  in  the  sense  that  the  zeros  of 
the  dispersion  relation  all  correspond  to  “modal”  indices  that 
are  lower  than  the  GaAs  substrate  refractive  index  for  the 
entire  thickness  region  L  investigated.  Thus  the  radiated 
power  is  either  emitted  out  the  top  surface,  or  “leaks”  from 
the  film  stack  into  the  GaAs  substrate.  As  before,  some  of  the 
power  is  radiated  into  the  upper  air  space.  Mathematically, 
having  the  substrate  index  higher  than  the  solution  to  the 
dispersion  relation  puts  the  poles  on  a  different  sheet  of  the 
Riemann  surface  of  the  inverse  transform  integrand  (e.g.,  the 
negative  sign  must  be  taken  for  the  square  root  correspond¬ 
ing  to  the  substrate  propagation  constant  whereas  a  positive 
sign  was  taken  in  the  true  bond  mode  case)  so  that  the  poles 
do  not  contribute  to  the  inverse  transform  integration. 
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FIG.  6.  Radiated  power  from  a  dipole  embed¬ 
ded  in  an  asymmetric  film  structure  with  air  up¬ 
per  and  GaAs  lower  outer  claddings.  The  top  pan¬ 
els  are  for  a  simple  InGaAs/GaAs  structure,  the 
bottom  panels  include  three  Al203/GaAs  X/4 
mirror  pairs.  In  each  case  the  dipole  is  embedded 
at  the  center  of  a  thin  InGaAs  region.  The  left 
panels  are  for  a  vertically  oriented  dipole;  the 
right  for  a  horizontal  dipole.  The  only  emission  is 
into  3D  radiation  for  these  cases.  There  are  no 
strictly  bound  modes,  however,  for  the  three 
mirror-pair  case;  there  are  “leaky”  modes  largely 
confined  to  the  film  stack  but  with  radiation  that 
leaks  into  the  substrate. 


Figure  6  shows  the  total  radiated  power  for  a  simple  slab 
case  (top),  with  just  a  buried  InGaAs  layer,  and  for  three 
mirror  pairs  (bottom).  Results  are  shown  for  both  a  vertical 
dipole  (left)  and  a  horizontal  dipole  (right).  As  there  are  no 
bound  modes,  there  are  no  2D  contributions;  the  calculation 
proceeds  only  from  Eqs.  (31)  and  (33)  and  their  bottom  sur¬ 
face  counterparts.  For  a  vertical  dipole,  the  results  are  very 
similar  to  those  for  the  symmetrically  clad  slab  with  air  on 
both  sides.  There  are  somewhat  larger  differences  for  a  hori¬ 
zontal  dipole.  The  roughly  factor  of  two  decrease  compared 
with  the  equivalent  figure  for  the  symmetrically  clad  geom¬ 
etry  is  due  to  the  counting  of  the  radiated  power  out  of  both 
top  and  bottom  surfaces  in  Figs.  2  and  5,  whereas  the  top  and 
bottom  surface  radiated  powers  are  shown  individually  in 
this  figure.  There  are  differences  in  the  details  of  the  emis¬ 
sion  dependence  on  inner  cladding  thickness  that  will  be 
discussed  in  conjunction  with  Figs.  8  and  9.  The  total  hori¬ 
zontal  dipole  emission  does  not  exhibit  the  sharp  transitions 
between  2D  and  3D  emission  that  characterized  the  symmet¬ 
ric  air-clad  slab,  but  the  overall  dependence  of  the  emission 
on  the  inner  cladding  thickness  is  similar.  The  total  emission 
at  the  peak  around  nL/\~  0.5  is  larger  than  for  the  similar 
symmetric  air-clad  three-mirror-pair  case.  There  is  also  a  sig¬ 
nificant  change  in  the  distribution  of  energy  between  TE  and 
TM  emission,  the  TM  emission  being  more  pronounced  in 
this  substrate  geometry. 

Figure  7  shows  the  progression  of  the  total  emitted  power 
as  a  function  of  the  inner  cladding  thickness  with  the  number 
of  mirror  pairs  as  a  parameter.  The  curves  have  been  offset 
for  clarity.  The  radiated  power  for  a  dipole  in  an  infinite 
medium  is  shown  as  the  dashed  line  associated  with  each 
curve. 

An  expanded  view  of  the  first  top  surface  emission  region 
for  a  horizontal  dipole,  in  the  vicinity  of  nL/\~ 0.5,  is 
shown  in  the  top  panel  of  Fig.  8.  This  should  be  compared 
with  the  comparable  panel  in  Fig.  5  for  the  symmetrically 
air-clad  slab.  Again,  the  overall  results  are  similar,  but  there 
are  some  noticeable  differences.  The  overall  shapes  are  in¬ 
verted,  in  Fig.  5  the  TE  power  increases  slightly  as  the  inner 
cladding  thickness  increases  up  to  a  very  sharp  cutoff  while 
the  TM  power  decreases  and  shows  a  much  more  gradual 
cutoff.  In  Fig.  8  the  TM  power  increases  while  the  TE  power 
decreases  as  the  inner  cladding  thickness  increases.  The  lim¬ 


its  of  the  emission  with  inner  cladding  thickness  are  more 
gradual  and  less  well  defined  for  this  case.  The  TM  emission 
is  larger  than  the  TE,  the  opposite  of  the  symmetrically  clad 
slab  case. 

The  angular  dependencies  of  the  emission  out  the  top  (air- 
clad)  surface  are  shown  in  panels  A  through  G.  Panel  A  is  for 
inner  cladding  thicknesses  smaller  than  those  that  correspond 
to  the  peak  emission.  The  emission  is  relatively  weak  and 
featureless.  Panel  B  corresponds  to  the  peak  of  the  normal 
emission.  Notice  that  this  peak  occurs  for  a  thickness  further 
up  the  knee  of  the  emission  than  was  the  case  for  the  sym¬ 
metrically  air-clad  slab  (Fig.  5B).  Nonetheless,  the  emission 
is  weaker  in  the  present  asymmetric  case.  In  comparable 
units,  the  emission  at  normal  for  the  air-clad  case  with  three 
mirror  pairs  was  60,  both  top  and  bottom;  in  this  case  it  is  11 
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Normalized  Cladding  Width,  nUX 


FIG.  7.  Evolution  of  the  total  radiated  power  for  an  asymmetri¬ 
cally  clad  slab  with  an  air  upper  and  a  GaAs  substrate  lower  clad¬ 
ding.  The  curves  have  been  offset  for  clarity;  in  each  case  the 
dashed  line  represents  the  power  radiated  by  the  dipole  into  an 
infinite  InGaAs  medium. 
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Normalized  Cladding  Thickness,  nUX 


FIG.  8.  Angular  variation  of  the  3D  radiation  (horizontal  dipole) 
from  the  top  (air)  surface  of  an  asymmetrically  clad  slab  (air  upper 
and  GaAs  lower  outer  claddings)  with  three  mirror  pairs  as  the 
inner  cladding  thickness  is  varied.  The  top  panel  shows  an  ex¬ 
panded  view  of  the  integrated  TE  and  TM  3D  radiation  in  the  vi¬ 
cinity  of  the  first-resonance  inner  cladding  thickness.  The  angular 
traces  (A  though  G)  show  the  evolution  of  the  angular  dependence 
of  the  emission  as  the  inner  cladding  thickness  is  varied  in  this 
region.  Curve  G  is  for  a  normalized  inner  cladding  thickness 
nL/\~  0.65,  beyond  the  thicknesses  plotted  in  the  top  curve. 

top  and  134  bottom.  Overall  the  normal  emission  is  compa¬ 
rable,  but  much  is  lost  to  substrate  emission  in  this  asymmet¬ 
ric  case.  Panel  C  is  for  an  inner  cladding  thickness  near  the 
peak  TE  emission.  Both  TE  and  TM  emissions  show  compa¬ 
rable  peak  emission  angles  as  was  the  case  for  Fig.  5.  Panel 
D  is  for  an  inner  cladding  thickness  near  the  mid  point  of  the 
significant  top-surface  emission.  As  was  the  case  for  the 
symmetrically  clad  structure,  at  these  higher  angles  a  split¬ 
ting  is  observed  between  the  TE  and  TM  peak  emission 
angles  as  a  result  of  different  phase  contributions  from  the 
TE  and  TM  reflectivities.  Panel  E  is  for  a  thickness  near  the 
upper  edge  of  the  TE  emission.  The  peak  of  the  TE  emission 
is  close  to  7t/2  as  expected.  Panels  F  and  G  are  at  the  upper 
edge  of  the  TM  emission  and  beyond  both  TE  and  TM  emis¬ 
sion  regions.  Panel  G  is  for  a  nL/X  of  0.65,  outside  the  range 
of  the  top  panel,  which  was  plotted  on  the  same  horizontal 
scale  as  Fig.  5  for  comparison.  Since  these  curves  are  for 
emission  out  the  top  surface,  the  same  amplification  of  the 


external  angle  relative  to  the  internal  angle  as  was  found  for 
the  previous  symmetrically  clad  structure  is  also  found  in 
these  cases. 

In  contrast  to  the  relatively  orderly  behavior  of  the  top 
surface  emission  shown  in  Fig.  8,  the  bottom  surface  emis¬ 
sion  shows  a  much  more  pronounced  angular  dependence  as 
shown  in  Fig.  9  for  five  different  values  of  the  normalized 
inner  cladding  thickness,  again  for  a  horizontal  dipole.  Note 
the  much  larger  vertical  axis  scale  from  1010  to  10-5.  The 
TE  and  TM  emission  angular  dependencies  are  shown  side 
by  side  for  clarity  in  this  figure.  The  top  pair  of  panels  is  for 
the  same  inner  cladding  thickness  as  panel  B  of  Fig.  8,  just  at 
the  peak  of  the  normal  emission.  In  addition  to  the  angular 
peak  at  (f)  =  0,  both  the  TE  (left)  and  TM  (right)  emission 
show  additional  structure.  There  are  sharp  dips,  which  are 
relatively  independent  of  inner  cladding  thickness  and  are 
related  to  changes  in  reflectivity.  A  notable  feature  is  the  very 
strong  peak  in  the  TE  emission  at  </>—  1.25  (labeled  TE1)  and 
the  comparable,  but  significantly  smaller  peak  in  the  TM 
emission  at  </>— 0.66  (TM1).  These  correspond  to  energy 
transmitted  into  the  substrate  from  the  lowest  order  “leaky” 
mode.  That  is  from  a  field  distribution  in  the  upper  cladding 
layers  that  looks  very  similar  to  the  zero-order  bound  mode 
for  the  symmetrically  air-clad  structure.  Compared  to  the 
peak  normal  emission  of  —134  into  the  substrate,  the  value 
of  this  emission  is  5X  107!  The  resonance  line  width  is  cor¬ 
respondingly  narrow,  so  the  normalized  integrated  radiated 
power  is  —0.5  as  shown  in  Fig.  6.  As  the  inner  cladding 
thickness  is  increased,  both  TE1  and  TM1  peaks  shift  to 
steeper  angles.  Since  the  refractive  index  of  the  substrate 
(3.495)  is  very  similar  to  that  of  the  InGaAs  slab  (3.55),  the 
amplification  of  the  angle  by  refraction  is  small.  As  the  inner 
cladding  thickness  is  further  increased,  both  of  these  peaks 
saturate  at  angles  close  to  ir/2.  The  last  pair  of  panels  in  Fig. 
9  is  for  an  inner  cladding  thickness  such  that  there  are  two 
propagating  leaky  modes  and  consequently  two  pairs  of 
sharp  peaks,  TE1  and  TE2  and  TM1  and  TM2.  These  very 
sharp  angular  dependences  posed  some  numerical  integration 
challenges.  It  was  easy  to  miss  the  peak  in  the  overall  inte¬ 
grals  [the  bottom  surface  equivalents  of  Eqs.  (31)  and  (33)]. 
Care  had  to  be  taken  to  track  the  peaks  as  a  function  of  inner 
cladding  thickness  and  integrate  carefully  over  the  peak  with 
a  very  fine  step  size  to  get  accurate  results.  This  is  also  the 
reason  that  only  the  three-mirror-pair  rather  than  the  four- 
mirror-pair  case  was  evaluated  for  the  asymmetric  structure; 
the  resonances  became  too  narrow  and  too  peaked  to  deal 
with  within  the  16-bit  precision  of  the  present  computer  pro¬ 
gram.  Adding  losses  to  the  dielectric  constant  limits  the  peak 
value,  but  also  results  in  energy  loss  to  absorption  within  the 
dielectric  stack  that  must  be  accounted  for  in  evaluating  the 
total  radiated  energy. 

V.  SUMMARY  AND  CONCLUSIONS 

A  fully  analytic  treatment  of  the  radiation  from  a  dipole 
embedded  in  an  arbitrary  dielectric  stack  has  been  presented. 
The  treatment  starts  from  a  Hertz-vector  formalism  intro¬ 
duced  by  Sommerfeld  [1]  and  extended  by  Lucosz  [9].  The 
analysis  proceeds  by  finding  the  Fourier  transform  of  the 
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FIG.  9.  Angular  variation  of  the  3D  radiation  from  the  bottom  (GaAs  substrate)  of  an  asymmetrically  clad  slab  (air  upper  and  GaAs 
lower)  with  three  mirror  pairs  as  the  inner  cladding  thickness  is  varied  for  a  horizontal  dipole.  The  various  values  of  nL/X  are  given  in  the 
figure.  The  top  panels,  nL/X  =  0.48,  correspond  to  position  B,  the  peak  normal  emission,  in  Fig.  8  (top).  The  second  set  of  panels,  nL/X 
=  0.53,  corresponds  to  position  E,  the  edge  of  the  TE  emission  peak,  in  Fig.  8  (top).  The  third  row  of  panels,  nL/X  =  0.79,  corresponds  to 
a  cladding  thickness  just  before  the  emergence  of  the  second  top-surface  emission  peak  (Fig.  6).  The  final  row  of  panels,  nL/X=  1.57, 
corresponds  roughly  to  the  middle  of  the  third  top-surface  emission  peak  in  Fig.  6. 


dipole  fields  for  an  unbounded  medium,  solving  the  indepen¬ 
dent  boundary  conditions  across  the  multilayer  stack  for  each 
Fourier  component,  and  transforming  back  to  real  space. 
This  procedure  provides  a  rigorous  solution  to  the  multilayer 
boundary  conditions.  Two  different  classes  of  singularities 
are  present  in  the  integrand  for  the  inverse  transform:  poles 
that  correspond  to  bound  2D  waveguide  modes  confined 
within  the  multilayer  stack,  and  branch  cuts  that  correspond 
to  3D  radiation  emitted  from  the  sides  of  the  stack  into  the 
upper  and  lower  outer  cladding  half- spaces.  While  the 
boundary  conditions  are  strictly  valid,  approximations  are 
made  in  order  to  evaluate  the  radiated  fields  that  are  accurate 
only  many  wavelengths  from  the  position  of  the  dipole,  e.g., 
in  the  radiation  zone  away  from  the  near  fields.  The  results 
are  directly  related  to  previous  work  [4]  that  investigated  a 
simple  dielectric  slab  geometry  with  the  straightforward  re¬ 
placement  of  single  layer  reflectivities  with  their  multilayer 
counterparts. 

This  formalism  was  applied  to  two  closely  related  struc¬ 
tures,  both  of  which  are  germane  to  the  design  of  semicon¬ 
ductor  vertical-cavity  lasers.  Both  structures  contain  a  thin 
active  region  containing  the  dipole  with  symmetric  inner 
claddings  and  an  equal  numbers  of  Bragg  reflector  mirror 
pairs  on  both  sides.  In  the  first  case,  the  structure  is  fully 
symmetric  with  an  air  cladding  both  above  and  below  the 
stack.  In  the  second  case,  the  lower  cladding  is  replaced  with 
a  substrate  with  the  same  refractive  index  as  the  inner  clad¬ 
ding.  These  two  cases  illustrate  some  interesting  points.  For 
the  symmetrically  air-clad  structure,  the  dipole  couples  to 
both  the  bound  modes  of  the  slab  and  the  radiation  fields  in 
the  air  spaces.  For  the  asymmetric  structure,  on  the  other 


hand,  there  are  no  bound  modes  because  of  the  high-index 
outer  cladding  and  only  the  3D  radiation  calculation  is  op¬ 
erative.  However,  for  high  reflectivities,  e.g.,  a  sufficient 
number  of  Bragg  reflector  pairs,  the  results  should  be  essen¬ 
tially  equivalent  since  the  dipole  can  not  “see”  the  substrate 
because  of  the  high  reflectivity.  Instead  of  true  bound  modes, 
there  are  leaky  modes  that  look  very  much  like  the  bound 
modes  of  the  air-clad  structure,  but  whose  power  is  transmit¬ 
ted  into  the  substrate  after  many  round  trips  within  the  film 
stack.  Because  of  the  very  high  reflectivities  in  these  struc¬ 
tures,  especially  at  steep  angles,  very  sharp  angular  depen¬ 
dences  are  found  for  this  radiated  power,  corresponding  to 
the  bound  modes  in  the  symmetric  air-clad  case.  In  both 
cases,  the  3D  radiation  into  the  upper  low-index  (air)  half¬ 
space  is  a  sharp  function  of  the  thickness  of  the  inner  clad¬ 
ding  with  TM  and  TE  resonances  at  alternating  half-wave 
inner  cladding  thicknesses.  The  asymmetric  case  is  very 
close  to  that  of  some  LEDs  on  solid  substrates.  For  example, 
visible-emission,  nitride-based  quantum- well  LEDs  [13]  are 
examples  of  mode-free  structures  [14].  Modes  similar  to 
guided  modes  but  losing  power  due  to  leakage  into  the  sub¬ 
strate  are  observed  in  such  structures. 

Finally,  we  present  the  following  conclusions. 

(1)  Analytic  solutions  are  presented  using  a  rigorous 
Hertz-vector  formalism  for  dipole  emission  within  a 
multilayer  dielectric  structure.  These  solutions  are  suitable 
for  application  to  edge-emitting  and  vertical-cavity  surface- 
emitting  lasers  (VCSELs),  resonant  light-emitting  diodes  and 
photodetectors,  and  other  multilayer  optoelectronic  devices. 
Expressions  are  presented  as  functions  of  a  normalized  opti¬ 
cal  thicknesses  of  the  various  layers  and  can  be  used  to  ana- 
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lyze  either  thickness  or  wavelength  variations. 

(2)  Enhancement/suppression  of  the  dipole  radiation  rela¬ 
tive  to  that  in  an  infinite  medium  is  obtained.  The  results  are 
very  dependent  on  the  structure;  however,  even  with  a  very 
high  finesse  planar  cavity,  the  maximum  enhancement  is  lim¬ 
ited  to  —31%  for  a  horizontal  dipole  at  the  first  resonance 
peak,  e.g.,  for  a  total  inner  cladding  thickness  of  —  nX.  The 
inner  cladding  thickness  for  peak  emission  is  close  to,  but 
slightly  longer  than,  the  peak  normal  emission  thickness. 

(3)  A  larger  fractional  suppression  of  the  dipole  emission 
is  found  for  a  vertical  dipole  for  very  short  inner-cladding 
thicknesses  lengths  <n\/ 2.  This  is  due  to  destructive  inter¬ 
ference  between  the  radiated  and  reflected  fields  at  the  posi¬ 
tion  of  the  dipole. 

(4)  The  distribution  of  the  radiated  power  between  3D 
radiation  into  the  outer  claddings  and  2D  radiation  into  the 


waveguide  modes  shows  a  larger  variation  with  structure 
than  does  the  total  radiated  power.  For  a  free-standing  sym¬ 
metric  structure,  the  addition  of  high-reflectivity  Bragg  mir¬ 
rors  modifies  significantly  the  fraction  of  power  emitted  into 
the  air  space  and  its  inner  cladding  thickness  and  angular 
variations.  As  expected,  the  angular  width  and  inner  cladding 
thickness  tolerance  of  the  normal-emission  resonance  de¬ 
creases  with  increasing  mirror  reflectivities. 

(5)  In  the  asymmetric  case,  with  a  high-index  substrate, 
the  bound  modes  of  the  structure  evolve  into  leaky  modes 
whose  energy  is  radiated  into  the  substrate.  The  top-surface 
emission  for  this  case  is  similar  to  that  for  the  symmetric 
case.  The  bottom  (substrate)  3D  emission  is  characterized  by 
a  complex  dependence  at  small  angles  corresponding  to  the 
reflectivity  coefficient  for  this  multilayer  structure,  and  by 
very  intense  and  narrow  resonance  peaks  at  higher  angles 
corresponding  to  the  leaky  modes. 


[1]  A.  Sommerfeld,  Ann.  Phys.  (Leipzig)  28,  665  (1909). 

[2]  A.  Sommerfeld,  Partial  Differential  Equations  in  Physics 
(Academic  Press,  New  York,  1949). 

[3]  D.  Marcuse,  Theory  of  Dielectric  Optical  Waveguides  (Aca¬ 
demic  Press,  San  Diego,  1991). 

[4]  S.  R.  J.  Brueck,  IEEE  J.  Sel.  Top.  Quantum  Electron. 
JSTQE-6,  899  (2000). 

[5]  G.  Bjork,  S.  Machida,  Y.  Yamamoto,  and  K.  Igeta,  Phys.  Rev. 
A  44,  669  (1991). 

[6]  H.  Rigneault  and  S.  Monneret,  Phys.  Rev.  A  54,  2356  (1996). 

[7]  Y.  Xu,  R.  K.  Lee,  and  A.  Yariv,  Phys.  Rev.  A  61,  033807 
(2000). 

[8]  S.-T.  Ho,  Y.  Chu,  J.-P.  Zhang,  S.  Wu,  and  M.-K.  Chin,  in 


Optical  Processes  in  Microcavities ,  edited  by  R.  K.  Chang  and 
A.  J.  Campillo  (World  Scientific,  Singapore,  1996),  pp.  339- 
387. 

[9]  W.  Lukosz  and  R.  E.  Kunz,  J.  Opt.  Soc.  Am.  67,  1607  (1977). 

[10]  W.  Lukosz,  Phys.  Rev.  B  22,  3030  (1980). 

[11]  A.  Banos,  Dipole  Radiation  in  the  Presence  of  a  Conducting 
Half -Space  (Pergamon  Press,  Oxford,  1966). 

[12]  J.  A.  Stratton,  Electromagnetic  Theory ,  1st  ed.  (McGraw-Hill, 
New  York,  1941),  pp.  573-575. 

[13]  S.  Nakamura,  M.  Senoh,  N.  Iwasa,  and  S.  Nagahame,  Jpn.  J. 
Appl.  Phys.,  Part  2  34,  L797  (1995). 

[14]  I.  V.  Akimova,  P.  G.  Eliseev,  M.  A.  Osinski,  and  P.  Perlin, 
Quantum  Electron.  26,  1039  (1996). 


036608-13 


Single-longitudinal-mode  emission  from  interband 
cascade  DFB  laser  with  a  grating  fabricated  by 
interferometric  lithography 


J.L.  Bradshaw,  J.D.  Bruno,  J.T.  Pham,  D.E.  Wortman,  S.  Zhang  and  S.R.J.  Brueck 


Abstract:  A  distributed-feedback,  type-II  interband  cascade  laser  is  demonstrated  in  pulsed  mode, 
emitting  near  3.145  jxm  at  temperatures  between  50  and  80  K.  Feedback  is  provided  by  a  surface- 
etched  grating  formed  using  interferometric  lithography.  Between  50  and  80  K,  the  device’s  single 
longitudinal  mode  red-shifts  with  temperature  at  ~0.1nm/K.  At  70  K  and  190  mA  of  pulsed 
injection  current,  side-mode  suppression  exceeds  30  dB.  The  characteristics  of  related  Fabry -Perot 
IC  lasers  are  also  described  and  contrasted  with  those  of  the  distributed  feedback  device. 


1  Introduction 

In  this  paper,  we  report  on  the  first  demonstration  of  a 
distributed  feedback  (DFB)  interband  cascade  (IC)  laser. 
Light -current -voltage  (L-I-V)  and  spectral  character¬ 
istics  of  the  DFB  device  are  reported,  along  with  the  L-I-V 
characteristics  of  related  Fabry-Perot  (FP)  devices  for 
comparison.  Possible  improvements  to  the  DFB  device 
structure  are  also  identified. 

The  results  reported  below  are  significant  because  they 
represent  another  step  forward  in  the  ongoing  development 
of  quantum-cascade-based  semiconductor  diode  lasers 
(SDLs),  which  include  the  type-I  intersubband  quantum 
cascade  (QC)  lasers  and  the  type-II  IC  lasers.  These  cascade 
laser  designs  partially  overcome  fundamental  limitations  of 
more  conventional  mid-IR  laser  designs,  and  hold  the 
potential  of  providing  single-mode  SDLs  throughout  the 
mid-IR  region  that  can  operate  under  CW  conditions  at 
ambient  temperatures  (or  temperatures  accessible  with 
thermoelectric  coolers). 

The  QC  laser  has  recently  demonstrated  CW  operation  at 
300  K  from  a  FP  device,  emitting  at  9.1  jxm  [1],  and  single¬ 
mode  DFB  QC  lasers  have  been  demonstrated  between  4.5 
and  16.5  [Jim  (see  [2]  for  a  recent  review).  FP  IC  lasers  have 
been  demonstrated  in  the  3  to  5  p,m  wavelength  region,  and 
recently,  room  temperature,  pulsed  operation  was  demon¬ 
strated  at  3.51  ixm  [3].  However,  single-mode  IC  lasers  have 
not,  to  our  knowledge,  been  reported  previously. 

2  Experimental  details 

The  laser  sample  used  to  fabricate  the  DFB  device,  referred 
to  below  as  M2,  was  grown  by  molecular  beam  epitaxy 
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(MBE)  on  a  p-type  (001)  GaSb  substrate.  The  layer  structure 
and  DFB  device  geometry  are  shown  schematically  in  Fig.  1 . 
A  2.26-jxm-thick  n-type  InAs/AlSb  superlattice  region 
serves  as  the  waveguide’s  lower  cladding  and  is  followed 
by  a  1.34-ptm-thick  core  region.  The  core  consists  of  18 
identical  cascaded  active  regions  separated  by  n-type 
injection  regions.  A  0.3-pum-thick,  p-doped  GaSb  layer 
was  grown  above  the  core  region  and  serves  as  a  separate 
confinement/top  contact  layer  and  host  layer  for  the  etched 
grating  of  the  DFB  devices. 

The  active  region  of  the  laser’ s  core  consists  of  InAs  and 
GalnSb  quantum  well  (QW)  layers  and  GaSb  and  AlSb 
blocking/barrier  layers,  and  the  injection  regions  consist  of 
digitally  graded  n-InAs  and  AlSb  layers.  A  schematic 
energy  band  diagram  of  a  single  period  of  a  typical  IC 
laser’s  cascaded  active  and  injection  regions  is  shown  in 
Fig.  2.  Electrons  injected  into  the  active  region’s  InAs  QW 
relax  to  the  lowest  heavy-hole  state  in  the  neighbouring 
GalnSb  QW  with  the  emission  of  a  photon.  Subsequent 
transport,  facilitated  by  interband  tunnelling,  moves  the 
electrons  into  the  adjacent  injection  region,  where  they  are 
transported  through  the  region’s  conduction  miniband  for 
injection  into  the  next  active  region.  The  active/injection 
regions  of  M2  and  the  MBE  procedures  used  in  its  growth 
are  the  same  as  those  used  in  the  growth  of  the  sample  in  [3] 
and  have  been  described  elsewhere  [4] . 

After  MBE  growth,  the  M2  sample  was  cleaved  to 
provide  laser  material  for  the  fabrication  of  both  DFB  and 
FP  devices.  Prior  to  wet-etching  for  mesa  definition  of  DFB 
devices,  a  grating  was  etched  into  the  top  of  the  DFB 
material  as  follows.  First,  an  antireflection  coating  (ARC) 
and  photoresist  (PR)  layer  were  deposited  onto  the  top 
p-GaSb  epilayer  of  the  M2  sample.  Two  beams  formed  from 
the  third  harmonic  of  a  yttrium-aluminium-garnet  (YAG) 
laser  at  355  nm  were  then  directed  from  opposite  positions 
onto  the  sample,  each  making  angles  of  ~45°  relative  to  the 
sample’s  surface.  The  angle  was  chosen  so  that  the  beam’s 
interference  formed  a  465-nm-period  standing- wave  pattern 
in  the  PR  above  the  ARC  layer,  exposing  the  PR.  The 
grating  period  was  determined  by  dividing  the  expected 
emission  wavelength  of  the  M2  device  (3.32  jim  at  80  K)  by 
twice  the  waveguide’s  modal  index  n.  The  latter  was 
estimated  at  3.47  by  measuring  the  FP  fringe  spacing,  A2, 


288 


IEE  Proc.-Optoelectron.,  Vol.  150,  No.  4,  August  2003 


top  Au  contact 
on  Si02 

|  * -  top  Au  contact  on  Si02  - ► 

p-GaSb  contact  and  optical  confinement  layer  with  grating 

Cascaded  active  and  injection  regions 

n-type  InAs/AISb  SLS  lower  cladding 

GaSb  buffer  layer  on  p-(001)  GaSb  substrate 

a 


p-GaSb  contact  and  optical  confinement  layer  with  grating 


Cascaded  active  and  injection  regions 


n-type  InAs/AISb  SLS  lower  cladding 


GaSb  buffer  layer  on  p-(001)  GaSb  substrate 


b 


Fig.  1  Schematic  layer  structure  and  device  geometry  used  for  DFB  devices 

a  Cross  sectional  view  showing  DFB  edge- contacting  scheme 
b  Side  view  showing  top-grating  orientation 


of  a  similar  device’s  below-threshold  amplified  spontaneous 
emission  spectrum  and  using  the  relation  n  =  22/(2LA2), 
where  A2  is  the  separation  between  adjacent  FP  fringes,  X  is 
the  expected  emission  wavelength  and  L  is  the  device’s 
cavity  length. 

The  exposed  PR  and  underlying  ARC  layers  were  then 
removed,  and  an  ion  beam  was  used  to  etch  a  shallow 
vertical  trench  into  the  exposed  p-GaSb  layer.  After 
cleaning,  this  process  resulted  in  an  ^40-nm-deep,  465- 
nm-period  grating  with  a  duty  cycle  of  approximately  50% 
on  the  top  p-GaSb  layer  of  the  DFB  laser  material. 

The  FP  and  DFB  materials  were  processed  into  ridge 
mesa  devices,  with  mesa  widths  of  65  and  115  pim,  defined 
with  optical  lithography  and  wet  chemical  etching.  Insulat¬ 
ing  layers  of  Si02  were  deposited  after  the  mesas  were 
defined,  leaving  openings  on  top  of  the  ridges  to  allow  for 
electrical  contacts  to  the  devices  (Fig.  1).  In  the  case  of  DFB 
devices,  only  5 -pan- wide  windows  were  opened  for  metal 
contact  along  each  top  edge  of  the  mesa  ridge.  This  left  most 
of  the  top  of  the  (wider)  DFB  mesas  without  a  metal  contact 


Fig.  2  Energy  band  diagram  showing  single  stage  of  a  typical  IC 
laser  structure  at  operating  bias 

Active  region  (left)  shows  interband  optical  transition  and  interband 
tunnelling  transition;  injection  region  (right)  transports  electrons  to  next 
stage  of  device 


over  the  grating;  a  step  intended  to  reduce  optical  losses. 
Metal  contacts  were  then  deposited  onto  the  tops  of  the 
mesas,  as  appropriate,  and  onto  the  bottom  substrate  for 
electrical  contact.  Some  DFB  devices  were  intentionally 
fabricated  with  metal  contacts  covering  their  gratings  for  the 
purpose  of  performance  comparisons.  Laser  bars  were 
cleaved  into  cavities  approximately  1-mm  long,  and  both 
facets  were  left  uncoated. 

The  fabricated  lasers  were  indium-soldered  onto  a  copper 
heatsink  and  mounted  onto  the  temperature-controlled  cold 
finger  of  an  optical  cryostat.  L-I-V  and  spectral  charac¬ 
terisations  were  carried  out  using  a  liquid-nitrogen-cooled 
InSb  detector  and  a  grating  monochromator  with  lock-in 
electronic  detection. 

3  Performance  of  Fabry- Perot  lasers 

Figure  3  shows  the  L-I-V  characteristics  of  a  115-pum- 
wide  by  0.96-mm-long  cavity  length  M2  FP  device.  This 
device  had  the  entire  top  of  the  mesa  contacted  with  a  0.3- 
[xm-thick  Au  contact  layer.  The  threshold  currents  at 


Fig.  3  CW  L-I-V  characteristics  of  a  11 5- pm  x  0.96- pm  M2 
FP  device  at  heatsink  temperatures  of  40  and  60  K 

Device  does  not  have  a  grating  structure,  and  the  top  mesa  is  covered  with  a 
thick  Au  contact 
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heatsink  temperatures  of  40  and  60  K  are  40  and  80  mA, 
respectively  (corresponding  to  current  densities  of  36.2 
A/cm2  and  72.4  A/cm2,  respectively).  The  output  power  is 
very  low:  at  100mA  of  CW  injection  current  at  60  K,  the 
measured  power  from  a  single  facet  is  below  0.1  mW.  The 
device  slope  efficiencies  at  40  and  60  K  are  also  low. 

For  comparison  purposes,  Fig.  4  shows  the  L-I-V 
characteristics  of  a  65-p,m-wide  by  0.98-mm-long  cavity 
length  FP  IC  laser,  fabricated  from  a  laser  structure  that 
included  an  upper  cladding  region.  The  Fig.  4  device  is  the 
one  described  in  [3]  (referred  to  below  as  Ml)  and  has  active 
and  injection  regions  that  are  nominally  identical  to  those  of 
the  M2  FP  device.  The  differences  lie  above  the  active/ 
injection  regions:  in  the  Ml  device,  an  upper  cladding 
region  separates  the  laser  core  from  the  top  metal  contact, 
whereas  in  the  M2  device  the  optical  mode  is  confined 
between  the  lower  cladding  and  the  metal  contact  and/ 
or  air,  depending  on  how  the  mesa  top  is  contacted.  The 
Ml  characteristics  show  a  threshold  current  density  of 
13.2  A/cm2  and  an  initial  slope  efficiency  of  0.65  W/A  at  a 
heatsink  temperature  of  80  K.  The  initial  slope  efficiency 
corresponds  to  a  differential  external  quantum  efficiency  of 
~350%  (approximately  19%  differential  efficiency  per 
stage)  when  the  output  from  both  facets  is  assumed  to  be 
equal.  At  100  mA  of  CW  current,  the  output  power 
measured  from  a  single  facet  was  57  mW.  The  I-V 
characteristics  of  Ml  exhibit  a  high  resistance  below 
threshold  and  an  abrupt  decrease  in  differential  resistance 
at  threshold,  with  the  decrease  occurring  at  6.95  V.  Based  on 
the  observed  80  K  laser  emission  at  3.315  p,m  (373.9  me V) 
and  the  presence  of  18  cascaded  stages,  this  corresponds  to  a 
voltage  efficiency  of  96%.  By  comparison,  the  output  power 
and  slope  efficiencies  of  the  M2  FP  device  are  greatly 
reduced.  It  is  clear  that  this  reduction  in  M2’s  performance 
relative  to  Ml  is  not  due  to  the  active/injection  region 
designs,  but  is  instead  related  to  the  waveguide  structure 
and/or  to  the  quality  of  the  electrical  contacts  made  to  the 
core  region.  In  the  M2  FP  design,  the  optical  mode  is  larger 
at  the  top  metal  contact  than  in  the  Ml  device.  Conse¬ 
quently,  optical  losses  were  expected  to  be  larger  in  the  M2 
FP  and  DFB  devices.  Because  of  this,  the  p-GaSb  separate- 
confinement/contact  layer  was  doped  with  Be  at  a  relatively 
low  level  of  5-6  x  1017  cm-3.  This  doping  level  was  chosen 
as  a  compromise  between  the  simultaneous  requirements  of 
high  electrical  conductivity  in  the  layer  and  low  free-carrier 
losses.  Despite  the  generally  inferior  characteristics  of  the 
M2  device  relative  to  the  Ml  device,  the  M2  device  still  has 
a  reasonably  good  diode  characteristic,  showing  a  high 
initial  resistance  to  current  flow  and  a  rapid  drop  in 
resistance  above  approximately  7  V. 


current,  mA 


Fig.  4  CW  L—I—V  characteristics  of  the  65- /an  x  0.98- pm  FP 
device  of  [3]  at  a  heatsink  temperature  of  80  K 


4  Performance  of  DFB  lasers 

L-I-V  characteristics  of  the  DFB  laser  structure 
processed  with  the  M2  grating  material  and  with  metal 
contacts  along  only  the  top  edges  of  the  mesa  are  shown 
in  Fig.  5.  The  laser  exhibited  a  threshold  current  of 
~  165  mA  in  the  pulsed  mode  at  60  K.  Note  that  the  I-V 
characteristics  do  not  show  the  high  resistance  at  low 
current  that  was  observed  for  the  M2  FP  device  (Fig.  3) 
whose  mesa  was  covered  with  a  thick  Au  contact.  To 
determine  whether  this  might  be  related  to  the  relatively 
low  doping  in  the  Be-doped  contact  layer,  another  DFB 
device  was  fabricated  from  the  same  M2  DFB  material, 
but  a  100-A-thick  Au  contact  was  deposited  over  the 
entire  top  of  the  grating  structure  (electrically  connecting 
the  thicker  Au  edge  contacts).  The  L-I-V  characteristics 
of  this  device  are  shown  in  Fig.  6,  and  the  data  shows 
significantly  improved  I-V  characteristics  with  a  high 
initial  resistance  and  improved  forward  bias  diode 
characteristics.  Additionally,  the  device  exhibited  a 
lower  threshold  current  of  ~10mA  at  80  K,  correspond¬ 
ing  to  a  threshold  current  density  of  27  A/cm2.  This 
threshold  current  density  compares  favourably  with  the 
threshold  current  density  of  72  A/cm2  at  60  K  of  the 
device  shown  in  Fig.  5,  presumably  due  to  the  thinner 
gold  contact  layer  resulting  in  a  lower  optical  loss  and 
reduced  spreading  resistance  in  the  top  GaSb  layer.  Note, 
however,  that  this  same  device  did  not  emit  in  a  single 
longitudinal  mode  but  exhibited  FP  emission 
characteristics. 

Lasing  spectra  acquired  under  pulsed  conditions  are 
shown  in  Fig.  7  for  the  DFB  laser  device  shown  in  Fig.  5. 
The  lasing  spectrum  exhibits  a  single  longitudinal  mode 
near  3.142  jxm  at  60  K.  When  the  device  is  processed  with 
the  grating-free  laser  material  in  a  FP  geometry,  it  exhibits  a 
FP  lasing  spectra  centred  near  a  wavelength  of  3.30  p,m 
(multimode  spectra  at  right-hand  side  of  Fig.  7).  The 
separation  between  these  lasing  energies  is  nearly  19meV; 
an  energy  comparable  to  the  width  of  the  device’s 
spontaneous-emission  spectrum.  This  difference  could  be 
explained  if  our  estimate  of  the  modal  index  in  the  DFB 
material  were  ~5%  larger  than  its  actual  value.  This  would 
also  be  consistent  with  the  high  threshold  current  density 
seen  in  Fig.  5.  Note  also  that  the  cavity  length  for  the  DFB 
laser  is  half  that  of  the  FP  device.  Thus,  the  longitudinal 
mode  spacing  in  the  DFB  device  would  be  twice  the  spacing 


Fig.  5  L-I-V  characteristics  of  an  edge-contacted,  115-gim- 
wide  by  0.54-mm-long  mesa  DFB  laser 

Data  were  acquired  under  pulsed  conditions  using  a  1-juls  pulse  width  and  a 
10  kHz  repetition  rate  at  a  heatsink  temperature  of  60  K 
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Fig.  6  L-I-V  characteristics  of  an  edge-contacted,  35-gim-wide 
by  1.04-mm-long  mesa  DFB  laser  acquired  under  pulsed 
conditions  using  a  l-fis  pulse  width  and  a  10  kHz  repetition  rate 
at  a  heatsink  temperature  of  80  K 
Top  of  grating  is  covered  with  an  80-A-thick  Au  contact 


energy,  meV 


P—  hmi  '  =■  ;  ■) — ^  1 

3.12  3.13  3.14  3.15  3.16  3.17 


wavelength,  pm 

Fig.  8  High-resolution  emission  spectrum  of  the  DFB  device 
described  in  Fig.  5  under  pulsed  conditions  at  70  K 

Vertical  scale  is  logarithmic  showing  side-mode  suppression  exceeding 
30  dB  at  190  mA  injection  current.  The  spectra  were  acquired  using  1-p.s 
pulses  at  a  10  kHz  repetition  rate 


of  the  cavity  modes  shown  in  Fig.  7  for  the  FP  device. 
Consequently,  the  DFB  lasing  spectra  clearly  consists  of  a 
single  longitudinal  mode.  However,  because  the  processed 
mesa  widths  are  rather  broad  (>65pm),  all  the  lasers 
investigated  here  probably  emitted  several  lateral  modes. 

Higher  resolution  lasing  spectra  are  shown  in  Fig.  8. 
At  a  temperature  of  70  K  and  a  pulsed  injection  current 
of  190  mA,  the  DFB  laser  exhibits  a  side-mode  suppres¬ 
sion  ratio  in  excess  of  30  dB.  At  200  mA  injection 
current,  the  emergence  of  another  mode  can  be  seen  near 
3.122  pun.  Note  that  the  vertical  axis  is  logarithmic. 
Figure  9 a  shows  spectra  acquired  under  pulsed  conditions 
at  temperatures  ranging  from  50  to  80  K.  The  DFB  lasing 
wavelength  is  plotted  as  a  function  of  temperature  in 
Fig.  9b.  The  laser  exhibits  a  temperature  tuning  rate  of 
O.lnm/K,  which  is  consistent  with  expectations  based  on 
the  variation  of  the  refractive  index  of  III-V  materials 
with  temperature. 


wavelength,  pm 


Fig.  7  Emission  spectra  of  the  DFB  Single  longitudinal  mode 
emission  at  60  K  (left)  from  edge-contacted  DFB  laser  device 
described  in  Fig.  5  and  multimode  emission  is  observed  from  FP 
device  fabricated  from  the  same  M2  laser  material  (right)  at  80  K. 
Both  spectra  were  acquired  using  a  1-gis  pulse  width  and  a  10  kHz 
repetition  rate 

(i)  Single-longitudinal-mode  DFB  operation,  115  pm  x  0.54  mm  mesa, 
r  =  60K,/=  175  mA 

(ii)  65  pm  x  1.01  mesa,  Fabry-Perot  laser,  T  =  80  K,  /  =  60  mA 


At  higher  pulsed-current  levels,  the  DFB  laser  exhibits 
line  width  broadening.  Several  lasing  spectra  at  high  pulsed- 
current  levels  are  shown  in  Fig.  10.  Note  the  significant 
broadening  of  the  laser  line  width  at  higher  current  levels. 
The  broadening  is  attributed  to  heating  during  the  current 
pulse  as  well  as  the  influence  of  the  higher  order  lateral 
modes  at  higher  injection-current  levels. 

energy,  meV 


wavelength,  pm 
a 


temperature,  K 
b 

Fig.  9  Emission  characteristics  of  DFB  device  described  in  Fig.  5 

a  High-resolution  emission  spectrun  under  pulsed  conditions  between  50  K 
and  80  K 

b  Emission  wavelength  against  heatsink  temperature  between  50  K  and 
80  K 


IEE  Proc.-Optoelectron.,  Vol.  150,  No.  4,  August  2003 


291 


energy,  meV 


396.0  395.5  395.0  394.5  394.0  393.5 

10“3 


£ 

1  10-4 

2 
!q 

CO 

£ 

«  10“5 
<D 


itr6 

3.13  3.14  3.15 

wavelength,  pm 

Fig.  10  High  -resolution  emission  spectra  of  the  DFB  device 
described  in  Fig.  5  under  pulsed  conditions  at  50  K  and  injection 
currents  of  174,  200  and  225  mA 

Effects  of  heating  during  the  pulse  and  multiple  lateral  modes  are  evident 


provide  feedback  closer  to  the  M2  material’s  gain  peak,  and 
the  improved  top  GaSb  layer  conductivity  will  allow  for  a 
reduced  spreading  resistance  and  eliminate  the  need  for 
covering  the  grating  with  Au  to  achieve  improved  I-V 
characteristics.  This  should  significantly  reduce  optical 
losses,  and  improve  device  output  power,  slope  efficiency 
and  temperature  performance. 
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We  have  successfully  fabricated  the  first  resonant  magnetic  nanostructures  exhibiting  a  negative 
permeability  in  the  midinfrared.  These  metal-dielectric  structures  exhibit  local  resonances  based  on 
metallic  inductive-capacitive  effects  that  are  controlled  by  the  dimensions  of  the  individual 
nanostructures  and  are  independent  of  the  periodicity,  which  is  much  smaller  than  the  resonance 
wavelengths.  Compared  to  other  commonly  used  structures  for  obtaining  negative  permeability  in 
the  microwave  and  THz  regions  which  are  based  on  a  planar  fabrication  paradigm,  this  structure  is 
oriented  vertically  so  that  the  smallest  features  are  controlled  by  deposition  rather  than  by 
lithography.  ©  2004  American  Vacuum  Society.  [DOI:  10.1116/1.1824067] 


I.  INTRODUCTION 

Double  negative  materials  (DNM),  or  left-handed  materi¬ 
als,  are  of  intense  current  interest  as  a  result  of  their  uncon¬ 
ventional  optical  properties1  that  promise  enhanced  linear 
and  nonlinear  optical  effects.2  A  DNM  has  both  a  negative 
permittivity  (e)  and  a  negative  permeability  (/z).  While  met¬ 
als  below  the  plasma  frequency  provide  a  negative  permit¬ 
tivity,  naturally  occurring  materials  with  negative  permeabil¬ 
ity  are  not  available.  Composite  electromagnetic  materials 
with  a  structural  scale  much  smaller  than  an  electromagnetic 
wavelength  can  act  as  effective  media  with  negative  perme¬ 
ability. 

The  most  widely  investigated  structure  for  establishing 
magnetic  resonance  and  thus  a  negative  permeability  is  the 
split  ring  resonator  (SRR)  proposed  by  Pendry  et  al.  in 
1999. 3  The  SRR  consists  of  two  concentric  metal  rings  each 
with  a  gap  facing  towards  the  opposite  directions.  This  an¬ 
tenna  structure  can  be  viewed  as  an  equivalent  inductor- 
capacitor  (LC)  tank  circuit;  the  inductance  is  due  to  the  loops 
while  the  capacitance  is  due  to  the  gaps  and  the  mutual  cou¬ 
pling  between  the  two  loops.  When  a  time- varying  magnetic 
held  penetrates  the  structure,  currents  are  induced  in  the 
metal  rings,  the  magnetic  held  generated  by  the  current  ei¬ 
ther  opposes  or  adds  to  the  external  magnetic  held;  at  fre¬ 
quencies  just  above  the  resonance,  negative  permeability  can 
be  realized.  Since  this  initial  proposal,  many  experiments 
have  been  reported  at  microwave  frequencies.4 

There  has  been  a  strong  interest  in  extending  these  mag¬ 
netic  resonance  structures  to  higher  frequencies  to  extend  the 
domain  of  application.  Recently,  Yen  et  al.  fabricated  a  SRR 
structure  with  a  resonance  around  1  THz.5  They  used  con¬ 
ventional  planar  processing  technologies  with  minimum  fea¬ 
tures  sizes  in  the  range  of  ~  1  gm,  dehned  by  optical  lithog¬ 
raphy.  Using  conventional  technologies,  it  is  difficult  to  scale 
this  structure  by  the  factors  of  —100  to  1000  needed  to  shift 
the  resonance  to  the  infrared  and  optical  spectral  regions.  In 
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this  article,  we  describe  the  fabrication  and  modeling  of  an 
array  of  nanostructures  with  a  resonance  in  the  mid-IR  re¬ 
gion  (to  65  THz)  and  properties  that  are  indicative  of  a  nega¬ 
tive  effective  permeability.  This  structure  has  a  vertical  con¬ 
figuration,  the  smallest  features  are  dehned  by  self-aligned 
deposition  instead  of  lithography,  so  they  are  relatively  easy 
to  control  at  the  ~1  nm  level,  and  the  processing  steps  are 
very  robust  and  readily  extended  to  large  areas.  To  our 
knowledge,  this  is  the  first  experimental  work  on  negative 
permeability  reported  for  the  mid-IR. 

The  structure  (schematic  is  shown  in  Fig.  1)  consists  of  an 
array  of  upper  open  gold  “staples”  each  with  two  footings, 
separated  from  a  lower  continuous  gold  layer  by  a  dielectric 
layer  of  ZnS.  The  light  is  incident  along  the  normal  to  the 
wafer  (z  direction  in  Fig.  1),  and  the  polarization  is  dehned 
as:  TM(TE)  for  E(H)  held  along  the  v  direction.  Each  staple 
is  an  LC  circuit  with  the  major  part  of  the  inductor  associated 
with  the  loop  of  the  staple  and  with  capacitors  formed  be¬ 
tween  the  top  gold  footings,  the  dielectric  layer,  and  the  bot¬ 
tom  gold  layer.  In  the  simplest  model,  the  inductance  and  the 
capacitance  of  the  LC  circuit  can  be  written  as  L=jul0A 
=  jul0WH  and  C=sdc/tc ,  where  A  is  the  area  of  the  loop,  s  is 
the  dielectric  constant  of  ZnS,  and  the  other  parameters  are 
labeled  in  Fig.  1.  However,  due  to  the  hnite  electron  inertia 
of  the  metal  at  optical  frequencies  and  edge  effects  of  the 
structure  geometry,  both  the  inductance  term  and  capacitance 
term  cannot  be  simply  expressed  by  analytical  expressions. 
Still,  the  geometrical  parameters  qualitatively  explain  how 
resonance  frequency  changes;  for  example,  when  the  footing 
size  dc  is  increased,  the  capacitance  increases,  and  the  reso¬ 
nance  frequency  /=  1  /  yfEc  decreases.  The  same  reasoning 
also  applies  to  the  change  of  frequency  due  to  a  the  change  in 
the  inductance. 

As  a  result  of  the  thick  Au  him  (>IR  skin  depth),  light 
can  only  be  reflected  or  absorbed  from  the  structure;  there  is 
no  transmission  across  the  IR.  For  analysis,  it  is  simpler  to 
model  the  transmission  through  the  staple  and  its  mirror  im¬ 
age  in  the  metal  surface,  as  shown  in  the  bottom  panel  of 
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Fig.  1.  Top:  Schematic  of  the  nanostructures.  The  light  is  incident  along  the 
normal  from  the  top;  for  TM  polarization,  H  is  in  the  ^-direction,  perpen¬ 
dicular  to  the  loop.  Bottom:  The  equivalent  LC  circuit  formed  between  the 
top  staple  structure  and  its  image  in  the  metal.  The  dashed  lines  are  the 
reference  planes  for  the  effective  permeability  calculation. 

Fig.  1.  The  period  of  the  array  is  600  nm,  which  is  much 
smaller  than  the  resonant  wavelength  that  varied  between  4- 
and  7-/zm  for  these  experiments.  Three  ID  samples  (mag¬ 
netic  structures  extending  >X  into  the  plane  of  Fig.  1)  (A,  B, 
C)  and  one  2D  (as  in  Fig.  2;  extent  of  each  individual  struc¬ 
ture  sample  (D)  were  fabricated  with  different  geomet¬ 
ric  parameters  as  shown  in  Table  I.  Figure  2  shows  the  FEM 
pictures  of  the  2D  structure  (D)  and  ID  structure  (B). 
Samples  A  and  C  have  different  footing  sizes  ( dc )  while 
samples  A  and  B  have  different  staple  heights  (. H ),  with  other 
parameters  held  constant.  Sample  D  has  the  same  parameters 


Fig.  2.  SEM  pictures  of  the  arrays  of  2D  [sample  D,  shown  in  (a)]  and  ID 
[sample  B,  shown  in  (b)]  staple-shaped  nanostructures. 


Table  I.  Sample  geometrical  parameters  (all  dimensions  in  nm). 


Sample 

W 

H 

dc 

?Au 

(a.u  top 

U 

A 

130 

280 

190 

—  15 

-30 

80 

B(D) 

130 

180 

190 

-15 

-30 

80 

C 

130 

280 

90 

-15 

-30 

80 

as  B  except  for  the  segmentation  in  the  y -direction  and  the 
removal  of  the  material  inside  the  loop. 

II.  FABRICATION 

The  processing  steps  for  the  ID  samples  are  as  follows. 
First,  a  ~100-nm  thick  layer  of  gold  is  evaporated  onto  a 
silicon  wafer,  followed  by  dielectric  evaporation  of  a  80-nm 
thick  layer  of  ZnS  (n~  2.26),  which  is  transparent  through¬ 
out  the  ~2-  to  10-jiim  IR  region.  The  Au  layer  forms  a 
ground  plane  for  the  structure  and  the  ZnS  is  the  dielectric 
spacer  for  the  capacitor.  Next,  a  layer  of  Si3N4(n~  1.7)  is 
grown  on  the  sample  by  plasma  enhanced  chemical  vapor 
deposition  (PECVD),  followed  by  spun-on  layers  of  a  poly¬ 
mer  bottom  antireflection  coating  (BARC)  and  an  I-line  pho¬ 
toresist  (PR).  A  large-area,  600-nm  pitch,  one-dimensional 
grating  is  fabricated  by  interferometric  lithography  (IL)  us¬ 
ing  a  355-nm  third-harmonic  YAG  laser  source  [Figs.  3(a) 
and  3(b)].6  The  pattern  is  transferred  into  the  Si3N4  layer  by 
reactive  ion  etching  (RIE)  using  CHF3  and  02  gases  and  the 
residual  PR  and  ARC  are  subsequently  removed  by  plasma 
ashing  with  02  gas  [Fig.  3(c)].  The  etching  is  selective  and 
leaves  the  ZnS  layer  untouched.  Metal  evaporation  is  carried 
out  at  an  angle  to  deposit  Au  on  the  top  and  one  sidewall  of 


a  <-  Substrate 


1 1 1 1 1 

c 


Fig.  3.  Schematic  diagrams  of  fabrication  steps  for  ID  structure,  (a)  Inter¬ 
ferometric  lithography  with  two  coherent  UV  beams,  (b)  The  grating  in  PR 
after  development,  (c)  The  grating  in  the  nitride  layer  after  RIE  etching,  (d) 
Au  deposition  at  an  angle  from  one  side. 
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Fig.  4.  Schematic  flow  charts  of  processing  steps  for  2D  structure,  (a)  Grat¬ 
ings  in  the  nitride  layer,  (b)  Arc  and  PR  are  deposited  by  spin  coating,  (c) 
Gratings  in  PR  perpendicular  to  the  nitride  gratings  are  fabricated,  (d)  Lift¬ 
off  process  to  obtain  metal  gratings,  (e)  Metal  evaporations  at  an  angle  are 
carried  out  on  both  sides,  (f)  ARC  is  removed  by  02  plasma  ashing,  (g) 
Nitride  removed  by  CHF3  plasma  ashing. 

the  Si3N4  gratings,  and  to  form  the  footings  on  one  side  [Fig. 
3(d)].  A  second  evaporation  is  then  carried  out  from  the  other 
side  to  form  the  completed  “staple”  structure.  Because  the 
top  of  the  staple  structure  receives  two  Au  depositions,  the 
thickness  of  the  Au  on  the  top  is  twice  that  on  the  sidewalls 
and  footings.  The  angle  of  the  metal  evaporation  along  with 
the  nitride  dimensions  controls  the  footing  size,  and  hence 
the  capacitance.  The  dimensions  of  the  nitride  pattern  control 
the  area  of  the  loop  and,  thus,  the  inductance. 

For  the  2D  samples,  the  same  procedures  are  followed  up 
to  Fig.  3(c),  which  is  repeated  as  Fig.  4(a).  A  thick  layer  of 
ARC  is  then  spun  on  the  wafer  to  cover  the  Si3N4  grating 
and  planarize  the  surface.  Then  a  layer  of  PR  is  added  by 
spinning  [Fig.  4(b)].  Another  exposure  is  carried  out  to  pro¬ 
duce  gratings  perpendicular  to  the  buried  Si3N4  gratings 
[Fig.  4(c)],  followed  by  a  metal  evaporation  at  normal  inci¬ 
dence  [Fig.  4(d)].  The  wafer  is  then  immersed  in  acetone  to 
remove  the  PR,  and  a  grating  pattern  is  obtained  in  the  metal. 
Using  the  metal  grating  as  mask,  RIE  etching  in  an  02 
plasma  is  done  to  selectively  etch  away  the  thick  ARC  layer 
locally  exposing  the  Si3N4  gratings  which  are  not  etched  in 
the  02  plasma.  Two  evaporations  of  Au  at  an  angle  are  car¬ 
ried  out  to  form  the  staple  structures  [Fig.  4(e)].  The  ARC 
layer  is  then  removed  by  a  high  pressure,  isotropic  02 
plasma  ashing  step  [Fig.  4(f)].  Finally,  the  Si3N4  gratings  are 
removed  by  CF4  plasma  ashing  leaving  the  2D  array  of  hol¬ 
low  staple  nanostructures  [Fig.  4(g)]. 

III.  INFRARED  REFLECTANCE 
CHARACTERIZATION 

FTIR  reflectance  measurements  were  performed  on  these 
four  samples  for  both  TE  and  TM  polarizations.  Due  to  the 
anisotropy  of  the  staple  structure,  we  expect  to  optically 
couple  to  the  resonant  structure  only  when  the  magnetic  field 
of  the  incoming  light  is  perpendicular  to  the  loop  of  the 
staple  (TM  polarization).  The  IR  transmission  is  negligible 
due  to  the  thick  bottom  metal,  so  the  nanostructure  optical 
properties  are  solely  characterized  by  the  resonance  absorp¬ 


Fig.  5.  Experimental  (top)  and  simulation  (middle)  results  for  the  reflectivity 
as  a  function  of  frequency  for  three  different  structures  (cf.  Table  I).  The 
resonance  frequency  is  a  function  of  the  nanostructure  parameters  and  is 
independent  of  the  array  period,  which  is  the  same  for  all  three  structures. 
The  resonance  is  observed  only  for  TM  polarization,  i.e.,  incident  magnetic 
field  coupled  into  the  inductive  structure.  The  bottom  panel  shows  the  ef¬ 
fective  permeability  extracted  from  the  model  for  sample  C. 


tion.  As  expected,  the  TE  measurements  on  all  the  samples 
did  not  show  any  structure  [to  avoid  clutter  in  the  figure,  the 
TE  reflectance  is  shown  only  for  sample  C  in  Fig.  5 (top)]. 
The  TM  measurements  show  reflectance  dips  that  vary  ac¬ 
cording  to  the  sample  geometrical  dimensions  directly  con¬ 
firming  the  existence  of  structure-dependent  magnetic  activ¬ 
ity  for  these  staple  arrays.  The  strong  resonance,  with  as 
much  as  50%  absorption  in  a  single  layer  of  nanostructures, 
suggests  that  very  high  fields  are  generated  in  the  nanostruc¬ 
tures  that  will  be  of  interest  for  future  nonlinear  optics  ex¬ 
periments.  As  shown  in  Fig.  5  (top),  sample  A  with  the  larg¬ 
est  inductance  and  capacitance,  has  the  lowest  resonant 
frequency.  Sample  C  has  the  same  inductance  as  sample  A 
but  a  smaller  capacitance  (smaller  footing  width  dc ),  giving  a 
higher  frequency  resonance  (shorter  wavelength).  Sample  D 
(and  B)  has  the  same  capacitance  as  sample  A  and  a  lower 
inductance  (shorter  loop  height,  H ),  also  giving  a  higher  fre¬ 
quency  resonance.  Comparison  between  the  ID  sample  and 
2D  sample  (samples  B  and  D)  with  the  same  geometric  pa¬ 
rameters  (not  shown)  was  made;  there  is  no  significant  dif¬ 
ference  either  in  the  location  of  the  resonance  dip  or  on  the 
depth  of  the  resonance.  Experimentally,  samples  B  and  D 
have  relatively  smaller  reflection  dips;  the  reason  for  this 
variation  is  under  investigation. 
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IV.  DISCUSSION 


The  simulation  results  for  the  three  structures  are  shown 
in  Fig.  5  (middle).  The  simulation  is  based  on  the  rigorous 
coupled  wave  analysis,  or  RCWA,  which  is  commonly  used 
for  analysis  of  scattering  of  electromagnetic  waves  from  pe¬ 
riodic  structures.7  Bulk  values  of  the  wavelength  dependence 
of  the  Au  optical  properties  were  used.8  The  modeling  ex¬ 
plains  the  experimental  data  well  except  that  the  width  of 
absorption  dip  of  the  experimental  data  is  somewhat  larger 
than  that  of  the  model.  Both  nonideal  material  parameters 
(particularly  electron  scattering  in  the  thin  Au  films)  nonuni¬ 
formity  of  the  structures  across  the  ~l-cm  diameter  mea¬ 
surement  area  can  contribute  to  this  observation. 

To  further  investigate  the  strength  of  this  magnetic  activ¬ 
ity,  we  extracted  the  effective  permeability  of  sample  C  from 
the  simulated  results  of  the  complex  transmission  and  reflec¬ 
tion  coefficient  of  the  equivalent  LC  circuit  (Fig.  1, 
bottom).9,10  For  a  uniform  layer  of  metamaterials  with  inci¬ 
dent  and  outgoing  media  of  air,  the  complex  transmission 
and  reflection  coefficient  are  expressed  as 


l 

t 


cos (nkd)  - 


z  +  -)  sin  (nkd) 


(1) 


r 

t 


sin  (nkd) , 


(2) 


where  n  and  z  are  the  refractive  index  and  impedance  of  the 
metamaterial.  By  inverting  Eqs.  (1)  and  (2),  we  can  obtain  n 
as  a  function  of  r  and  t , 


1 

n-  —  cos 
kd 


-l 


(3) 


After  n  is  evaluated,  z  is  obtained  from 
1  +  r 

i  - -  cos  (nkd) 

sin  (nkd)  ^  ^ 

Finally,  the  complex  value  of  the  effective  permeability  is 
juiQff=nz.  As  shown  in  Fig.  5  bottom,  the  real  part  of  the 
permeability  /z  shows  a  strong  modulation  and  is  negative 
over  a  restricted  frequency  range;  the  imaginary  part  shows 


the  complementary  Lorentzian-type  peak — features  charac¬ 
teristic  of  a  strong  magnetic  resonance.  Extending  the  mod¬ 
eling  (not  shown)  across  a  range  of  structure  parameters  sug¬ 
gests  that  these  results  can  be  extended  to  a  more  negative 
permeability  by  increasing  the  width  W  of  the  staple  struc¬ 
tures,  i.e.,  increasing  the  fill  factor. 

In  summary,  nano  structured  magnetic  resonators  have 
been  fabricated  with  resonances  in  the  mid-IR  spectral  re¬ 
gion.  The  measured  reflectivity  shows  a  strong  absorption 
dip  in  good  agreement  with  the  simulation  results.  The  simu¬ 
lation  shows  strong  magnetic  response  and  indicates  negative 
magnetic  permeability  over  a  limited  frequency  range.  Com¬ 
bining  this  negative  permeability  structure  with  a  negative 
permittivity  structure,  for  example  by  segmenting  the  under¬ 
lying  Au  film,  will  lead  to  metamaterials  with  negative  re- 
fractivity  in  the  IR  suitable  for  transmissive  as  well  as  reflec¬ 
tive  measurements.  Further  work  is  underway  to  investigate 
the  scaling  of  these  structures  to  the  near-IR  and  visible  spec¬ 
tral  regions. 
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Midinfrared  Resonant  Magnetic  Nanostructures  Exhibiting  a  Negative  Permeability 
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We  experimentally  demonstrate  the  first  midinfrared  (mid-IR)  resonant  magnetic  nanostructures 
exhibiting  a  strong  magnetic  response  corresponding  to  a  negative  permeability.  This  result  is  an 
important  step  toward  the  achievement  of  a  negative  refractive  index  in  the  IR.  The  possibility  of 
extending  negative  permeability  to  higher  frequencies  is  discussed;  a  structure  with  a  negative  effective 
permeability  at  a  near-IR  resonance  frequency  of  230  THz  (1.3  jam)  is  proposed. 
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In  1968,  Veselago  proposed  the  concept  of  a  negative 
refractive  index  or  left-handed  material  (LHM)  [1]  with 
both  a  negative  permittivity  (e)  and  a  negative  permeability 
(ja).  Many  interesting  properties  are  associated  with  nega¬ 
tive  materials  including  a  negative  index  of  refraction, 
backward  phase  propagation,  a  reversed  Doppler  effect, 
and  backward  emission  of  Cherenkov  radiation.  Recently, 
interest  in  LHMs  has  increased  substantially  with  the 
theoretical  prediction  that  a  planar  slab  of  LHM  functions 
as  a  perfect  lens  without  any  diffractive  loss  of  resolution 
[1,2]  and  with  the  first  demonstrations  of  LHMs  in  the  rf 
and  THz  electromagnetic  regions  [3-6]. 

While  metals  provide  a  negative  permittivity  at  fre¬ 
quencies  below  the  plasma  frequency,  naturally  occurring 
materials  with  negative  permeability  are  not  available. 
Composite  electromagnetic  materials  with  resonant  struc¬ 
tures  with  sizes  much  less  than  the  wavelength  can  act  as  an 
effective  homogeneous  media  with  a  negative  permeabil¬ 
ity.  Pendry  et  al.  proposed  a  split  ring  structure  that  re¬ 
sponds  to  the  magnetic  field  of  incident  radiation  [7].  The 
split  ring  structure  (SRS)  can  be  viewed  as  an  equivalent 
inductor-capacitor  (LC)  tank  circuit.  In  the  presence  of  a 
time- varying  magnetic  field,  the  magnetic  field  generated 
by  the  current  induced  in  the  ring  opposes  the  external 
magnetic  field.  At  frequencies  in  the  vicinity  of  the  reso¬ 
nance,  a  negative  effective  permeability  can  be  realized. 
While  most  experimental  work  has  been  done  at  micro- 
wave  and,  more  recently,  THz  frequencies,  extending  the 
phenomena  to  infrared  (IR)  and  visible  frequencies  will 
greatly  increase  the  range  of  applications.  Based  on  the 
previous  modeling  work  [7]  and  on  the  fabrication  diffi¬ 
culties  associated  with  scaling  the  SRS  to  higher  fre¬ 
quency,  there  has  been  general  pessimism  about  the 
prospects  of  extending  these  properties  to  optical  frequen¬ 
cies  in  metallic  structures. 

In  this  Letter,  we  describe  the  fabrication,  characteriza¬ 
tion,  and  modeling  of  arrays  of  a  new  nanostructure  design 
with  resonances  in  the  mid-IR  region  and  properties  that 
demonstrate  strong  magnetic  activity  indicative  of  negative 
permeability.  To  our  knowledge,  this  is  the  first  experimen¬ 
tal  work  on  negative  permeability  reported  in  the  IR. 


Importantly,  the  scaling  of  these  results  to  even  higher 
frequency  is  investigated,  and  a  structure  exhibiting  nega¬ 
tive  permeability  in  the  technologically  important  near-IR 
range  (1.3  jam)  is  proposed. 

The  structure  (Fig.  1,  top)  consists  of  an  array  of  gold 
“staples”  each  with  two  outwardly  splayed  footings,  sepa¬ 
rated  from  a  thick  continuous  gold  film  by  a  ZnS  dielectric 
layer.  Each  staple  is  a  LC  circuit  with  the  structure  part  of 
the  inductor  associated  with  the  upper  loop  of  the  staple 
and  with  two  capacitors  formed  between  the  gold  staple 
footings — the  dielectric  layer  and  the  bottom  gold  layer. 
As  a  result  of  the  thick  Au  film,  light  can  only  be  reflected 
from  or  absorbed  within  the  structure;  there  is  no  trans¬ 
mission  in  the  IR.  For  analysis,  it  is  simpler  to  view  this 
structure  in  transmission  as  the  staple  and  its  image  mir- 
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FIG.  1.  Top:  schematic  of  the  nanostructures.  The  light  is 
incident  from  the  top;  for  TM  polarization,  H  is  in  the  y 
direction,  perpendicular  to  the  loop.  Bottom:  the  equivalent 
LC  circuit  formed  between  the  top  staple  structure  and  its  image 
in  the  metal.  The  dashed  lines  are  the  reference  planes  for  the 
effective  permeability  calculation. 
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rored  in  the  metal  surface,  as  shown  in  the  bottom  panel  of 
Fig.  1.  The  period  of  the  array  is  600  nm,  much  smaller 
than  the  experimentally  observed  resonant  wavelengths  of 
between  4  and  7  jmm.  The  area  of  the  open  gold  ring  loop 
and  the  size  of  the  staple  footings  along  with  the  thickness 
of  the  ZnS  dielectric  determine  the  inductance  and  capaci¬ 
tance,  respectively,  in  the  equivalent  LC  circuit.  Three  ID 
samples  (magnetic  structures  extending  »A  into  the  plane 
of  Fig.  1)  (A,  B,  C )  and  one  2D  structure  (as  in  Fig.  2,  extent 
of  each  individual  structure  «A)  [sample  (D)]  were  fab¬ 
ricated  with  different  geometric  parameters  as  shown  in 
Table  I.  Figure  2  shows  SEM  micrographs  of  a  2D  structure 
( D )  and  a  ID  structure  (. B ).  Samples  A  and  C  have  different 
footing  sized  ( dc )  while  samples  A  and  B  have  different 
staple  heights  (//),  with  all  other  parameters  held  constant. 
Sample  D  has  the  same  parameters  as  B  except  for  the 
periodicity  in  the  y  direction  and  the  removal  of  the  nitride 
inside  the  loop. 

The  processing  steps  for  the  ID  samples  are  as  follows. 
First,  a  200  nm  thick  gold  film  is  deposited  on  a  silicon 
wafer  by  metal  evaporation,  followed  by  dielectric  evapo¬ 
ration  of  a  layer  of  ZnS  ( n  ~  2.26).  Zinc  sulfide  is  chosen 
because  it  is  transparent  throughout  the  — 2  to  10  jmm 
IR  region.  Next,  a  layer  of  Si3N4  is  grown  on  the  sample 
by  plasma  enhanced  chemical  vapor  deposition  (PECVD) 
0 n  ~  1.7),  followed  by  spun-on  layers  of  a  bottom  anti¬ 
reflection  coating  (BARC)  and  an  i- line  photoresist  (PR). 
A  ID  600  nm  pitch  grating  is  fabricated  by  interfero¬ 
metric  lithography  (IL)  using  a  355  nm,  third-harmonic 
(YAG)  laser  source  [8]  [Fig.  3(a)  and  3(b)].  The  pattern 
is  then  transferred  into  the  Si3N4  layer  by  reactive  ion 


FIG.  2.  SEM  pictures  of  the  arrays  of  2D  [sample  D,  shown 
in  (a)]  and  ID  [sample  B ,  shown  in  (b)]  staple-shaped  nano¬ 
structures. 


etching  (RIE)  and  the  residual  PR  and  ARC  are  removed 
[Fig.  3(c)].  The  etching  is  selective  and  does  not  affect  the 
ZnS  layer.  Shadow  evaporation  is  used  to  deposit  Au  on  the 
top  and  on  one  sidewall  of  the  Si3N4  gratings,  and  to  form 
the  footing  on  one  side  [Fig.  3(d)]  of  the  staples.  A  second 
evaporation  is  carried  out  at  the  opposite  tilt  to  form  the 
completed  “staple”  structure.  The  angle  of  the  metal 
evaporation  controls  the  footing  size.  For  the  2D  samples, 
the  same  procedures  are  followed  up  to  Fig.  3(c),  then  ARC 
and  PR  are  spun  on  the  wafer  again,  and  another  grating  of 
the  same  period  but  perpendicular  to  the  Si3N4  grating  is 
fabricated,  and  further  etching  and  lift-off  processes  are 
performed.  For  these  2D  structures,  the  nitride  is  removed 
by  plasma  ashing  after  the  metal  has  been  deposited. 

Because  of  the  anisotropic  nature  of  the  staple  structure, 
we  expect  to  couple  to  the  resonant  structure  only  when  the 
magnetic  field  of  the  incoming  light  is  perpendicular  to  the 
loop  of  the  staple  (TM  polarization).  Fourier  transform 
infrared  spectroscopy  (FTIR)  reflectance  measurements 
were  performed  on  these  four  samples  for  both  TE  and 
TM  polarizations.  The  IR  transmission  is  negligible  due  to 
the  thick  bottom  metal,  so  the  nanostructure  optical  prop¬ 
erties  are  solely  characterized  by  the  resonance  absorp¬ 
tion.  As  expected,  the  TE  measurements  on  all  the  samples 
did  not  show  any  structure  [to  avoid  clutter  in  the  figure, 
the  TE  reflectance  is  shown  only  for  sample  C  in  Fig.  4 
(top)].  The  TM  measurements  show  reflectance  dips  that 
vary  according  to  the  sample  geometry  directly  confirming 
the  existence  of  structure-dependent  magnetic  activity  for 
these  staple  arrays.  The  strong  resonance,  with  as  much 
as  50%  absorption  in  a  single  layer  of  nanostructures, 
suggests  that  very  high  fields  are  generated  in  the  nano¬ 
structures  that  will  be  of  interest  for  future  nonlinear  op¬ 
tics  experiments.  As  shown  in  Fig.  4  (top),  sample  A  has 
the  largest  inductance  and  capacitance,  so  the  resonance 
occurs  at  the  lowest  frequency.  Sample  C  has  the  same 
inductance  as  sample  A  but  a  smaller  capacitance,  giving  a 
higher  frequency  resonance  (shorter  wavelength).  Samples 
D  and  B  have  the  same  capacitance  as  sample  A  and  a 
smaller  inductance,  also  giving  a  higher  frequency  reso¬ 
nance.  Comparison  between  the  ID  sample  and  2D  sample 
(samples  B  and  D)  with  the  same  geometric  parameters 
(not  shown)  was  made;  there  is  no  significant  difference 
either  in  the  location  of  the  resonance  dip  or  the  depth  of 
the  resonance.  Experimentally,  both  samples  B  and  D  have 
relatively  smaller  reflection  dips;  the  reason  for  this  varia¬ 
tion  is  under  investigation. 


TABLE  I.  Sample  geometrical  parameters  (all  dimensions 
in  nm). 
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UV  light 


FIG.  3  (color  online).  Schematic  diagrams  of  fabrication  steps. 

(a)  Interferometric  lithography  with  two  coherent  UV  beams. 

(b)  The  grating  in  PR  after  development,  (c)  The  grating  in  the 
nitride  layer  after  RIE  etching,  (d)  Au  deposition  at  an  angle 
from  one  side. 


Rigorous  coupled  wave  analysis  (RCWA)  results  for  the 
three  ID  structures  are  shown  in  Fig.  4  (middle).  The 
RCWA  is  commonly  used  for  analysis  of  scattering  of 
electromagnetic  waves  from  periodic  structures  [9].  Bulk 
values  of  the  wavelength  dependence  of  the  Au  optical 
properties  were  used  [10].  The  modeling  explains  the 
experimental  data  well  except  that  the  width  of  absorption 
dip  of  the  experimental  data  is  somewhat  larger  than  that 
of  the  model.  The  broadening  is  probably  due  to  the  non¬ 
uniformity  of  the  structures  across  the  ~1  cm  measured 
spot  as  well  as  to  the  nonideal  material  parameters. 

To  further  investigate  the  strength  of  this  magnetic 
activity,  we  extracted  the  effective  permeability  of  sample 
C  from  the  equivalent  LC  circuit  (Fig.  1,  bottom) 
[11,12].  The  real  part  of  the  permeability  shows  a  very 
strong  modulation  and  is  negative  over  a  restricted  fre¬ 
quency  range;  the  imaginary  part  shows  a  complementary 
Lorentzian-like  peak,  features  characteristic  of  a  strong 
magnetic  resonance.  Extending  the  modeling  across  a 
range  of  structure  parameters  shows  that  a  more  negative 
permeability  can  be  obtained  by  increasing  the  width  W  of 
the  staple  structures,  i.e.,  increasing  the  fill  factor. 

Direct  scaling  of  all  dimensions  of  the  present  structure 
to  extend  these  results  to  higher  frequency  was  investi¬ 
gated.  The  results  are  shown  in  Fig.  5.  The  resonance 
frequency  does  not  scale  linearly  with  the  inverse  of  the 
dimensional  scaling  factor;  when  the  dimensions  are  re¬ 
duced  by  a  factor  of  4,  the  resonance  only  shifts  from  70  to 


FIG.  4.  Experimental  (top)  and  simulation  (middle)  results  for 
the  reflectivity  as  a  function  of  frequency  for  three  different 
structures  (cf.  Table  I).  The  resonance  frequency  is  a  function  of 
the  nanostructure  parameters  and  is  independent  of  array  period, 
which  is  the  same  for  all  three  structures.  The  resonance  is 
observed  only  for  TM  polarization,  i.e.,  incident  magnetic  field 
coupled  into  the  inductive  structure.  The  bottom  panel  shows  the 
effective  permeability  extracted  from  the  model  for  sample  C. 


100  THz.  At  the  same  time,  the  minimum  permeability 
becomes  positive,  evidencing  weaker  magnetic  effects. 
This  is  consistent  with  previous  results.  Two  terms  con¬ 
tribute  to  the  inductance:  a  structural  term  Lg  related  to  the 
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FIG.  5.  Scaling  of  the  present  structure  does  not  extend  nega¬ 
tive  permeability  to  significantly  higher  frequencies  as  a  result  of 
inertial  contributions  to  the  structure  inductance. 
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FIG.  6.  Simulation  of  the  effective  permeability  for  an  opti¬ 
mized  structure  extending  the  regime  of  negative  permeability  to 
near-IR  wavelengths. 

area  of  the  loop,  and  an  intertial  term  Lt  associated  with 
the  finite  electron  mass.  Lt  is  proportional  to  the  circum¬ 
ference  of  the  loop  and  inversely  proportional  to  the  thick¬ 
ness  of  the  metal.  The  difficulty  with  simple  scaling  is  the 
Lt  is  independent  of  the  scale  factor  since  the  effects  of 
reducing  the  loop  perimeter  and  the  metal  thickness  cancel. 
Consequently,  the  inductance  and  the  resonance  frequency 
vary  only  weakly  with  dimensional  scaling. 

The  resonance  properties  of  our  experimental  samples 
are  not  yet  optimized:  the  ratio  of  the  linewidth  W 
(130  nm)  to  the  pattern  period  (600  nm)  results  in  a  small 
fill  factor;  and  the  height  of  the  loop  H  (280  nm)  is  much 
larger  than  W ,  so  the  ratio  of  the  area  to  the  perimeter  is 
relatively  small,  increasing  the  relative  importance  of  L{. 
Increasing  W ,  decreasing  H  so  that  2 H  ~  W ,  and  only 
reducing  the  metal  thickness  slightly  during  the  scaling 
improves  the  results  dramatically.  Figure  6  shows  the 
simulated  effective  permeability  for  an  optimized  structure 
(with  dimensions  given  in  the  last  row  of  Table  I).  The 
pitch  is  reduced  to  180  nm,  again  small  compared  to  the 
resonance  wavelength  of  1.30  jmm.  The  dielectric  material 
was  changed  to  Si02  to  further  decrease  the  capacitance. 
The  real  part  of  permeability  has  a  strong  modulation  and  a 

minimum  as  low  as - 1  at  a  frequency  of  230  THz 

(1.3  jmm)  in  the  center  of  the  technologically  important 
near-IR  spectral  region. 

In  summary,  nanostructured  magnetic  resonators  have 
been  fabricated  with  resonances  in  the  mid-IR  spectral 
region.  The  measured  reflectivity  shows  an  absorption 
dip  in  good  agreement  with  the  simulation  results.  The 


simulation  shows  strong  magnetic  response  and  indicates 
negative  magnetic  permeability  over  a  limited  frequency 
range.  Scaling  to  higher  frequencies  is  investigated,  and  an 
optimized  structure  exhibiting  strong  magnetic  resonance 
behavior  at  near-IR  wavelengths  is  proposed.  Combining 
this  negative  permeability  structure  with  a  negative  per¬ 
mittivity  structure  will  lead  to  metamaterials  with  negative 
refractivity  in  the  IR.  This  can  be  accomplished  by  appro¬ 
priately  modifying  the  lower  metal  film,  for  example,  by 
segmenting  it  in  the  orthogonal  direction  into  an  array  of 
wires  to  decrease  the  effective  permittivity  [7].  This  would 
also  lift  the  present  restriction  to  reflective  operation.  The 
independent  control  of  the  magnetic  (staple)  and  electrical 
(wires)  response  based  on  simple  geometrical  changes  will 
allow  tuning  of  the  electromagnetic  response  over  a  wide 
range. 

Support  for  this  work  was  provided  by  the  ARO/MURI 
in  “Deep  Subwavelength  Optical  Nanolithography.” 
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We  report  the  observation  of  enhanced  near-infrared  transmission  through  arrays  of  subwavelength 
coaxial  metallic  structures  compared  with  that  through  comparable  diameter  hole  arrays  as  a  result  of 
localized  electromagnetic  modes  supported  by  the  complex  coaxial  unit  cell.  Polarization  and  angle- 
dependent  transmission  measurements  clearly  demonstrate  the  coupling  between  this  localized  mode  and 
delocalized  surface  plasmon  modes.  A  generalized,  multiple  discrete  states  Fano  line  shape  provides  a 
good  fit  to  the  experimental  results. 
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The  observation  of  anomalously  high  transmission 
through  periodic  arrays  of  subwavelength  holes  in  metal 
films  has  sparked  considerable  interest  [1].  The  small 
extent  of  these  structures  could  lead  to  subwavelength- 
scale  photonic  circuits.  Numerous  studies  of  the  transmis¬ 
sion  through  hole  arrays  and  through  single  apertures 
surrounded  by  periodic  structures  have  been  reported  [2]. 
The  concentration  of  the  light  in  the  apertures  leads  to  an 
electric  field  enhancement  that  can  be  used  to  manipulate 
light-matter  interactions  and  enhance  nonlinear  phe¬ 
nomena  [3].  Surface  plasma  waves  (SPWs),  electromag¬ 
netic  waves  bound  to  each  metal-dielectric  interface  (air 
and  substrate),  contribute  to  the  enhanced  transmission 
through  metallic  hole  arrays.  This  association  is  often  clear 
from  simple  momentum  conservation  arguments  as  fea¬ 
tures  in  the  response  as  functions  of  angle  and  wavelength 
track  the  locations  expected  for  SPW  interactions,  but  the 
detailed  origin  of  the  enhanced  transmission  through  such 
arrays  is  still  under  investigation  [4-7]. 

Arrays  of  more  complex  unit  cells  offer  additional  de¬ 
grees  of  freedom  for  optimizing  the  optical  response.  Baida 
[8]  and  Moreau  [9]  proposed  that  an  array  of  metallic 
coaxial  structures  supporting  attenuated  guided  electro¬ 
magnetic  modes  would  have  substantially  greater  trans¬ 
mission  and  correspondingly  larger  field  enhancements 
than  simple  hole  arrays.  In  this  Letter,  we  present  the  first 
experimental  results  for  Au  coaxial  structures  on  a  fused 
silica  substrate  and  confirm  this  prediction.  With  an  open 
annulus  of  width  182  nm  and  center  radius  of  194  nm 
(outer  radius  376  nm)  on  a  1.03  yam  period  square  array 
(individual  open  area  of  0.33  yam2;  fractional  open  area  of 
31%),  79%  transmission  is  observed  at  photon  energy  of 
0.63  eV  (1.97  yam).  In  comparison,  a  hole  array  with  an 
outer  diameter  of  324  nm  (same  open  area)  shows  a  peak 
transmission  of  only  41%  at  the  larger  photon  energy  of 
0.74  eV  (1.68  yam)  corresponding  to  a  metal- substrate 
SPW  coupling  resonance.  This  very  high  transmission 
(  >  50%)  for  subwavelength  apertures  in  metal  films  sub¬ 
stantially  increases  the  attractiveness  of  these  materials  for 
many  applications.  The  enhanced  transmission  and  peak 
wavelength  shift  are  the  result  of  the  coupling  between  a 


PACS  numbers:  42.25.Bs,  42.79.Dj,  78.66.-w 

localized  mode  (specifically,  the  TEn  mode)  supported  by 
the  coaxial  unit  cell  and  the  distributed  surface  plasma 
polaritons  of  the  metal-dielectric  interfaces.  The  transmis¬ 
sion  peak  wavelength  increases  with  increases  of  the  co¬ 
axial  gap  and  increases  of  the  refractive  index  of  fluids  in 
the  coaxial  gap  area,  confirming  the  TEn  identification. 
The  periodic  array  provides  the  necessary  phase  matching 
to  allow  coupling  between  all  of  these  states  and  the 
radiation  continuum.  Fano  line  shapes,  extended  to  de¬ 
scribe  coupling  between  the  continuum  and  a  multiplicity 
of  discrete  states,  provide  a  good  description  of 
polarization-dependent  and  angle-dependent  spectra. 

We  used  a  process  based  on  interferometric  lithography 
to  fabricate  the  annular  coaxial  metal  grids.  Unlike  the 
more  common  electron-beam  lithography  approach,  there 
is  no  difficulty  in  extending  the  sample  area  for  interfero¬ 
metric  lithography,  which  is  ideal  for  making  large  areas  of 
periodic  structures  [10].  All  the  samples  reported  on  here 
were  uniformly  fabricated  over  2.5  X  2.5  cm2  areas  using 
a  self-aligned  process  requiring  only  a  single  lithography 
step  [11]. 

Three  metallic  annular  coaxial  grid  samples  and  one 
simple  hole-array  sample  were  fabricated,  all  with 
1.03  yam  pitch  and  a  metal  (Au)  thickness  of  100  nm. 
For  all  of  the  coaxial  samples,  the  inner-ring  dot  radius 
was  194  nm;  the  outer-ring  radii  were  298,  348,  and 
376  nm.  The  hole-array  sample  was  fabricated  with  a 
radius  of  324  nm.  A  scanning  electron  micrograph  of  the 
348  nm  coaxial  array  is  shown  in  the  inset  to  Fig.  1(a). 
Transmission  spectra  were  recorded  with  a  Fourier- 
transform  infrared  spectrometer  at  normal  incidence  with 
an  incoherent  light  source,  and  are  normalized  to  the  trans¬ 
mission  (  ~  92%)  of  a  bare  substrate.  The  refractive  index 
of  the  substrate,  evaluated  from  the  Fourier-transform  in¬ 
frared  transmission  curve,  is  1.47.  Figure  1(a)  shows  the 
transmission  spectra  with  unpolarized  incident  light, 
equivalent  to  linearly  polarized  light  for  this  symmetric 
geometry.  All  three  of  the  coaxial  array  structures  showed 
higher  peak  transmission  than  did  the  hole  array.  The  peak 
transmission  was  weakly  dependent  on  the  width  of  the 
coaxial  gap  region,  shifting  to  lower  energy  as  the  gap 
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FIG.  2.  Coaxial  peak  positions  as  a  function  of  refractive  index 
of  fluids  in  the  coaxial  space  (top  curve  for  348/194  nm  sample) 
and  as  a  function  of  the  outer  radius  for  the  three  experimental 
coaxial  samples  (bottom  curve). 


Wavelength  (pm) 


FIG.  1.  (a)  Measured  transmission  spectra  for  four  samples. 

Inset:  scanning  electron  microscopy  of  coaxial  array  with 
1.03  jmm  pitch,  348  nm  outer  and  194  nm  inner  radii, 
(b)  Angular  dependence  of  TM  transmission  for  348/194  nm 
coaxial  sample. 

width  was  increased.  As  shown  in  Fig.  2,  the  peak  trans¬ 
mission  wavelength  is  linearly  dependent  on  the  annulus 
outer  diameter.  Nearly  80%  peak  transmission  at  0.60  eV 
(2.05  jmm)  was  observed  for  the  largest  gap  width  of 
182  nm.  In  contrast,  the  simple  hole  array  showed  a  peak 
transmission  of  only  41%  for  the  same  open  area.  The 
coaxial  transmission  peaks  clearly  reflect  the  coaxial  ge¬ 
ometry  as  well  as  the  matching  condition  for  coupling  to 
SPWs  based  on  the  periodicity,  with  the  peak  transmission 
occurring  at  a  lower  energy  than  the  SPW  resonance.  In 
contrast,  the  hole-array  transmission  peaks  are  coincident 
with  the  resonance  predicted  from  the  SPW  dispersion 
relation.  This  gives  additional  degrees  of  flexibility  for 
designing  structures;  the  higher  transmission  through 
smaller  holes  also  gives  a  stronger  field  amplification  that 
may  be  useful  for  nonlinear-optics  applications.  The  small 
feature  on  the  spectra  around  0.46  eV  (2.7  jmm)  is  instru¬ 


mental,  due  to  atmospheric  C02  absorption  in  the  mea¬ 
surement  light  path. 

At  normal  incidence,  there  is  no  polarization  depen¬ 
dence  for  this  square  symmetric  pattern.  However,  off 
normal  incidence,  there  is  a  strong  difference  between 
TM  polarization,  with  an  angle-dependent  coupling  to 
the  various  metal- substrate  and  metal-air  SPW  orders, 
and  TE  polarization  for  which  the  energies  of  the  SPW 
coupling  are  independent  of  angle.  Figure  1(b)  shows 
several  transmission  spectra  for  TM  polarization.  Spectra 
were  obtained  for  angles  of  incidence  between  0°  and  60° 
at  5°  intervals.  Because  of  limitations  of  the  available  wire- 
grid  polarizer,  results  are  only  presented  for  energies  less 
than  0.9  eV  (A  >1.4  jmm).  As  the  tilt  angle  is  increased,  the 
peak  transmission  shifts  to  lower  energies  and  additional 
features  resulting  from  SPW  coupling  (peaks)  and  Wood’s 
anomalies  (dips  at  the  diffraction-order  horizons)  appear 
and  progress  to  lower  energies. 

More  insight  into  the  origins  of  the  enhanced  transmis¬ 
sion  is  obtained  by  measuring  the  transmission  resonance 
as  a  function  of  the  refractive  index  of  the  medium  inside 
the  gap.  A  series  of  measurements  was  carried  out  with 
different  index  matching  fluids  by  sandwiching  the 
348/194  nm  annular  metal  structure  between  two  quartz 
plates.  The  peak  transmission  wavelength  is  proportional 
to  the  refractive  index  of  the  medium  in  the  gap  as  shown  in 
Fig.  2. 

Two  complementary  approaches  to  modeling  the  results 
were  pursued.  Rigorous  coupled  wave  analysis  (RCWA)  is 
used  to  obtain  a  detailed  simulation.  Because  of  the  limi¬ 
tations  of  available  computational  capabilities,  the  RCWA 
code  was  only  applied  to  normal-incidence  geometries 
where  symmetry  significantly  reduced  the  computational 
difficulty.  A  more  intuitive,  phenomenological,  Fano  line 
shape  approach  is  used  to  extract  an  understanding  of  the 
multiple  resonances  and  couplings  involved  in  the  problem 
[7,12].  This  approach  successfully  demonstrates  that  the 
transmission  is  due  to  coherent  interactions  between  SPW 
modes  supported  by  the  metal  film  and  a  localized  reso- 
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nance,  corresponding  to  the  TEn  mode  of  the  coaxial 
structures. 

Normal-incidence,  RCWA  simulations  [13]  were  carried 
out  for  all  of  the  samples.  In  order  to  obtain  convergence 
with  a  reasonable  number  of  diffraction  orders,  the  simu¬ 
lation  was  carried  out  for  a  square  coaxial  case  keeping  the 
same  open  area  as  the  annular  experimental  geometry.  The 
comparison  of  experiment  and  simulation  for  the  high 
transmission  case  (376/194  nm)  is  shown  in  Fig.  3(a).  The 
simulation  intensity  peak  was  a  bit  higher  than  the  mea¬ 
surement,  but  overall  the  experimental  results  are  very  well 
matched,  even  reproducing  the  fine  structure  around  0.8  eV, 
which  corresponds  to  coupling  to  the  metal-glass  SPW. 

The  analysis  based  on  Fano  line  shapes,  describing  the 
interference  between  direct  scattering  into  transmitted  pho¬ 
tons  and  coupling  via  discrete  states  such  as  the  metal-air 
and  the  metal-glass  SPWs  as  well  as  coaxial  resonances, 
provides  additional  insight.  The  Fano  line  shape  [14]  ex¬ 
tended  to  a  number  of  discrete  states  is  expressed  as 
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FIG.  3.  (a)  Experiment,  RCWA,  and  Fano  fit  simulations  for 

the  376/194  nm  coaxial  sample  at  normal  incidence,  offset  for 
clarity,  (b)  Experimental  transmission  and  Fano  fit  for 
348/194  nm  coaxial  sample  for  TM  polarization  at  an  incident 
angle  of  45°.  The  resonance  positions  for  the  Fano  fit  are 
indicated. 


where  cov  is  the  resonant  state  frequency,  Tv  is  the  line- 
width  (FWHM)  and  qv  is  the  Breit-Wigner-Fano  [15] 
coupling  coefficient  for  the  vi\\  discrete  state.  From 
Eq.  (1),  for  real  qv ,  the  Fano  curves  necessarily  extend  to 
zero  intensity.  While  the  corresponding  experimental  fea¬ 
tures  are  not  as  pronounced,  due  to  structural  variations  and 
bandwidth  limitations,  the  asymmetric  line  shapes  fit  the 
experimental  spectra  very  well,  particularly  away  from 
normal  incidence.  The  Fano  fit  for  the  376/194  nm  coaxial 
structure  is  shown  in  Fig.  3(a)  along  with  the  RCWA 
calculation  and  the  experimental  data.  Overall  the  agree¬ 
ment  is  quite  good,  with  the  exception  of  the  zero  trans¬ 
mission  discrepancy  noted  above.  As  a  result  of  this 
difference,  there  is  significant  uncertainty  in  the  fit  around 
the  SPW  resonance  for  the  normal-incidence  case.  At 
higher  incident  angles,  the  Fano  fit  is  much  better  defined, 
the  result  for  45°  incidence  TM  transmission  spectrum, 
Fig.  3(b),  shows  excellent  agreement  between  this  phe¬ 
nomenological  model  and  the  experiment  (with  some  de¬ 
viations  at  the  highest  energies  where  the  experimental 
peaks  are  least  well-defined). 

By  carrying  out  this  fitting  procedure  for  all  of  the 
experimental  spectra,  the  dispersion  of  the  various  discrete 
resonances  contributing  to  the  Fano  line  shape  can  be 
mapped  out.  These  dispersion  lines  strongly  suggest  cou¬ 
pling  [4,16,17]  between  the  localized  coaxial  mode  and  the 
SPW  modes  on  both  the  metal-substrate  and  metal-air 
interfaces.  The  energy  eigenmode  matrix  can  be  written  as 


/ Ea 

0 

0\ 

/  ^coax 

*12 

*13  \ 

° 

Eb 

0 

|=diag  k\2 

^SPl 

0  ), 

(2) 

V  0 

0 

Ec) 

V  *13 

0 

^SP2  / 

where  Kn  and  /c13  are  coupling  coefficients  between  the 
localized  coaxial  mode  and  the  SPW  modes,  Ea  b  c  are  the 
Fano  fit  resonant  states,  ECOSiX  is  the  localized  coaxial  mode, 
and  £SP  are  the  SPW  modes,  which  are  given  approxi¬ 
mately  be  a  kinetic  model  (for  a  continuous  metal  film) 


m A  =  \(rq  /  SdS,n  ±  sin^A  (3) 

V  ^ed  +  em  ) 

where  A  is  the  pitch  of  the  sample  and  sd ,  sm  are  the 
dielectric  constants  of  the  metal  and  dielectric  (air  and 
glass),  respectively.  For  each  angle,  we  can  solve  for  the 
unknowns  in  Eq.  (2),  namely  Ec oax,  /c12,  and  ac13. 
Physically,  ku  and  /c13  should  be  relatively  constant  as 
the  incident  angle  is  varied;  optimal  results  were  obtained 
for  |/c12|  =  0.039  and  |/c13|  =  0.014.  The  coupling  be¬ 
tween  the  coaxial  mode  and  the  metal- substrate  SPW  is 
stronger  than  the  coupling  between  the  coaxial  mode  and 
the  metal-air  SPW.  These  fixed  coupling  strength  numbers 
were  used  to  evaluate  the  energy  for  the  localized  coaxial 
mode,  as  shown  in  Fig.  4(b)  (hollow  symbols).  The  average 
localized  energy  state  from  this  coupling  optimization  is 
0.588  eV,  with  less  than  a  2%  variation  across  the  fitted 
angular  range. 
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FIG.  4.  (a)  Peak  positions  from  experimental  spectra  (gray) 

and  from  Fano  fits  (black).  At  low  angles  (0°  and  5°)  the  features 
corresponding  to  the  metal-glass  SPW  are  not  simple  peaks  and 
the  entire  region  is  denoted  by  the  shaded  rectangular  bars, 
(b)  Spectral  resonances  extracted  from  Fano  fits  (black  symbols), 
SPW  energies  (dashed  lines),  and  the  localized  coaxial  energy 
from  perturbation  calculations  (hollow  symbols). 

Recently,  Baida  et  al.  reported  [18]  extensive  modeling 
of  the  enhanced  transmission  through  coaxial  arrays  and 
concluded  that  the  dominant  transmission  was  due  to  the 
cutoff  of  the  TEn  coaxial  mode  given,  for  a  perfectly 
conducting  metal,  by 

ACutoff,TEu  ~  77 -n{a  +  b)  (4) 

where  n  is  the  refractive  index  of  the  medium  filling  the 
coaxial  structure,  and  a  and  b  are  the  inner  and  outer  radii, 
respectively.  The  linear  variation  of  the  resonance  wave¬ 
length  as  a  function  of  both  the  outer  radius  b  and  the 
refractive  index  of  the  index  matching  fluid  supports  this 
identification  (Fig.  2).  Note  that  initially  [8,9],  it  was 
proposed  that  the  TEM0o  mode,  with  no  cutoff  frequency, 
might  play  a  large  role  in  the  enhanced  transmission.  By 
simple  symmetry  considerations,  there  is  no  coupling  be¬ 
tween  the  antisymmetric  radial  electric  field  of  the  TEM0o 


mode  and  a  uniform  plane  wave  excitation  and,  therefore, 
the  TEMqo  mode  plays  no  role  in  the  enhanced 
transmission. 

In  conclusion,  we  demonstrated  the  use  of  interferomet¬ 
ric  lithography  to  make  large  area  annular  coaxial  metal 
arrays.  Enhanced  transmission  compared  to  hole  arrays 
was  shown  experimentally  and  theoretically  with  a  peak 
transmission  of  80%  realized.  Coupling  between  multiple 
localized  guided  modes,  specifically  the  TEU  of  the  coax 
and  the  SPW  resonances  of  the  metal  sheet  and  the  radia¬ 
tion  continuum  was  demonstrated.  Further  studies  are 
underway  to  explore  the  applications  of  these  nanopho¬ 
tonic  structures  in  nonlinear-optics  and  infrared  sensing. 

This  work  was  supported  by  U.S.  Army  Research  Office 
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High-resolution  scanning  Hall  probe  microscopy  has  been  used  to  image  vortex  configurations  in  very 
large  periodic  arrays  of  artificial  pinning  sites.  Strong  matching  effects  are  seen  at  fields  where  either 
one  or  two  vortices  can  sit  at  a  site;  with  three  vortices  per  site,  however,  no  clear  matching  is  observed. 

Matching  effects  have  also  been  observed  at  several  fractional  multiples  of  the  matching  held,  including 
1/5,  1/4,  1/3,  1/2,  and  3/4.  These  fractional  values  are  characterized  by  striking  domain  structure  and 
grain  boundaries. 
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The  behavior  of  superconducting  vortices  in  the  pres¬ 
ence  of  a  periodic  array  of  holes  reveals  rich  and  unex¬ 
pected  static  and  dynamic  phenomena.  In  such  arrays,  flux 
can  be  quantized  either  “in”  the  holes  (as  a  circulating  cur¬ 
rent  around  their  peripheries),  or  in  the  interstitial  regions 
between  the  holes  as  ordinary  Abrikosov  vortices.  At  low 
fields,  the  holes  act  as  very  strong  pinning  centers  for  flux, 
and  it  is  thus  possible  for  holes  to  contain  multiquantum 
vortices.  However,  the  energy  of  such  multiquantum  vor¬ 
tices  increases  rapidly  with  the  number  of  quanta  n  [1]. 
This  leads  to  a  saturation  number  ns ,  beyond  which  it  is 
energetically  favorable  for  any  additional  flux  to  enter  the 
interstitial  regions  between  the  holes  as  Abrikosov  vor¬ 
tices.  Transport  [2-4]  and  magnetization  [5-9]  studies 
show  distinctive  features  at  matching  fields  where  the  vor¬ 
tex  structure  is  commensurate  with  the  hole  array.  Interest¬ 
ing  configurations  also  arise  at  fractional  matching  fields, 
where  the  occupation  number  of  each  hole  or  the  number 
of  interstitial  vortices  forms  a  superstructure  locked  to  the 
basic  hole  array  [5,9]. 

Although  magnetization  and  transport  studies  have  elu¬ 
cidated  much  of  the  basic  phenomenology  of  the  array/ 
vortex  system,  they  can  measure  only  its  global  properties 
and  cannot  deduce  details  of  the  local  configurational  vor¬ 
tex  state.  Recently,  Lorentz  [10]  and  scanning  Hall  probe 
[11]  microscopies  have  been  used  to  image  vortex  configu¬ 
rations,  but  only  over  relatively  small  areas  of  —10  X  10 
pinning  sites.  In  this  Letter,  we  present  large-area  scan¬ 
ning  Hall  probe  microscope  (SHM)  images  of  vortex  con¬ 
figurations  in  arrays  containing  — 106  holes.  Our  images 
span  some  5000  holes,  and  so  yield  important  information 
on  the  large-scale  structure  of  the  vortex  configurations. 
These  studies  reveal  striking  multiquantum  and  interstitial 
vortex  patterns  in  a  square-periodic  hole  array.  At  frac¬ 
tional  matching  fields,  we  resolve  distinctive  domains  of 
phase-slip  related  vortex  superstructures  that  are  separated 
by  domain  walls  with  characteristic  internal  structure. 
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The  Hall  probe’s  active  area  coupled  with  its  scan  height 
above  the  surface  yield  an  effective  FWHM  resolution  of 
3.4  jutm.  The  sample  investigated  was  a  100-nm- thick  nio¬ 
bium  film  with  0.3-/zm-diameter  holes  on  a  square  lattice. 
The  lattice  constant  a  was  1.870  jmm,  yielding  an  expected 
first  matching  field  Hm  =  d>0 1  a1  =  5.913  G.  The  array 
was  produced  using  a  lithographic  technique  based  on  the 
interference  of  light  [8],  which  yields  very  large  and  uni¬ 
form  arrays  (here  —2  mm  X  3  mm).  Our  data  were  taken 
far  from  the  edges  of  the  array.  Takezawa  and  Fukushima 
[12]  have  calculated  the  pinning  energy  of  a  hole  vor¬ 
tex  relative  to  one  in  the  bulk.  For  parameters  similar  to 
ours,  they  find  a  very  deep  pinning  potential  of  —100  eV. 
Comparing  this  to  typical  bulk  Nb  pinning  energies  of 
—50  meV  [13],  it  is  clear  that  pinning  from  the  hole  ar¬ 
ray  dominates  pinning  from  defects  in  the  Nb  film. 

SHM  images  near  several  matching  fields  are  shown  in 
Fig.  1.  For  each  image,  the  samples  were  field  cooled  from 
above  Tc  =  8.37  K  to  a  base  temperature  below  3  K  where 
the  images  were  taken.  The  top  row  of  Fig.  1  shows  the 
progression  in  the  vortex  configurations  as  the  applied  field 
H  is  varied  around  H  =  H/Hm  =0.  At  H  —  0  we  see 
several  isolated  vortices  (dark  spots).  The  actual  vortex 
diameter  is  much  smaller  than  these  spots,  whose  size  is 
determined  by  the  Hall  probe  resolution.  A  somewhat 
darker  spot,  just  to  the  right  and  below  center,  appears 
in  all  scans  and  is  presumably  associated  with  a  physical 
defect  in  the  hole  lattice,  which  allows  extra  flux  to  sit  at 
that  point.  As  the  field  is  increased  from  zero,  vortices 
begin  to  enter  the  sample  in  larger  numbers,  as  seen  in 
the  H  =  0.075  and  0.15  images.  A  similar  progression 
is  observed  for  negative  field  increments  away  from  zero, 
as  shown  at  H  =  —0.075  and  —0.15.  Here  the  vortices 
appear  as  light  spots,  indicating  that  they  are  oppositely 
directed  to  those  seen  in  positive  fields. 

As  we  continue  to  increase  the  field  towards  what  we 
will  be  able  to  identify  as  the  first  matching  field  Hm 
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FIG.  1.  SHM  images  obtained  at  applied  magnetic  fields  H 
near  the  matching  fields  H  =  H/Hm  =  1,2,  and  3.  The  scans 
are  124  /mm  X  138  /mm,  contain  some  5000  holes,  and  have  a 
full  scale  of  0.73  G.  Note_the  striking  similarity  between  the 
vortex  configurations  near  H  =  1  and  2  and  those  near  H  =  0. 
The  smooth  backgrounds  at  H  =  1_(2)  consist  of  holes  uni¬ 
formly  filled  with  1  (2)  vortices.  At  H  =  3,  however,  the  vortex 
configuration  is  highly  disordered  because  of  weakly  pinned  in¬ 
terstitials  competing  with  vortices  in  holes.  The  dark  black  spot 
is  a  local  defect  in  the  sample. 

(H  =  1),  we  begin  to  see  a  very  remarkable  progression 
(Fig.  1,  second  row).  For  instance,  at  an  applied  field  of 
H  =  0.85  =  1  —  0.15,  the  image  looks  statistically  iden¬ 
tical  to  the  image  at  H  =  —0.15.  And,  remarkably,  the 
image  at  H  =  1  is  essentially  indistinguishable  from  that 
at  H  =  0.  It  is  important  to  note  here  that  in  each  image 
the  average  value  of  the  field  has  been  subtracted  out.  This 
is  why  the  H  =  1  image  at  about  6  G  and  the  H  =  0  im¬ 
age  appear  to  have  about  the  same  overall  gray  level.  We 
can  then  understand  the  appearance  of  the  H  =  1  scan  as 
follows.  The  field  increment  from  an  exactly  H  =  0  im¬ 
age  (no  spots)  to  the  next  no-spot  image  is  5.929  G,  very 
close  to  the  first  matching  field  of  5.913  G  deduced  from 
the  lattice  geometry.  Thus,  we  identify  this  later  spot-free 
image  at  5.929  G  with  the  first  matching  field  or  H  =  1. 
The  smooth  gray  background  near  H  =  1  can  then  be  ex¬ 
plained  in  the  following  way.  Exactly  at  H  =  1,  each  hole 
contains  exactly  one  vortex,  leading  to  a  dense  and  uni¬ 
form  configuration  of  vortices.  The  spatial  resolution  of 
our  Hall  probe  is  not  good  enough  to  image  individual 
vortices  when  they  are  this  close  together  and  perfectly 
ordered  [14],  so  this  regular  array  of  vortices  appears  as 
a  smooth  gray  background.  The  few  black  spots  in  the 
H  ~  1  image  are  thus  (easily  visible)  extra  vortices. 

The  third  row  of  Fig.  1  shows  the  vortex  configurations 
as  the  field  is  further  increased  through  the  second  match¬ 
ing  field  ( H  =  2).  A  very  similar  progression  is  observed. 
At  H  ~  2,  there  is  again  a  smooth  background  populated 
by  a  few  black  spots.  Again,  we  interpret  this  background 
as  the  highly  ordered  state  with  now  two  vortices  sitting  in 
each  hole;  black  spots  for  H  >  2  are  again  extra  vortices, 
and  white  spots  for  H  <  2  now  represent  a  hole  with  only 


one  vortex.  At  the  third  matching  field  (H  =  3),  however, 
the  appearance  is  radically  different  (Fig.  1,  bottom  right). 
Instead  of  the  smooth  gray  background  we  would  expect  if 
three  vortices  sat  in  each  hole,  we  see  a  rather  muddled  im¬ 
age  with  no  discernible  structure.  We  believe  this  different 
appearance  results  from  the  sudden  presence  of  interstitial 
vortices  above  H  =  2. 

To  probe  this  issue  more  directly,  we  have  taken  close-up 
scans  just  below  and  above  H  =  2.  Figure  2(a)  shows 
a  25  pi m  X  28  pi m  scan  at  a  field  H  =  2  —  0.084,  and 
Fig.  2(b)  shows  H  =  2  +  0.084.  Also  shown  is  the  least- 
squares-fit  positions  of  the  holes  as  determined  from  an 
image  taken  at  H  =  1/2  where  the  hole  positions  are  un¬ 
ambiguous  [see,  e.g.,  Fig.  3(b)].  We  also  have  used  an 
absolute  position  sensor  [15]  mounted  on  the  scanner  head 
to  compensate  for  any  possible  drifts  between  images.  In 
Fig.  2(a)  each  white  spot  (i.e.,  a  “missing”  vortex)  clearly 
sits  directly  on  a  hole,  indicating  as  well  that  all  vortices 
also  sit  on  the  holes.  However,  in  Fig.  2(b)  the  black  spots, 
which  are  now  extra  vortices,  sit  variously  on  holes  or  on 
interstitial  sites. 

Large  holes  “spread  out”  the  fields  of  the  hole  vortices, 
lowering  their  energies  and  leading  to  stronger  pinning. 
Thus  the  maximum  number  of  vortices  ns  that  can  oc¬ 
cupy  a  hole  depends  on  the  hole  radius  R  [1,16].  With  the 
sputtering/lift-off  lithography  used  here  [8],  there  will  be 
inevitable  fluctuations  in  the  hole  diameters,  perhaps  yield¬ 
ing  ns  =  2  for  some  holes  and  ns  =  3  for  others.  Vortices 
nucleating  near  ns  =  2  holes  would  end  up  as  interstitials, 
while  those  near  ns  =  3  sites  could  sit  in  a  hole.  This 
random-appearing  admixture  of  vortices  on  holes  and  in 
interstitial  sites  would,  by  H  =  3,  lead  to  the  very  disor¬ 
dered  configuration  observed  at  that  field  (Fig.  1). 

It  is  also  possible  to  have  matching  effects  at  noninteger 
multiples  of  Hm.  Indeed,  distinct  features  at  H  =  1/4, 
1/2,  and  possibly  1/5  (or  3/16),  1/8,  and  1/16  have 
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FIG.  2.  Vortex_configurations  (a)  just  below  ( H  =  1.916)  and 
(b)  just  above  (H  =  2.084)  the  second  matching  fieldAThe  small 
dark  circles  are  the  positions  of  the  holes.  Below  H  =  2,  the 
vacancies  (white  spots)  all  sit  directly  on  holes;  thus  all  vortices 
must  as  well.  Above  H  =  2,  the  extra  vortices  (dark  spots) 
sit  on  both  hole  and  interstitial  sites.  25  pum  X  27  pi m  field 
is  shown. 
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FIG.  3.  (a)  122  jum  X  138  jmm  image  of  the  vortex  configu¬ 

ration  at  H  =  1/2.  There  are  large  areas  of  vortices  arranged 
in  a  checkerboard  pattern;  domains  [outlined  in  (d)]  of  opposite 
checkerboard  polarity  are  separated  by  stripelike  grain  bound¬ 
aries.  A  close-up  of  this  configuration  is  shown  in  (b)  and 
schematically  in  (c). 

been  observed  in  magnetization  studies  [5,17],  and  Harada 
et  al.  [10]  have  imaged  noninteger  matching  at  1/4,  1/2, 
3/2,  and  5/2.  In  our  imaging  experiments,  we  also  ob¬ 
serve  matching  effects  at  several  of  these  fractional  fields, 
as  well  as  some  not  previously  observed.  We  find  that  vor¬ 
tex  configurations  at  fractional  matching  fields  are  charac¬ 
terized  by  striking  domain  structure  and  associated  grain 
boundaries. 

By  far  the  strongest  submatching  effects  occur  at  H  = 
1/2.  Figure  3(a)  shows  that  the  configuration  consists  of 
large  areas  of  well-matched  vortices  separated  by  curious 
stripelike  boundaries.  The  close-up  in  Fig.  3(b)  reveals 
that  the  smooth  regions  consist  of  vortices  occupying  ev¬ 
ery  other  hole  in  a  checkerboardlike  fashion.  Regions  of 
different  “polarity” — with  the  vortices  occupying  either 
the  “red”  or  “black”  squares  of  the  checkerboard — are 
separated  by  striped  grain  boundaries  [18].  Figure  3(c) 
schematically  shows  how  a  polarity  shift  results  in  rows 
or  columns  of  alternating  pairs  of  vortices  and  vacancies, 
which  appear  in  the  images  as  the  striking  boundary  fea¬ 
tures.  At  the  junctions  between  north-south  and  east-west 
boundaries,  there  is  always  a  bright  or  dark  spot,  whose 
origin  is  again  clear  from  Fig.  3(c).  From  considerations 
of  the  boundaries,  we  have  mapped  out  the  domains  of 
differing  polarity  [inset,  Fig.  3(a)].  One  curious  feature 
of  these  grain  boundaries  is  that  they  run  predominately 
north-south,  indicating  a  breaking  of  the  square  symmetry 
of  the  lattice  in  these  two  directions.  Our  images  demon¬ 
strate  that  the  simple  domain  wall  structure  seen  on  small 
scales  [11]  yields  unexpectedly  complex  patterns  on  large 
scales. 

Weaker  but  still  striking  matching  effects  occur  at  H  = 
1/4  as  well  as  its  complimentary  [19]  field  of  H  =  1  — 
1/4  =  3/4  (Fig.  4).  Zooming  in  on  the  boundary  of  such  a 
patch  [outlined  in  Figs.  4(a)  and  4(b)]  allows  us  to  explore 


FI0  4.  122  /xm  X  136  /xm  images  of  vortex  configurations 

at  H  =  1/4  (a)  and  3/4  (b).  Selected  regions  are  expanded  in 
(c)  and  (d).  The  configurations  consist  of  relatively  small  well- 
ordered  regions  surrounded  by  complex  disordered  boundaries. 
Also  shown  schematically  are  the  inferred  vortex  arrangements 
in  the  ordered  regions.  Light  circles  represent  empty  holes,  and 
dark  circles  occupied  holes. 

the  vortex  configuration  in  detail  [Figs.  4(c)  and  4(d)].  The 
vortex  configuration  in  the  ordered  regions  is  shown  to 
scale  schematically  in  Fig.  4.  Notice  that  the  configura¬ 
tion  may  be  viewed  in  terms  of  alternating  empty  and  half- 
filled  rows  [10,20];  this  again  indicates  the  presence  of  a 
symmetry-breaking  field  which  selects  this  particular  grain 
orientation.  One  possible  source  of  this  asymmetry  could 
be  a  small  difference  in  the  interhole  spacing  a  in  the 
horizontal  and  vertical  directions.  However,  no  such  asym¬ 
metry  is  visible  at  the  —0.1%  accuracy  of  our  optical 
diffraction  measurements  of  a. 

The  origin  of  the  intricate  vortex  patterns  at  submatching 
fields  can  be  understood  from  their  configurational  ener¬ 
gies.  In  our  samples,  the  vortex  spacing  is  large  compared 
to  the  penetration  depth  and  the  energy  of  a  configura¬ 
tion  can  be  estimated  from  nearest-neighbor  interactions 
only.  At  1  / 2  filling,  the  observed  vertical  and  horizontal 
domain  walls  contain  nearest  neighbors  on  half  the  lattice 
sites,  whereas  a  45°  boundary,  for  example,  would  have 
nearest  neighbors  on  all  the  lattice  sites  along  the  bound¬ 
ary  and  thus  represent  a  much  higher  energy.  The  energy 
of  the  domain  wall  is  considerably  higher  than  that  of  the 
ordered  phases  where  neighboring  lattice  sites  are  never 
occupied.  This  favors  few  boundaries  and  large  ordered 
regions,  as  observed.  In  contrast,  at  1/4  filling  the  vortices 
are  substantially  farther  apart,  lowering  their  repulsive  en¬ 
ergy.  The  energy  differences  between  domain  walls  and 
ordered  phases  are  less  pronounced,  and  the  possible  do¬ 
main  wall  configurations  are  also  closer  in  energy.  There¬ 
fore  more  domain  wall  configurations  appear  with  greater 
frequency  than  at  1/2. 
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FIG.  5.  122  jam  X  137  jam  images  of  vortex  configurations 

at  several  fractional  values  of  H.  Also  shown  is  the  “irrational” 


field  0  =  (V 5  -  l)/2. 


Intuitive  analysis  is  less  obvious  at  other  fractions 
(Fig.  5),  but  one  can  turn  to  simulations  for  guidance.  At 
1/3  filling,  Reichhardt  and  Gr0nbech- Jensen  [21]  find  a 
disordered  state  containing  many  filled  45°  diagonal  rows 
separated  by  two  empty  diagonals,  as  seen  in  Fig.  5(b). 
An  ordered  structure  of  this  type  has  also  been  predicted 
by  Watson  [20]  for  a  “zero  range”  repulsive  potential 
and  by  Teitel  and  Jayaprakash  [22]  in  superconduct¬ 
ing  wire  networks.  At  1/5  filling,  filled  diagonals  at 
arctan(l/2)  =  26.6°  separated  by  four  empty  diagonals 
are  the  lowest  energy  configuration  [5,20,21].  Again, 
since  the  vortex  density  is  low,  other  configurations  have 
only  slightly  higher  energies  and  occur  frequently.  The 
same  kinds  of  imperfect  diagonal  structures  of  filled  and 
empty  rows  can  be  seen_at  2/5  filling  [Fig.  5(c)],  and  at 
the  irrational  field  of  (y/5  —  l)/2  ~  0.618  [Fig.  5(d)],  in¬ 
dicating  their  relatively  low  configurational  energies.  The 
frequent  occurrence  of  configurational  disorder  obscures 
any  long-range  distinction  between  rational  and  irrational 
filling  in  Fig.  5. 

Vortex  configurations  in  square-periodic  hole  arrays 
thus  reveal  remarkably  complex  patterns  reflecting  the 
interplay  between  the  pinning  energy  of  the  hole  array  and 
the  interaction  energy  of  the  multiquantum  and  interstitial 
flux  structures.  Our  images  show  clear  multiquantum  oc¬ 
cupation  of  the  holes  up  to  saturation,  followed  by  the  ap¬ 
pearance  of  interstitial  Abrikosov  vortices  at  higher  fields. 
At  fractional  filling  factors,  domains  of  well-ordered 
vortices  are  separated  by  a  network  of  domain  boundaries 
displaying  remarkably  simple  order  for  half  filling  and 


increasing  complexity  at  other  fillings.  There  appears  to 
be  a  nontrivial  set  of  ordered  fractional  states,  a  complete 
inventory  of  which  awaits  a  general  theory  correctly 
incorporating  the  several  competing  interactions  present. 
Similar  to  other  collective  vortex  phenomena,  these 
competing  ground  states  can  be  mapped  onto  a  variety  of 
statistical  models  that  describe  the  basic  phenomena  of 
many  other  condensed  matter  systems  [23]. 
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We  demonstrate  that  a  square  lattice  of  artificial  pinning  centers  in  a  superconducting  Nb  film  induces  the 
formation  of  highly  ordered  interstitial  vortex  phases  with  different  symmetries  for  external  magnetic  fields  as 
high  as  the  eighth  matching  field.  These  “supermatching”  phases  are  identified  by  distinct  differences  in  the 
behavior  of  their  critical  currents,  magnetoresistivity,  and  magnetization.  Our  results  are  consistent  with  pre¬ 
dictions  of  supermatching  lattice  symmetries  by  recent  numerical  simulations.  [SO  163- 1829(99)5 1142-9] 


It  is  well  known  that  vortex-vortex  interactions  in  the 
mixed  state  of  a  type  II  superconductor  generally  lead  to  the 
formation  of  a  triangular  flux-line  lattice  (FLL)  that  is  dis¬ 
torted  in  the  presence  of  random  pinning.  The  introduction  of 
a  regular  lattice  of  artificial  pinning  centers  (APC’s)  into  a 
superconducting  thin  film  provides  additional  strong  pinning 
interactions  that  induce  prominent  anomalies  in  the  tempera¬ 
ture  and  magnetic-field  dependences  of  the  magnetization, 
electrical  resistance,  and  critical  current  7C.1-5  The  anoma¬ 
lies  are  particularly  pronounced  at  “matching  fields”  Hn, 
where  the  density  of  flux  lines  (FL’s)  coincides  wit  the  den¬ 
sity  of  the  APC,  and  essentially  all  of  the  magnetic  field 
inside  the  film  resides  in  quantized  fluxoids  centered  on  the 
APC.  However,  there  is  a  maximum  number  ns  =  D/4£;(T)  of 
flux  quanta  [where  D  is  the  APC  characteristic  size  and  £(  T) 
is  a  temperature-dependent  coherence  length]  that  can  be 
trapped  by  the  APC.6  Further  increases  in  the  applied  field 
above  Hn(ns)  lead  to  the  formation  of  Abrikosov  vortices 
within  the  interstices  of  the  APC.  Although  these  “intersti¬ 
tial  FF’s”  (IFF’s)  are  relatively  mobile,  it  has  been  discov¬ 
ered  recently  that  they  can  organize  into  stable,  “super¬ 
matching”  flux-line  lattices  (SF’s).7,8 

The  SF  symmetry  in  the  presence  of  a  square  lattice  of 
antidots  (circular  holes)  was  studied  in  numerical  simulations 
by  Reichhardt  et  al.9  for  the  case  ns  =  1.  They  predict  that  the 
interstitial  FF  undergoes  a  transition  from  a  lattice  of  corner¬ 
sharing  squares  to  one  of  triangular  coordination  when  the 
number  of  interstitial  vortices  changes  from  2  to  3.  This  tran¬ 
sition  was  directly  observed  in  a  thin,  perforated  Nb  foil  by 
Forentz  microscopy.7  The  existence  of  these  SF  was  also 
inferred,  and  their  relative  stability  studied,  in  ac  magnetiza¬ 
tion  measurements  of  an  amorphous  W0  67Ge0  33  film  with  a 
square  lattice  of  antidots  at  various  ac  drive  levels  and  angu¬ 
lar  orientations  of  the  film  plane  with  respect  to  the  applied 
dc  field.8 

PRB  60 


However,  up  to  now,  the  existence  and  symmetry  of  the 
SF  for  higher  fields  has  remained  uncertain.  Calculations9 
show  that  for  the  fifth  matching  field  (MF)  the  IFF  again 
forms  a  square  SF,  and  that  low-symmetry  SF  states  are 
formed  at  the  sixth  and  seventh  MF.  A  transition  to  a  trian¬ 
gular  SF  at  the  eighth  MF  was  also  predicted.  In  this  paper, 
we  show  that  highly  ordered  SF  at  the  fifth  and  eighth  MF 
are  clearly  visible  in  magnetization  and  transport  measure¬ 
ments  on  thin  Nb  film  with  a  square  antidot  lattice,  and  that 
formation  of  low-symmetry  SF  at  the  sixth  and  seventh  MF 
leads  to  a  broad  maximum  in  IC(H )  which  probably  reflects 
a  smooth  transition  between  the  SF  for  n  =  6  and  7. 

Vortex  pinning  by  a  lattice  of  APC  competes  with  a  back¬ 
ground  of  the  usual  mechanisms  of  pinning,  including  thick¬ 
ness  modulations,  nonsuperconducting  inclusions,  grain 
boundaries,  surface  roughness,  etc.  In  order  to  clearly  ob¬ 
serve  matching  phenomena,  pinning  by  the  APC  should  be 
stronger  than  other  pinning  interactions.  This  imposes  strict 
requirements  on  the  quality  of  sample  films  with  APC.  In 
order  to  dominate  the  pinning,  the  APC  must  have  a  size 
comparable  to  the  superconducting  coherence  length  £(T) 
=  £0/(l  —  T/Tc)m,  which  is  estimated  to  be  smaller  than 
100  nm  for  Nb  at  temperature  relevant  to  this  study.  This 
small  size  scale  restricts  the  techniques  that  can  be  used  to 
fabricate  such  pinning  centers.  To  date,  electron-beam  li¬ 
thography  has  been  used  nearly  exclusively  to  perform  this 
task.1-5  The  main  drawback  of  this  technique  is  that  it  is 
expensive  and  too  slow  to  pattern  samples  having  convenient 
dimensions  of  order  0.1 -1.0  mm.  An  effective,  alternate 
method  for  producing  large-area  lattices  of  AP  in  supercon¬ 
ducting  thin  films  is  based  on  the  use  of  laser  interferometric 
lithography  (IF).10-15  IF  and  photoresist  liftoff  were  em¬ 
ployed  to  produce  a  lattice  of  antidots  of  period  d  =  1  jmm 
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and  diameter  D^0.3  jmm  imbedded  in  a  1000  A  Nb  film 
deposited  by  magnetron  sputtering  on  a  patterned  Si/Si02 
substrate.  Details  of  the  patterning  and  thin-film  preparation 
can  be  found  elsewhere.16  The  film  used  in  the  present  study 
had  a  relatively  low  superconducting  transition  temperature 
Tc~6.7  K  due  to  a  residual  oxygen  pressure  in  the  growth 
chamber.17 

Transport  measurements  were  accomplished  by  attaching 
Au  wires  directly  to  the  Nb  film  using  a  silver  epoxy  and 
postbaking,  ac  susceptibility  and  transport  measurements 
were  performed  using  a  Quantum  Design  MPMS  supercon¬ 
ducting  quantum  interference  device  (SQUID)  magnetometer 
with  the  applied  dc  magnetic  field  oriented  perpendicular  to 
the  film  plane.  We  concentrated  our  measurements  on  the 
temperature  interval  near  Tc  where  the  depth  (Bean)  profile 
of  the  magnetic  field  in  the  sample  is  nearly  flat.2  In  this  case 
uncertainties  concerning  the  penetration  profile  of  the  FL  are 
essentially  eliminated,  and  the  observed  sample  behavior  can 
be  understood  by  considering  only  a  few  “unit  cells”  of  the 
antidot  lattice. 

The  critical  current  IC(H)  was  defined  using  a  voltage 
criterion  of  0.2  /xV,  and  the  results  for  the  field  dependence 
of  I C(H)  and  p(H )  at  various  temperature  are  summarized  in 
Fig.  1.  Sharp  anomalies  in  IC(H )  and  p(H)  are  observed  at 
applied  fields  equal  to  multiples  of  the  MF,  Hm  =  nHl , 
where  Hl  =  (&/d2^20.1  G,  O0  is  the  flux  quantum  and  n  is 
an  integer. 

It  was  shown  earlier2,18  that  the  number  of  observed  MF 
varies  with  temperature,  with  more  anomalies  visible  at 
lower  T,  which  was  attributed  to  the  change  in  the  maximum 
number  ns  =  D/4^(T)  of  flux  quanta  trapped  by  a  single  an¬ 
tidot  due  to  the  strong  temperature  dependence  of  £(  T)  close 
to  Tc .  Therefore,  changes  in  the  collective  pinning  interac¬ 
tions  between  the  IFL,  and  the  IFL  and  the  fluxoid  currents, 
with  increases  in  the  number  of  interstitial  vortices  were  not 
considered  in  detail  prior  to  the  initial  discovery  of  SL  for 
In  contrast,  in  the  temperature  interval  A  T=TC 
—  T=  0.3  K  emphasized  in  our  measurement,  we  estimate 
ns=  1  (D  =  0.3  /zm,  T/Tc> 0.955,  and  £0^160  A),  which 
means  that  only  a  single  O0  can  be  trapped  at  each  antidot. 
Therefore,  all  matching  anomalies  observed  at  applied  dc 
fields  above  Hx  must  be  caused  by  the  existence  of  relatively 
stable  Si  with  different  symmetries.  Thermal  fluctuations 
have  less  influence  on  the  stability  of  SL  at  lower  tempera¬ 
ture,  which  explains  why  the  number  of  well  defined  MF 
anomalies  visible  in  Fig.  1  changes  from  2  at  T=  6.68  K 
=  0.997TC  to  8  at  T=  6.55  K=0.978TC .  Moreover,  there  was 
an  absence  of  the  usual  type  of  MF  anomaly  near  H6  and 
H7 ,  which  was  replaced  by  a  broad  anomaly  in  the  magne¬ 
toresistance  and  transport  critical  current  data  for  7^6.6  K. 

The  above  observations  were  confirmed  using  more  sen¬ 
sitive  measurements  of  complex  ac  magnetic  susceptibility 
X= x'  +  ix"*  taking  care  that  the  ac  amplitude  h0  was  suffi¬ 
cient  to  attain  full  penetration  of  the  ac  field  to  the  sample 
center.  The  results  in  Fig.  2  show  a  broad  anomaly  in  x(H) 
for  fields  spanning  Hm  =  6Hx  and  1HX  (which  cannot  be 
separately  resolved),  followed  by  a  weak  anomaly  at  H 
=  Hm=8Hi .  Taken  together,  the  data  of  Figs.  1  and  2  pro¬ 
vide  strong  evidence  that  the  anomalies  at  H>H5  reflect  the 
intrinsic  properties  of  the  SL  phases. 

Numerical  simulations  have  pointed  out  that  dissipation 
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FIG.  1.  (a)  Magnetoresistivity  (a)  p(H)  vs  applied  dc  magnetic 
field  H  for  different  temperatures  and  an  applied  ac  current  I 
=  1067  pA  driven  at  a  frequency  of  23  Hz.  (b)  The  critical  current 
I C(H)  vs  H  corresponding  to  the  data  of  (a).  Inset:  SEM  image  of  a 
square  lattice  of  antidots  for  the  sample  Nb  film. 

by  IFL  is  mainly  due  to  the  motion  of  IFL  chains  along  the 
(100)  or  (010)  directions  of  the  antidot  lattice.9  The  chart  of 
possible  IFL  given  in  Fig.  3  indicates  that  for  H5^H^H7 
ther  are  no  long,  straight  chains  of  IFL  which  could  “slip” 
along  the  (100)  or  (010)  directions.  This  effectively  places  a 
restriction  on  IFL  motion  that  would  explain  the  suppression 
of  dissipation  which  is  reflected  in  the  minima  in  p(H)  [Fig. 
1(a)],  the  enhancement  of  the  critical  current  IC(H )  [Fig. 
1(b)],  and  the  corresponding  anomalies  clearly  visible  in 
X(H)  (Fig.  2)  for  H5^H^H7. 

The  nonmonotonic  dependence  of  I c(Hm)  leads  us  to  the 
conclusion  that  at  lower- symmetry  SL  may  yield  a  relatively 
high  critical  current,  possibly  through  the  suppression  of  mo¬ 
bile  FL  chains.  The  reemergence  of  relatively  strong  pinning 
at  778  corresponds  to  the  transition  of  the  IFL  to  a  more  ideal 
hexagonal  coordination  of  the  IFL  about  the  antidot  sites. 
These  data  are  in  good  qualitative  agreement  with  numerical 
simulations9  that  produce  matching  anomalies  to  the  second 
and  fifth  MF  (corresponding  to  a  square  coordination  of  the 
IFL  about  the  antidot  sites),  and  the  fourth  and  eighth  MF 
(corresponding  to  roughly  hexagonal  coordination  of  the  IFL 
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FIG.  2.  Field  dependences  of  the  real  (x')  and  imaginary  (x") 
parts  of  the  ac  magnetic  susceptibility  x  times  the  ac  drive  ampli¬ 
tude  h0  for  a  Nb  film  containing  a  square  lattice  of  antidots  at 
different  temperatures  T,  h0  =  0.2  G,  and  measuring  frequency 
co=  10  Hz.  The  inset:  Fractional  matching  peaks  in  x"  at  T=  6.60  K. 


H/H 

m 

FIG.  4.  Magnetoresistivity  p(/7)  vs  applied  dc  magnetic  field  H 
at  temperature  T=  6.65  K  for  different  transport  currents  7=117, 
223,  330,  435,  540,  645,  752,  856,  963,  and  1067  julA,  driven  at  a 
frequency  of  23  Hz.  The  arrow  indicates  the  direction  of  the  trans¬ 
port  current. 


about  the  antidot  site).  These  stable  SL  configurations  can  be 
realized  only  in  the  case  of  strong  vortex-vortex  interactions 
extending  over  a  few  intervortex  separations.  Indeed,  it  was 
shown  that  at  relatively  high  temperatures  close  to  Tc  the 
effective  thin-film  penetration  depth  A  is  much  higher  than 
intervortex  separation  d ,  and  one  can  expect  collective  vortex 
behavior.2,3,8  The  presence  of  long-range  vortex-vortex  inter¬ 
actions  is  also  supported  by  our  observation  of  fractional 
matching  field  (771/2>  773/2,  etc)  for  H<H 3  in  the  ac  sus¬ 
ceptibility  data  of  Fig.  2.  These  MF  indicate  the  expected 
existence  of  superstructures  or  lattices  derived  from  vacancy 
ordering  in  the  IFL  when  the  vortex- vortex  interactions  ex¬ 
tend  beyond  the  antidot  separation.2,7,19 
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FIG.  3.  Supermatching  flux-line  lattices  SL  for  a  square  lattice 
of  antidots,  as  predicted  in  the  numerical  simulations  of  Ref.  9. 


The  p(/7)  data  for  different  transport  currents  and 
T=  6.65  K  are  shown  in  Fig.  4,  which  demonstrates  that  the 
number  of  resolved  MF  in  the  electrical  transport  data  is  also 
dependent  upon  applied  current.  This  is  again  consistent  with 
simulation  results9  (see  Fig.  3),  which  show  that  IFL  tend  to 
move  in  chains  along  linear  channels  perpendicular  to  the 
applied  current,  since  an  increase  in  current  along  the  (100) 
direction  induces  an  increase  in  the  Lorentz  force  acting  on 
the  IFL  along  the  (010)  direction.  This  leads  one  to  expect 
relatively  high  dissipation  near  77=774,  where  the  SL  con¬ 
sists  of  nearly  straight  lines  along  the  (010)  and  (100)  direc¬ 
tions,  in  contrast  to  the  case  for  H  =  H3 ,  where  relatively 
low-density  chains  are  broken  into  short  segments  consisting 
of  two  IFL. 

In  conclusion,  we  have  demonstrated  that  relatively  stable 
supermatching  vortex  lattice  phases  comprised  of  two  differ¬ 
ent  type  of  fluxoids — those  strongly  trapped  at  APC  sites  and 
Abrikosov  vortices  weakly  pinned  within  the  interstices  of 
the  APC  lattice — can  exist  in  patterned  superconducting  thin 
film  in  fields  up  to  877m  ,  much  higher  than  the  first  matching 
field.  We  present  evidence  that  rather  subtle  changes  in  the 
SL  symmetry  are  responsible  for  the  unexpected  loss  of  dis¬ 
tinct  MF  anomalies  observed  in  our  measurements.  Even 
with  a  relatively  simple  square  APC  lattice,  the  commensu¬ 
rate  IFL  are  surprisingly  complex  and  varied.  Our  results 
show  that  this  complexity  has  dramatic  influence  on  the  dy¬ 
namics  of  the  IFL,  inducing  large  changes  in  the  depinning 
threshold  and  in  the  dissipation  of  the  moving  vortices.  The 
close  connection  between  complexity  and  dynamics  makes 
superconductors  with  APC  lattices  an  attractive  system  for 
studying  dynamic  behavior.  The  complexity  can  be  con¬ 
trolled  easily  with  the  applied  field  and  the  symmetry  of  the 
APC,  and  the  depinning  dynamics  should  be  sensitive  not 
only  to  the  matching  field  but  also  to  the  Lorentz  force  di¬ 
rection.  One  may  find,  for  example,  a  periodic  variation  in 
the  critical  current  with  angle  as  the  Lorentz  force  sweeps 
through  easy  and  hard  depinning  directions.  As  we  have 
shown,  the  matching  field  behavior  as  seen  in  transport  and 
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ac  susceptibility  point  to  an  unexpectedly  rich  spectrum  of 
static  and  dynamic  behavior  in  superconductors  with  peri¬ 
odic  artificial  pinning  centers. 
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The  fabrication  of  nanoscale  structures  with  dimensions  approaching  the  scale  of  biological 
molecules  offers  approaches  to  the  study  of  fluid  dynamics  and  biomolecular  transport.  Ultimately, 
a  parallel  lithographic  approach  will  be  necessary  if  devices  based  on  these  nanofluidics  are  to 
achieve  widespread  availability  and  acceptance.  We  report  on  a  flexible,  all-optical  lithography 
alternative  that  is  amenable  to  large-scale  production.  We  use  interferometric  lithography  (IL)  and 
anisotropic  etching  to  produce  large  areas  of  parallel,  nanofluidic  channels  with  widths  of  —100  nm 
and  depths  of  up  to  500  nm.  We  also  use  standard  optical  lithography  to  create  interfacing 
microchannels,  such  that  the  range  of  spatial  scales  on  one  chip  varies  by  104  (from  mm  scale 
reservoirs  to  100  nm  nanochannels).  We  provide  initial  demonstrations  of  capillary  action  and 
electrophoretic  motion  of  fluorescent  dye  solutions.  ©  2003  American  Vacuum  Society. 

[DOI:  10.1116/1.1625964] 


I.  INTRODUCTION 

The  study  of  molecular  transport  phenomena  in  fluidic 
channels  of  nanoscopic  dimensions  is  a  current  frontier  in 
experimental  and  theoretical  fluid  dynamics.1,2  Nanofluidic 
systems  have  a  variety  of  applications  including  molecular 
separations,  manipulation  and  detection  of  individual  bio¬ 
molecules,  and  sensors.3-6  The  lack  of  convenient  and 
readily  available  experimental  systems  has  hindered  the  vali¬ 
dation  of  theoretical  and  simulation  studies  that  have  pre¬ 
dicted  unique  transport  properties7  and  molecular 
dynamics1,8  in  such  systems.  Fabrication  techniques  of  such 
systems  need  to  be  amenable  to  high  throughput  production, 
allow  nanoscale  patterning  over  large  surface  areas,  facilitate 
integration  of  nanofluidic  components  to  the  micro-  and 
macroscale  components,  permit  flexibility  in  design,  and  em¬ 
ploy  materials  that  are  compatible  with  biomolecules. 
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A.  Previous  approaches 

Electron  beam  (e-beam)  lithography  has  been  most 
widely  used  in  producing  nanofluidic  systems  with  well  de¬ 
fined  and  controllable  feature  sizes,  e-beam  lithography 
allows  for  complex  patterns  and  exquisite  resolution.9,10 
However,  high-throughput  fabrication  is  not  practical 
with  e-beam  lithography  due  to  its  slow,  serial  processing 
nature  that  is  not  well  suited  for  producing  large  areas  of 
nanostructures. 

Recently,  nanoimprint  lithography5,11,12  has  been  investi¬ 
gated  as  an  alternative  approach.  This  technique  requires  a 
mold  to  be  formed  first,  usually  with  e-beam  lithography  for 
nanoscaled  features12,13  and  optical  lithography  for  larger 
features,  such  as  interfacing  gradient  structures.12  The  pattern 
is  transferred  to  a  thermoplastic  polymer  (e.g.,  polymethyl¬ 
methacrylate)  through  heat  and  pressure5,11,12  or  into  an  ul¬ 
traviolet  (UV) -polymerizing  liquid.14  Nanoimprint  lithogra¬ 
phy  approaches  are  parallel,  fast,  and  well  suited  for  creating 
nanotextured  patterns  over  macroscopic  areas.  However,  a 
new  mold  must  be  created  whenever  a  feature  characteristic 
needs  to  be  changed  (e.g.,  pitch,  channel  size,  gradient  scale, 
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etc.).  Additionally,  nanoimprint  techniques  have  some  diffi¬ 
culty  in  accommodating  wide  ranges  of  feature  size  into  a 
single  pattern.  The  issue  of  stamp  lifetimes  has  also  not  yet 
been  fully  explored  and  may  pose  a  challenge  to  high- 
throughput  fabrication.15 


B.  All-optical  lithographic  approach 

We  have  based  our  fabrication  efforts  on  optical  lithogra¬ 
phy,  which  is  well  developed,  reliable,  and  flexible.  Inter¬ 
ferometric  lithography  (IL),  a  maskless  technique  based  on 
the  interference  of  two  or  more  coherent  beams,  allows  one 
to  inexpensively  and  quickly  pattern  nanoscopic  features 
over  large  surface  areas  with  easily  varied  feature  dimen¬ 
sions  (e.g.,  pitch  size  and  channel  width).  It  is  well  suited  to 
high-throughput  manufacturing.16 

While  traditional  optical  lithography  has  been  used  by 
others  for  the  fabrication  of  step- shaped  nanofluidic  devices 
on  silicon,  these  structures  are  nanoscopic  in  vertical  dimen¬ 
sion  only  (controlled  by  deposition  or  etching)  with  macro/ 
microscopic  transverse  dimensions  (controlled  by  litho¬ 
graphic  pattern  formation).4  With  IL  two  coherent  light 
beams  of  wavelength  X  are  crossed  at  an  angle  26 ,  producing 
a  regular  interference  pattern  with  d=X/(2Xsin  0)  describ¬ 
ing  the  period.16  With  an  UV  light  source,  one  can  easily 
obtain  periods  on  the  order  of  hundreds  of  nm  and  transverse 
pattern  features  in  the  sub- 100  nm  range,  well  beyond  the 
scales  available  from  traditional  optical  lithography  ap¬ 
proaches.  Developments  such  as  deeper  ultraviolet  sources 
and  immersion  techniques  promise  to  extend  these  scales  to 
sub-100  nm  periods  and  —10  nm  channel  widths.17 

With  this  approach,  nanochannels  can  be  etched  into  sili¬ 
con,  rather  than  being  pressed  into  plastic.  Silicon  can  be 
easily  oxidized  after  etching,  providing  an  inert,  electrically 
insulating,  and  hydrophilic  surface  that  can  be  chemically 
functionalized  with  silane  chemistry.  A  variety  of  etching 
processes  and  wafer-bonding  techniques  are  available  for 
silicon.  This  is  important  because  most  nanofluidic  devices 
require  one  to  seal  the  tops  of  nanotextured  surfaces  to  form 
nanoscopic  tunnels  rather  than  trenches.4,5,10,12 

IL  may  be  combined  with  traditional  optical  lithography 
to  yield  a  wide  range  of  characteristic  sizes  (mm  to  nm)  on  a 
single  device.  In  this  article,  we  describe  the  fabrication  of 
two  variations  of  nanofluidic  devices.  The  first  is  a  large-area 
nanochannel  array  chip  consisting  essentially  of  an  oxidized 
nanoscale  Si  grating  with  an  anodically  bonded  Pyrex  roof 
with  holes  in  it  for  convenient  introduction  of  fluids.  The 
second  device  contains  a  limited  area  of  nanoscale  features, 
integrated  microchannels,  and  macroscopic  reservoirs,  using 
a  convenient  cross  configuration  that  interfaces  the  nanoflu- 
idics  to  the  macroscopic  world  and  provides  control  mecha¬ 
nisms  for  fluid  flow  (others  have  also  used  microchannels  as 
an  interface  but  used  simpler,  linear  configurations).10  Such 
cross  configurations  have  been  successfully  and  widely  used 
in  study  and  manipulation  in  microfluidics.  For  example,  the 
formation  of  a  compositional  plug  or  band  in  microchannels 
is  facilitated  by  the  cross  configuration.18 


II.  FABRICATION 

Silicon  (100)  wafers  were  cleaved  into  3X4  cm2  chips 
that  were  cleaned  in  piranha  solution  (1  part  H202,  2  parts 
H2S04  by  volume),  triple  rinsed  with  de-ionized  (DI)  water, 
dipped  in  HF  acid  (to  remove  the  native  oxide  layer  and  any 
remaining  inorganic  contaminants),  and  again  triple  rinsed 
with  DI  water.  A  150  nm  thick  layer  of  XHRiC-16  (Brewer 
Science,  Inc.)  antireflective  coating  (ARC)  was  spin  depos¬ 
ited  (4000  rpm,  30  s)  and  hard  baked  at  175  °C  for  3  min. 
This  was  followed  by  spin  deposition  (4000  rpm,  30  s)  of  a 
200  nm  layer  of  positive  photoresist  [SPR510a  photoresist 
diluted  by  an  equal  amount  of  EC- 11  solvent  (Shipley,  Inc.)] 
and  soft  baking  at  95  °C  for  3  min.  The  exposure  source  was 
the  frequency-tripled  (X  =  355  nm)  output  of  a  YAG-Nd  la¬ 
ser  (Infinity  40-100,  Coherent  Inc.).  The  laser  beam  is  ex¬ 
panded  and  illuminates  a  right-angle  assembly  containing  a 
mirror  and  a  vacuum  chuck  to  hold  the  Si  sample.  Although 
we  have  focused  on  pitches  of  500  nm  (0—26°),  the  reflec¬ 
tor  assembly  can  be  rotated  to  produce  a  variety  of  grating 
pitches. 

After  exposure,  each  sample  was  soft  baked  at  110  °C  for 
1  min,  developed  using  undiluted  MF702  developer  (Shipley, 
Inc.),  and  rinsed  with  water,  leaving  a  photoresist  grating. 
The  developed  chip  was  placed  in  an  e-beam  evaporator 
where  a  thin  (35-40  nm)  layer  of  Cr  was  deposited.  We  then 
lifted  off  the  remaining  photoresist  (and  the  Cr  on  top  of  it) 
using  an  airbrush  acetone  spray,  leaving  a  negative-tone  Cr 
etch  mask  layer  on  top  of  the  remaining  ARC  (which  is 
impervious  to  the  acetone).  A  field-emission  scanning- 
electron  microscope  (FE-SEM)  image  of  one  such  mask  is 
shown  in  Fig.  1(A). 

Additional  processing  steps  were  used  to  prepare  the  in¬ 
tegrated  microfluidics  on  the  more  complex  samples  before 
etching.  To  form  the  200  /im  wide  microchannel  interfaces 
and  the  2  mm  diameter  reservoirs,  we  first  spun  (3000  rpm 
for  30  s)  a  1.44  jum  thick  layer  of  AZ  5214-E  (Shipley,  Inc.) 
resist  onto  the  samples  with  Cr  etch  masks.  The  chips  were 
then  exposed  with  a  rectangular  exposure  mask,  using  a  con¬ 
ventional  proximity  aligner,  developed,  and  baked  (95  °C  for 
10  min)  to  produce  a  protective  layer  over  the  area  where  we 
wanted  to  retain  the  nanochannels,  as  shown  in  Fig.  1(B).  We 
placed  the  chips  in  CEP-200  Cr  etchant  (Microchrome  Tech¬ 
nology)  to  remove  the  unprotected  regions  of  the  Cr  mask, 
then  removed  the  protective  photoresist  layer  with  acetone 
leaving  a  small  area  ( 1  X  10  mm2)  on  the  chip  with  a  Cr  etch 
mask  [Fig.  1(C)].  A  1.44  jmm  thick  layer  of  AZ  5214-IR 
(Shipley,  Inc.)  resist  was  spun  over  the  chips,  then  exposed 
with  both  the  microchannel  exposure  mask  [Fig.  1(D)]  and  a 
second  rectangular  exposure  mask  to  remove  the  resist  from 
the  nanochannel  area  [Fig.  1(E)].  This  results  in  a  chip  with 
an  etch  mask  made  of  photoresist  and  a  Cr  grating  as  repre¬ 
sented  in  Fig.  1(F). 

The  samples  were  reactive  ion  etched  (RIE)  using  a  mix¬ 
ture  of  02  and  CHF3 .  The  etched  silicon  gratings  were 
cleaned  with  piranha  solution  to  remove  the  ARC,  Cr,  and 
residual  polymer  from  the  RIE  process  [Fig.  2(A)].  After 
cleaning,  we  placed  the  chips  in  a  quartz  tube  furnace  con- 
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aluminum  block,  which  was  placed  on  top  of  the  Pyrex  roof 
and  connected  to  a  high  voltage  dc  power  supply.  We  raised 
the  temperature  of  the  hot  plate  to  380  °C  before  charging  the 
capacitor  structure.  We  would  typically  increase  the  capaci¬ 
tor  voltage  slowly  until  the  current  reached  a  value  of  1-2 
mA,  then  allow  the  current  to  decay  to  —0.5  mA  (indicating 
the  formation  of  a  space  charge  blocking  layer)  before  fur¬ 
ther  increasing  the  voltage.  We  used  as  large  a  voltage  as 
possible  while  avoiding  arcing  between  the  electrodes  (typi¬ 
cally  800-1000  V).  We  monitored  the  current  flow  through 
the  system  until  it  decayed  to  —100  pA  per  chip  before 
shutting  off  the  heating  element  and  allowing  the  sample  to 
cool.  The  voltage  was  decreased  to  zero  when  the  tempera¬ 
ture  dropped  below  150  °C.  A  FE-SEM  image  of  a  bonded 
chip  cross  section  is  shown  in  Fig.  2(C). 


III.  RESULTS 


Fig.  1.  Mask  fabrication  steps:  (A)  FE-SEM  image  of  a  Cr  etch  mask  after 
lift-off;  (B)  photoresist  is  spun  over  the  Cr  grating  and  exposed  (using  an 
intensity  mask)  and  developed,  leaving  a  small  area  of  the  Cr  etch  mask 
protected  by  a  layer  of  resist;  (C)  after  Cr  etch  and  acetone  rinse  a  smaller 
area  of  Cr  grating  remains;  (D)  a  second  resist  coat  is  spun,  exposed  with 
microchannel  mask,  and  developed;  (E)  the  chip  is  ready  for  etching  after  an 
additional  exposure  and  develop  process  to  clear  photoresist  away  from  the 
Cr  etch  mask;  and  (F)  a  top  view  of  the  final  etch  mask  (consisting  of  a 
chrome  etch  mask  for  the  nanochannel  area  and  a  photoresist  etch  mask)  for 
an  integrated  chip  with  reservoir  areas  numbered  for  reference. 


taining  ultrahigh-purity  grade  02  at  1100  °C  for  45-60  min 
to  form  an  insulating  oxide  layer  [Fig.  2(B)]. 

After  etching  and  oxidation,  all  chips  were  capped  with  1 
mm  thick  Pyrex  No.  7740  roofs  using  anodic  bonding.  These 
glass  plates  had  a  surface  quality  of  1.8  X  [measured  across  a 
2.54  cm  diameter  circle  with  a  laser  interferometer  (Zygo, 
Inc.)  at  633  nm].  The  roofs  for  the  integrated  chips  were 
predrilled  with  four  holes  (—2  mm  diameter),  located  above 
the  circular  reservoirs  at  the  ends  of  the  microchannels.  The 
roofs  for  the  large-area  nanochannel  chips  had  four  holes  (3 
mm  diameter)  drilled  in  a  row  on  one  edge  to  provide  ports 
for  convenient  loading  of  fluorophores  and/or  other  solu¬ 
tions. 

The  bottom  electrode  of  the  bonding  apparatus  was  the 
grounded  metal  surface  of  a  hot  plate  that  supported  the  oxi¬ 
dized  silicon  sample.  The  Pyrex  roof  was  placed  on  top  of 
the  oxidized  silicon  chip.  The  upper  electrode  was  a  small 


Fig.  2.  FE-SEM  images  of  etched  samples:  (A)  silicon  wafer  after  etching, 
(B)  the  same  wafer  after  oxidation,  and  (C)  an  oxidized  grating  with  a 
bonded  Pyrex  roof.  All  samples  have  been  sputtered  with  gold  prior  to 
imaging  to  reduce  charging  effects.  The  scale  bars  indicate  distances  of  100 
nm. 


We  prepared  a  solution  of  standard  Tris/glycine  electro¬ 
phoresis  buffer  (0.24  mM  Tris  and  1.92  mM  glycine,  p H 
8.8).  This  buffer  was  filtered  through  a  0.2  pm  filter  to  re¬ 
move  particulate  contaminants  and  then  degassed  under 
vacuum  for  at  least  3  h  to  reduce  outgassing  during  electro- 
kinetic  motion.  A  5  mg/ml  suspension  of  Alexa  Fluor  532  C5 
maleimide  (Molecular  Probes)  was  prepared  in  1.5  mM  Tris 
HC1  buffer  (pH8.8). 

A.  Fluid  flow  in  the  large-area  nanochannel  array  chip 

We  mounted  the  large-area  nanochannel  array  design  into 
a  Teflon  chuck  with  reservoirs  at  each  end  of  the  chip.  Plati¬ 
num  wire-mesh  electrodes  were  inserted  into  the  sides  of 
both  reservoirs  (—3  cm  from  the  chip  edge).  Poly(dimethyl- 
siloxane)  was  used  to  secure  the  chip  into  the  chuck  and  seal 
the  system  both  for  fluid  flow  and  for  electrical  isolation.  The 
assembly  was  imaged  using  an  upright,  laser  scanning  (543 
nm)  confocal  microscope  (Axioskop  using  an  LSM5  scan¬ 
ning  head,  Zeiss,  Inc.).  The  fluorescence  output  passed 
through  a  long- wavelength  pass  optical  filter  (560  nm  cut¬ 
off).  —2  p\  of  Tris/glycine  solution  was  added  to  each  hole. 
After  capillary  action  caused  the  buffer  to  move  —5  mm 
through  the  nanochannels,  we  added  a  few  p\  of  Alexa  532 
solution  to  one  of  the  holes  in  the  glass  roof  and  imaged  the 
flow  of  liquid  due  to  capillary  action.  Figures  3(A)  and  3(B) 
show  the  progression  of  the  dye  solution  over  30  s.  The 
average  fluid  velocity  (average  of  several  measured  veloci¬ 
ties  at  different  points  on  the  liquid  front)  was  12.2 
±0.6  /im/s. 

After  the  entire  chip  was  filled  via  capillary  action  with 
Tris/glycine  buffer,  we  filled  the  reservoirs  with  buffer.  We 
placed  1  pi  of  Alexa  532  dye  into  one  of  the  four  holes  on 
the  top  of  the  chip  and  biased  the  electrodes  with  50  V. 
Figures  3(C)  and  3(D)  shows  the  progression  (due  to  elec¬ 
trophoresis)  of  the  negatively  charged  Alexa  532  dye  towards 
the  positive  electrode  over  a  period  of  150  s.  The  average 
electrophoretic  velocity  (average  of  several  measured  veloci¬ 
ties  at  different  points  on  the  liquid  front)  was  0.77 
±0.03  pm/s. 
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Fig.  3.  Confocal  images  of  fluid  motion  in  the  large-area  grating  design:  the 
scale  bars  indicate  distances  of  200  gm,  the  reservoir  to  the  left  of  the 
picture  is  biased  at  +50  V,  and  the  reservoir  to  the  right  of  the  picture  is 
grounded.  (A)  The  bright  area  is  Alexa  532  while  the  dark  area  corresponds 
to  air-filled  nanochannels.  The  ring  like  interference  patterns  are  due  to 
some  surface  damage  in  the  glass  roof  from  the  bonding  process.  (B)  30  s 
later,  capillary  action  has  caused  the  Alexa  solution  to  flow  down  the  chan¬ 
nels.  (C)  At  a  different  location  on  the  chip  after  the  chip  has  been  filled  with 
buffer  by  capillary  action:  The  bright  area  is  Alexa  532  while  the  dark  area 
is  buffer  in  the  nanochannels.  (D)  150  s  later,  the  negatively  charged  Alexa 
dye  has  clearly  moved  towards  the  positive  electrode,  indicating  motion 
dominated  by  electrophoresis. 


B.  Electrokinetic  flow  through  integrated  chips 

We  loaded  one  of  the  integrated  chips  with  liquid  via 
capillary  action.  First,  50  jud  of  DI  water  was  loaded  into 


Fig.  4.  Confocal  microscopy  of  electrophoresis  in  integrated  chip:  the  chan¬ 
nels  are  outlined  with  dotted  lines  for  clarity,  the  left  hand  side  of  each 
picture  contains  the  nanochannel  area,  and  microchannels  are  on  the  upper, 
lower,  and  right  hand  sides  of  each  picture.  The  scale  bars  indicate  200  gm 
distances.  (A)  The  electrode  for  the  lower  channel  is  biased  at  — 100  V  while 
the  other  three  remain  grounded.  Electrophoresis  causes  the  dye  to  flow 
through  the  lower  microchannel  towards  the  top  part  of  the  picture.  It  begins 
to  flow  down  the  nanochannels  and  the  microchannel  on  the  right  as  it 
reaches  them.  (B)  60  s  later  dye  has  progressed  down  the  nanochannels.  (C) 
This  picture  was  taken  after  the  electrode  for  the  channel  on  the  right  hand 
side  of  the  picture  is  biased  at  —100  V  (along  with  the  lower  channel).  The 
dye  has  been  flushed  out  of  the  right  hand  side  microchannel  and  pinched 
off  a  slug  of  dye  (outlined  with  broken  lines)  in  the  nanochannels  directly 
opposite  the  right  hand  side  microchannel.  (D)  20  s  later  the  slug  has  pro¬ 
gressed  down  the  nanochannel  area. 


reservoir  4  [as  numbered  in  Fig.  1(F)].  We  monitored  the 
flow  of  water  through  the  microchannel  and  nanochannel 
areas  by  eye  until  the  water  had  reached  the  end  of  the 
nanochannels.  At  that  point,  50  jud  of  DI  water  was  intro¬ 
duced  into  reservoir  3,  filling  the  remaining  three  microchan¬ 
nels.  After  allowing  the  system  to  equilibrate  for  30  min,  50 
jud  of  Tris/glycine  buffer  was  added  to  reservoirs  1  and  2  and 
reservoirs  3  and  4  were  topped  off  with  buffer.  We  inserted 
platinum  wire  electrodes  into  all  four  reservoirs  and  imaged 
the  assembly  with  the  confocal  microscope.  The  reservoir  2 
electrode  was  grounded.  Three  independent  power  supplies 
(common  ground)  were  connected  to  the  remaining  three 
electrodes. 

We  introduced  Alexa  532  dye  into  reservoir  3  and  biased 
its  electrode  at  —100  V.  The  remaining  electrodes  were  at 
ground  potential.  The  negatively  charged  dye  moved  towards 
the  center  of  the  chip,  entering  both  the  nanochannel  area 
and  the  microchannel  connected  to  reservoir  1,  as  seen  in 
Fig.  4(A).  Figure  4(B)  shows  the  juncture  60  s  after  Fig. 
4(A)  was  recorded.  Since  the  dye  entered  the  nanochannels 
closest  to  reservoir  3  first,  it  has  progressed  the  farthest  in 
those  (the  lowest)  channels. 

We  then  changed  the  bias  for  reservoirs  1  and  3  to  —100 
V  while  reservoir  2  remained  at  0  V.  This  flushed  the  dye  out 
of  microchannel  1  and  pinched  off  a  slug  of  dye  in  the 
nanochannels  across  from  microchannel  1,  as  seen  in  Fig. 
4(C).  Figure  4(D)  shows  the  clear  progression  of  the  slug 
down  the  nanochannel  area  after  20  s.  We  calculate  the  ve¬ 
locity  of  this  slug  to  be  26.5  ±0.9  jjl m/s. 

C.  Discussion 

We  have  demonstrated  that  our  designs  are  suitable  for 
experiments  studying  electrokinetic  motion  in  nanoscale 
channels.  It  is  interesting  to  note  that  we  were  able  to  achieve 
much  higher  electrophoretic  velocities  in  the  integrated  chip. 
This  is  due,  in  part,  to  the  difference  in  nanochannel  lengths: 
The  length  of  the  nanochannel  area  of  the  integrated  chip  is 
about  1/3  the  length  of  the  sealed  nanochannels  in  the  large- 
area  chip.  The  microchannels  are  less  than  2/3  of  the  length 
of  the  sealed  channels,  and  have  a  lower  resistance  (due  to 
the  larger  cross-sectional  area),  resulting  in  a  larger  electric 
field  across  the  nanochannels  for  a  given  electrode  bias  volt¬ 
age.  This,  coupled  with  a  twofold  higher  electrode  bias, 
could  account  for  a  factor  of  6  difference,  less  than  the  factor 
of  32  observed.  Another  contributing  factor  may  the  differ¬ 
ence  in  buffer  concentrations  between  the  two  solutions.  It  is 
also  interesting  to  note  that  the  fluid  velocities  due  to  capil¬ 
lary  action  in  the  large-area  chip  are  also  quite  fast  (approxi¬ 
mately  half  that  of  the  electrophoretic  velocities  in  the  inte¬ 
grated  chips).  Further  investigation  is  necessary  to  fully 
characterize  the  fluid-flow  characteristics  in  these  nanochan¬ 
nel  samples. 

Application  of  100  V  to  the  large-area  nanochannel  array 
chips  quickly  resulted  in  a  drop  of  current  across  the  chip 
and  an  eventual  cessation  of  electrokinetic  motion,  which  is 
caused  by  outgassing  of  the  solution.  Indeed,  rapid  genera¬ 
tion  of  microscopic  bubbles  could  be  seen  at  the  edge  of  the 
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glass  on  these  chips.  These  effects  were  not  seen  in  the  inte¬ 
grated  chips,  possibly  because  of  the  much  better  defined 
macro -micro -nano  interface  hierarchy. 

IV.  CONCLUSIONS 

We  have  created  functional  nanofluidic  chips  based  on 
all-optical  lithographic  processes  with  feature  sizes  ranging 
from  <100  nm  to  2  mm  on  a  single  chip.  IL  is  the  basis  of 
the  nanochannel  fabrication,  and  allows  for  flexible  nanopat- 
terning  of  silicon  chips  over  large  surface  areas.  Traditional 
optical  lithography  provides  convenient  microfluidic  struc¬ 
tures  for  interfacing  with  the  nanochannels  and  controlling 
fluid  flow  as  we  have  demonstrated  with  cross-type  microf¬ 
luidics.  Our  techniques  are  suited  to  high-throughput  manu¬ 
facturing,  provide  flexible  nanotexturing  over  large  areas, 
and  can  produce  a  broad  range  of  feature  sizes  on  a  single 
chip.  They  allow  one  to  use  inert  and  hydrophilic  nanotex- 
tured  surfaces  (oxidized  silicon  with  glass  roofs)  that  are 
compatible  with  electrokinetic  studies. 

Future  fabrication  work  will  involve  the  addition  of  inte¬ 
grated  microelectrodes  to  the  chips,  smaller  nanochannel 
pitches,  two-dimensional  patterning  of  the  nanochannels 
(e.g.,  introducing  gradients  in  channel  widths),  and  adding 
additional  switching  capabilities  to  form  more  complex 
micro/nano  fluidic  arrangements.  Future  experimental  work 
will  include  detailed  parametric  studies  of  electrokinetic  mo¬ 
tion  and  flow  rates  of  biomolecular  species,  and  investiga¬ 
tions  of  separations  of  biomolecular  species  within  the 
nanochannel  devices. 
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ABSTRACT 

We  present  a  facile  approach  to  the  directed  assembly  of  silica  nanoparticles  100-nm  diameter)  into  periodic  arrays  on  flat  surfaces  using 
interference  lithography  and  spin  coating.  Periodic  photoresist  patterns,  used  as  templates  for  the  spin-coating  process,  were  prepared  using 
interference  lithography.  Silica  nanoparticle  dispersions  were  spin  coated  on  these  patterned  surfaces,  and  the  resist  was  removed,  leaving 
periodic  nanoparticle  patterns  on  flat  surfaces.  Parallel  arrays  and  continuous  cross  networks  of  particles  were  fabricated  successfully  with 
periods  between  hundreds  of  nanometers  and  several  microns  over  areas  of  several  square  centimeters.  The  morphology  (period,  layer  width 
and  thickness,  etc.)  of  the  particle  patterns  can  be  controlled  by  varying  the  process  conditions. 


The  fabrication  of  patterned  arrays  of  nanoparticles  exhibiting 
various  properties  on  surfaces  and  having  a  desired  morphol¬ 
ogy  is  an  essential  step  toward  the  creation  of  novel  nano¬ 
scale  materials  and  devices.  The  assembly  and  fabrication 
of  one-,  or  two-dimensional  (1D/2D)  micro/nanoscale  colloi¬ 
dal  structures  are  very  promising  for  this  purpose.  These 
micro/nanoscale  colloidal  arrays  may  find  applications  in 
photonic  materials,1’2  optoelectronic  devices,3’4  biochemical 
sensors,5  template  processing  (nanosphere  lithography,6’7 
colloidal  stamps,8’9-10  ordered  porous  materials11),  etc.  Devel¬ 
opment  of  a  versatile  manufacturing  technology  for  produc¬ 
ing  these  ordered  arrays  is  a  critical  requirement  for  the  suc¬ 
cess  of  these  applications.  Several  approaches  have  been  de¬ 
veloped  to  precisely  position  nanoparticles  in  patterned 
arrays,  including  sedimentation,12  electrostatic-induced  crys¬ 
tallization,13’14-15  convective  self-assembly,16,17  and  physical 
confinement.18,19  Most  of  these  studies  have  focused  on  the 
creation  of  patterned  particle  arrays  with  ordered  structures 
onto  well-defined  confined  geometries18,19  or  chemically 
modified  surfaces.14,15  For  most  of  these  reports,  the  patterned 
array  periodicity  has  been  at  length  scales  on  the  order  of 
micrometers  or  larger,  and  the  particles  used  for  patterned 
arrays  have  been  larger  than  ~  100-nm  diameter111418  (up 
to  micrometers20),  although  some  experiments  using  ^  20-nm 
diameter  particles  have  been  reported.21,22  Achieving  one- 
and  two-dimensional  (1D/2D)  patterned  arrays  with  10-  to 
100-nm  diameter  nanoparticles  having  controlled  morphol¬ 
ogies  on  a  flat  substrate  remains  an  important  challenge  that 
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is  receiving  a  great  deal  of  attention  for  establishing 
nanofabrication  techniques. 

Even  though  electron-beam  lithography  enables  fabrication 
of  small  structures  on  micro/nanoscale  ranges,  its  throughput 
is  too  low  for  the  manufacturing3  of  large-area  nanoscale 
patterns  and  it  requires  sophisticated  and  expensive  instru¬ 
mentation.23  Interferometric  lithography  (IL)  is  a  very 
promising,  low-cost,  reliable,  and  scalable  technology  for 
the  fabrication  of  nanoscale  periodic  patterns  over  large 
areas.24  For  IL,  two  (or  a  few)  coherent  UV  beams  are  used 
to  produce  a  periodic  interference  pattern  on  a  photoresist 
(PR)  film  with  the  spatial  period  P  =  2/2sin 0  where  2  is  the 
wavelength  and  0  is  the  half  angle  between  the  two  beams. 
With  the  addition  of  immersion  techniques,25  the  limits  of 
interferometric  lithography  extend  to  periods  of  2/2 n,  where 
n  is  the  immersion  liquid  refractive  index. 

In  this  work,  we  used  IL  to  make  large  areas  of  periodic 
photoresist  structures  on  a  Si  wafer  for  use  as  templates  for 
patterning  spin-coated  silica  nanoparticles.  Following  particle 
deposition,  the  PR  was  removed  by  heating  or  by  chemical 
treatment,  leaving  ordered  silica  particle  arrays  on  a  flat 
substrate. 

Immersion  methods  for  patterning  colloid  particles  are 
relatively  slow  (^hours)  and  require  a  large  volume  of  col¬ 
loidal  solution.  Uniform  filling  of  open,  continuous  channels 
or  networks  with  colloidal  crystals  can  prove  difficult  because 
of  channel  plugging  problems  during  capillary-driven  infil¬ 
tration  of  suspensions.26  Spin  coating  is  widely  employed 
in  semiconductor  manufacturing,  is  simple  and  convenient, 
uses  only  a  small  volume  of  fluid,  and  is  rapid  (~1  min) 


Figure  1.  Schematic  illustration  of  deposition  of  patterned  silica  particles  on  planar  surface  using  developing- soluble  ARC  (wet-i)  layer: 
(1)  expose  using  interference  lithography  and  develop;  (2)  deposit  silica  particle  using  spin  coating;  (3)  strip  off  PR/wet-i. 


and  highly  reproducible.  Spin-coating  assembly  allows  the 
formation  of  ordered  patterns  of  nanoparticles  with  controlled 
thicknesses  and  transverse  patterns  within  confined  PR 
templates  on  an  otherwise  flat  substrate. 

Figure  1  illustrates  the  scheme  for  directed  assembly  of 
silica  nanoparticles  into  patterned  arrays  on  flat  surfaces  using 
interference  lithography  and  spin  coating.  A  bottom  antire¬ 
flecting  coating  (B ARC)  layer  (wet-i,  wet  developable  B ARC 
for  I-line  lithography)  was  spun  on  the  precleaned  sample 
to  minimize  reflection  of  the  IL  beams  from  the  substrate, 
eliminating  vertical  standing  wave  patterns  that  otherwise 
cause  difficulties  with  the  PR  exposure  and  development 
processes.  A  positive-tone  photoresist  (Shipley  SPR510A) 
diluted  to  produce  a  ~200-nm  thick  film  was  used  in  these 
experiments.  IL  with  a  355-nm  tripled  YAG  laser  source 
was  used  to  produce  the  periodic  patterns  on  this  PR/wet-I 
film  stack.  The  angle  between  the  two  beams  determined 
the  period  of  the  pattern,  while  the  exposure  fluence  and 
development  parameters  controlled  the  final  pattern  line 
widths.  Parallel  PR/wet-i  lines  (ID)  were  formed  on  the  Si 
wafer  surface  after  exposing,  baking,  and  developing.  For 
2D  arrays  of  nanoscale  posts,  a  double  exposure  with  a  90° 
rotation  of  the  sample  was  performed,  and  all  other  steps 
were  the  same  as  those  for  ID  patterns. 

After  the  PR  patterning,  silica  nanoparticle  colloidal 
dispersions  were  spin  coated  onto  these  samples.  In  general, 
the  spin  coating  was  30  s  at  4000  rpm  with  acceleration  100 
rpm/s2.  Suspensions  of  two  size  distributions  of  silica 
nanoparticles  were  utilized  for  directed  assembly  of  ID  and 
2D  periodic  particle  structures  supported  on  flat  surfaces. 
The  suspensions  were  Snowtex  colloid  silica  from  Nissan 
Chemical  Industries.  Ltd.  The  concentrations  of  both  large 
silica  particles  (ZL,  78  zb  16  nm  diameter)  and  small  silica 
particles  (OL,  50  ±  13  nm  diameter)  were  5  wt  %.  The 
suspension  was  agitated  for  5  min  in  an  ultrasonic  bath  before 
spin  coating.  There  are  two  reasons  for  utilizing  silica 
particles  instead  of  polystyrene  beads:  (1)  the  silica  particles 
are  likely  to  match  the  Si  wafer  surface  and  to  be  compatible 
with  semiconductor  processing;  and  (2)  the  silica  particles 
are  readily  applicable  in  spin  coating  due  to  their  relatively 
high  mass  density  (2.13  gm  cm-3)  compared  to  polystyrene 
(1.05  gm  cm-3),  which  is  close  to  the  mass  density  of  the 
water-based  solvent. 

The  final  step  was  to  strip  the  PR/wet-i  patterns,  leaving 
ID  or  2D  ordered  patterns  of  silica  nanoparticles  on  the  flat 
surface  of  the  Si  wafer.  Two  methods,  immersion  in  piranha 
solution  and  calcination  at  high  temperature,  were  successful 
in  completely  removing  the  PR/wet-i  while  not  disturbing 


Figure  2.  FE-SEM  images  of  ID  patterns  with  500  nm  period. 
(A)  PR/wet-i  patterns  at  a  cleaved  edge.  (B)  ZL  (~78  nm  diameter) 
silica  particles  confined  between  PR/wet-i  walls.  (C)  Top- view  of 
typical  region  over  a  large  area  after  removal  of  the  PR  template. 
Top  right:  magnified  top  view.  Bottom  left:  cross-sectional  views. 
(D)  Tilted  45°  view  at  a  cleaved  edge. 

the  silica  particle  arrays.  Low  volumetric  ratio  (1:1,  1:1.5, 
1:2)  piranha  compositions  (30%  H2O2  :  98%  H2SO4)  worked 
well  in  removing  the  organic  material  while  retaining  the 
silica  particle  patterns.  Higher  volumetric  ratios  (e.g.,  1:4) 
of  piranha  had  violent  reactions  with  the  PR/wet-i,  producing 
many  bubbles  that  collapsed  the  silica  particle  patterns  to 
some  degree.  In  general,  we  soaked  the  as-patterned  samples 
in  1:1  piranha  solution  for  15  min.  Calcinations  were  then 
carried  out  at  800  °C  for  1.5—2  h.  The  mechanical  stability 
of  particle  patterns  on  flat  surfaces  was  enhanced  as  a  result 
of  partial  sintering  between  the  silica  particles  and  the  Si 
surface  at  this  temperature.27  The  resulting  structures  were 
characterized  by  field-emission  scanning  electron  microscopy 
(FE-SEM)  at  30  kV  with  a  10-nm  thick  Au  film  sputtered 
on  the  sample  surface.  Some  top  views  and  cross-section 
views  of  images  were  also  obtained  to  check  the  structure 
of  the  ordered  patterns. 

Figure  2 A  shows  an  FE-SEM  image  of  typical  PR/wet-i 
lined  patterns  with  ~500  nm  period  on  a  Si  surface.  Standing 
wave  patterns  on  the  PR  sidewalls  were  minimized  after  use 
of  the  wet-i  layer.  The  post-develop  width  of  the  wet-i  layer 
(70—80  nm  thick)  was  controlled  by  preexposure  bake  and 
development  parameters;  it  is  somewhat  over-developed  in 
Figure  2A.  Figure  2B  shows  another  example  of  FE-SEM 
images  for  ordered  ID  patterns  after  spin  coating  with 
suspension  of  ZL  silica  particles.  The  ZL  silica  particles 
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Figure  3.  FE-SEM  images  (tilted  45°)  of  ID  patterns  with  1000  nm  period.  (A)  ZL  (~78  nm)  particle  arrays  at  a  1:1  line/space  ratio.  Top 
right  inset:  magnified  tilted  45°  top  view.  Bottom  left  inset:  cross-sectional  view.  (B)  ZL  (~78  nm)  particle  arrays  with  a  2:1  line/space 
ratio.  Top  right  inset:  magnified  cross-sectional  view.  (C)  OL  (~50  nm)  particle  arrays  at  a  3:1  line/space  ratio.  Top  right  inset:  magnified 
cross-sectional  view. 


densely  filled  the  confined  spaces  between  PR/wet-i  walls 
through  spin  coating  of  silica  suspension.  Some  isolated  point 
defects  existed  on  1-D  particle  lines  with  a  density  of  less 
than  5%.  Well-ordered  patterns  of  close-packed  silica 
particles  were  achieved  by  controlling  the  processing  condi¬ 
tions.  The  thicknesses  of  the  silica  particle  layers  packed 
inside  the  confined  spaces  could  be  increased  by  increasing 
the  number  of  spin  coating  cycles.  ID  ZL  silica  particle 
arrays  were  produced  after  1 : 1  piranha  treatment  to  remove 
the  organic  material,  as  shown  in  Figure  2C  and  2D.  Parallel 
arrays  of  particle  lines  with  500-nm  period  were  produced 
over  a  large  area  (Figure  2C).  Three- wide  columns  of  ZL 
nanoparticles  were  assembled  on  flat  surfaces  (Figure  2D 
and  top-right  inset  in  Figure  2C).  Two  layers  of  particles 
were  close-packed  on  bare  and  flat  surfaces  of  Si  wafer  in 
this  case  (Figure  2D  and  bottom-left  inset  in  Figure  2C). 
The  width  ratio  between  particle-occupied  line  and  unoc¬ 
cupied  was  roughly  3:2  for  this  sample.  The  width  ratios  of 
these  two  regions  could  be  tuned  by  adjusting  exposure  time 
or  power  density. 

One-dimensional,  two-column  particle  arrays  were  also 
prepared  on  flat  surfaces  with  wide  PR/wet-i  wall  patterns 
(data  not  shown).  The  ordered  colloidal  crystal  parallel  lines 
on  flat  surfaces  are  determined  by  template  geometry  (pitch, 
width  ratio),  colloidal  properties  (diameter,  concentration, 
etc.),  and  spin-coating  parameters.26  The  forces  driving  the 
silica  particles  into  these  close-packed  formations  are  a 
combination  of  capillary,  gravitational,  centrifugal,  and 
electrostatic  forces,  as  well  as  the  physical  confinement  at 
the  PR  line  edges.18  Together,  these  forces  cause  the  silica 
particles  to  settle  and  form  close-packed  arrays  within  the 
confined  spaces  of  the  PR  pattern.  The  particle  arrays  remain 
after  piranha  treatments  or  calcinations  to  remove  the  PR. 
The  choice  of  approach  for  removal  of  organic  templates 
depends  on  the  potential  applications  and  the  subsequent 
processing  requirements.  For  example,  the  piranha  treatment 
is  preferable  if  the  silica  arrays  are  to  be  employed  as  an 
intermediate  template  for  a  subsequent  synthesis  or  fabrica¬ 
tion,  and  the  silica  arrays  are  then  removed;  the  calcinations 
are  mostly  likely  to  be  a  good  choice  if  the  micro/ 
nanostructure  is  fabricated  based  on  these  particle  arrays. 

Figure  3  demonstrates  the  flexibility  of  this  approach.  The 
period  of  the  patterns  is  easily  adjusted  by  changing  the 
incidence  angles  in  the  interferometric  exposure.  Figure  3 
gives  examples  of  1-jum  period  patterns.  The  uniformity  of 
the  particle  lines  is  good  over  a  large  area.  In  these  large- 
scale,  periodic  patterns,  the  particles  assembled  into  a  thin 


Figure  4.  The  FE-SEM  images  of  2D  cross  network  patterns  of 
ZL  (~78  nm)  particles  with  a  500-nm  period.  (A)  Top- view  of 
typical  region.  (B)  Tilted  45°  view  at  a  cleaved  edge.  Top  right 
inset:  magnified  cross-sectional  view. 

colloidal  crystal  as  expected.  As  noted  from  Figure  3A  and 
3B,  the  morphology  of  the  particle  lines  was  impacted  by 
the  pattern  line/space  ratio  for  the  same  period  and  under 
the  same  experimental  deposition  conditions.  Approximately 
two-layer-thick  films  of  close-packed  particles  were  formed 
for  the  sample  with  the  1:1  line/space  ratio  while  only  a 
single-particle-thick  close-packed  layer  was  obtained  for  the 
2:1  line/space  ratio  sample.  As  seen  in  the  magnified  inset 
to  Figure  3A,  some  particles  extended  under  the  PR  as  a 
result  of  the  undercut  of  the  wet-i  and  formed  a  single  layer 
pattern.  The  defects  were  mainly  grain  boundaries  due  to 
stacking  faults  and  few  points  of  vacancy  from  these  small 
silica  particles  relative  to  the  pattern  periods.11’28  Another 
interesting  phenomenon  is  that  there  was  an  increase  in  the 
film  thickness  at  both  line  edges  as  shown  in  the  inset  images 
in  Figure  3.  We  attribute  this  phenomenon  to  the  high  spin 
speed  and  relatively  small  ratio  between  particle  diameter 
and  width  of  the  lines.  This  edge-high  effect  is  minimum 
for  parallel  arrays  with  fewer  than  three  columns  as  shown 
in  Figure  2C  and  2D,  no  matter  how  many  layers  of  particles 
are  deposited.  Figure  3C  shows  ordered  particle  lines 
fabricated  with  smaller  OL  silica  particles  (~50-nm  diameter) 
for  a  line/space  ratio  3:1  sample.  Close-packed,  multiparticle 
thick  silica  particle  lines  were  observed  for  these  small  silica 
OL  particles.  Moreover,  the  enhanced  thickness  at  the  edges 
of  the  line  was  more  pronounced  for  this  case. 

Two-dimensional  periodic  patterns  of  particles  on  a  flat 
surface  were  also  fabricated  using  this  lithography/deposition 
approach.  To  create  the  2D  periodic  structures  shown  in 
Figure  4,  we  produced  a  periodic  array  of  PR/wet-i  posts  on 
a  flat  surface  through  double  exposures  with  positive  PR, 
and  then  followed  the  same  procedures  as  for  ID  patterns. 
Periodic,  continuous  cross-network  patterns  at  a  500  nm 
period  were  fabricated  with  ZL.  The  empty  circular  hole  was 
about  220-  to  250-nm  diameter.  One  layer  of  close-packed 
2D  patterns  was  formed  in  this  case  (Figure  4B).  The  period 
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Figure  5.  The  FE-SEM  images  (tilted  45°  view)  of  ID  patterns 
with  500  nm  period.  (A)  Binary  particle  arrays  with  ZL  (~78  nm) 
and  OL  (~50  nm)  silica  particles  at  a  cleaved  edge.  (B)  Dented 
Au  lines  using  silica  particle  array  template. 

and  hole  diameter  of  2D  networks  are  tunable  through 
adjusting  parameters  in  the  exposure  step,  while  the  multi¬ 
layer  deposition  and  particle  size  are  controllable  in  the  spin¬ 
coating  step.  We  can  also  produce  other  kinds  of  2D  patterns 
(e.g.,  elliptical  hole,  or  hexagonal  symmetry  holes)  in  the 
exposure  step,  thus  more  complex  2D  particle  patterns  can 
be  fabricated  through  this  flexible  approach. 

The  combination  of  large-area,  nanoscale  lithography  and 
spin  coating  provides  a  very  useful  method  for  engineering 
complex  nanoscale  building  blocks.  The  fabrication  of  a 
nanoscale  patterned  surface  is  the  first  step  or  prerequisite 
which  may  be  relevant  for  a  broad  range  of  technological 
applications  such  as  controlling  crystallization,  fabricating 
biological  and  chemical  sensors,  and  engineering  microelec¬ 
tronic  and  optoelectronic  devices.  For  ID  ordered  particle 
patterns,  the  patterned  particle  arrays  with  two-particle  or 
binary  particle  assemblages  could  be  fabricated  further  for 
applications  in  photonic  band  gap  materials  and  optical 
devices.11  Further,  ternary  particle  arrays  could  be  fabricated 
for  potential  applications  in  the  display  devices  and  2D 
photonic  crystals.  As  for  2D  ordered  particle  patterns,  they 
could  be  used  as  square  templates  for  the  growth  of  large 
colloidal  particles  with  the  (100)  planes  parallel  to  the  surface 
of  the  substrate.19’29  More  complicated  nanostructures  with 
silica  nanoparticles  can  be  fabricated  using  these  initial 
structures  as  building  blocks.  In  addition,  ordered  silica 
particle  patterns  on  Si  wafers  could  serve  as  the  patterned 
SiC>2  layer  for  epitaxial  masks.30  The  process  for  fabrication 
of  patterned  SiC>2  layers  with  silica  particles  instead  of  a 
thermally  grown  Si02  layer  was  simple,  rapid,  and  did  not 
require  additional  processing:  etching,  metallization,  etc.  To 
our  knowledge,  this  is  the  first  demonstration  of  fabrication 
of  periodic  patterns  with  silica  particles  in  the  range  of  50- 
to  100-nm  diameters  on  a  bare  Si  planar  surface. 

Figure  5  shows  two  typical  examples  of  further  fabrication 
based  on  ID  ordered  particle  patterns.  Binary  particle  arrays 
were  formed  through  a  second  spin  coating  of  a  smaller 
particle  OL  suspension  onto  an  already  ordered  ZL  particle 
array  (Figure  5A).  The  small  silica  particle  OL  completely 
filled  the  gap  between  large  silica  particle  (ZL)  walls.  Some 
OL  particles  also  filled  the  voids  between  the  large  ZL 
particles  on  the  top  surface  of  the  ZL  particle  walls.  These 
silica  particle  patterns  could  also  be  used  as  masks  in 
subsequent  processing  steps,  such  as  metallization.  Metal 
lines  were  deposited  through  a  ID  array  of  ZL  particles 
(Figure  5B).  The  Au  lines  were  deposited  by  e-beam 
evaporation  on  a  ID  ZL  particle  patterned  sample  prepared 


by  immersion  in  piranha  solution  for  removal  of  the  organic 
layer.  After  the  Au  deposition,  the  silica  pattern  was  easily 
removed  through  ultrasonic  treatment  and  a  large  area  of 
Au  lines  was  formed  on  a  Si  flat  surface.  The  patterned  Au 
thin  film  could  be  used  as  a  catalyst  for  aligned  growth  of 
ZnO  nanowires  that  have  potential  applications  in  optoelec¬ 
tronics  and  field  emission.31,32  This  demonstrates  a  new, 
alternative  processing  approach  for  nanoscale  patterning 
which  might  be  beneficial  to  some  applications. 

In  summary,  we  have  presented  a  simple,  inexpensive,  and 
reliable  approach  to  directed  assembly  of  ^  100-nm  diameter 
silica  nanoparticles  into  ID  and  2D  periodic  patterns  on  flat 
surfaces  using  interference  lithography  and  conventional 
spin-coating  processes.  Parallel  arrays  of  lines  and  continuous 
cross  networked  particle  arrays  with  empty  cylindrical  holes 
were  successfully  fabricated  through  this  approach  with 
periods  between  hundreds  of  nanometers  and  several  mi¬ 
crometers  over  areas  of  several  square  centimeters.  The 
number  of  particle  layers,  the  pattern  period,  and  the  line/ 
space  ratio  are  controllable  by  varying  the  process  conditions. 
Interferometric  lithography  is  capable  of  periods  to  ^  70 
nm  (2/2 n  for  2  =  193  nm  and  n(H20)  =  1.44)  and  spaces  of 
~35  nm,  these  small  dimensions  will  require  extension  of 
this  work  to  smaller  particle  diameters.  This  method  provides 
a  flexible  and  versatile  route  to  the  fabrication  of  particle 
arrays  on  a  flat  surface,  which  have  potential  applications 
in  optics,  electronics,  sensing,  and  as  a  masking  step  for 
templating  more  complex  nanostructures.  More  monodisperse 
particle-size  distributions  should  result  in  increased  local 
order  within  the  particle  arrays. 
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The  fabrication  of  nanoparticle  ensembles  with  both  short- 
and  long-range  order  across  a  hierarchy  of  spatial  scales  re¬ 
mains  a  major  challenge  for  nanoscience.  Patterned  nanopar¬ 
ticles  on  different  substrates  have  been  the  focus  of  many 
investigations  due  to  their  unique  potential  for  many  appli¬ 
cations  including:  nanoelectronics,  nanophotonics,  magneto¬ 
electronics,  and  biochemical  sensing.[1_5]  Colloidal  crystalli¬ 
zation  using  silica  and  polystyrene  particles  can  lead  to 
optical- wavelength  photonic  crystals.^  Generally,  conven¬ 
tional  lithographic  approaches  are  nearing  fundamental  limits 
and  it  seems  clear  that  directed  self-assembly— an  integration 
of  lithography  and  self-assembly— is  destined  to  play  an  im¬ 
portant  role  across  a  wide  range  of  nanotechnology  applica¬ 
tions.  This  will  require  a  sophisticated  understanding  of  the 
chemical  and  physical  interactions  between  lithographically 
defined  patterns  and  nanoparticles.  This  paper  is  a  step  to¬ 
wards  that  understanding. 

Fabrication  approaches  for  nanopatterning  of  surfaces  in¬ 
clude:  electron-beam  lithography, optical  and  interference 
lithography, [3,9]  imprint  lithography [10,11]  soft  lithography, 
and  colloidal[12]  and  self-assembly  lithography. These  tech¬ 
niques  can  create  topographical  (grooves,  holes,  elliptical 
“tubs”,  etc.)  and  material  variations  on  substrate  surfaces. 
Even  though  electron-beam  lithography  enables  fabrication 
of  small  structures  with  high  flexibility,  its  serial  nature  makes 
it  too  slow  to  generate  large  numbers  of  large-area  nanome¬ 
ter-scale  patterns. For  self-assembly,  the  long-range  order  is 
limited  in  many  cases  by  the  formation  of  multiple  domains  or 
orientations  with  attendant  grain  boundaries.  Imprint  lithog¬ 
raphy  and  interference  lithography  (IL)  are  available  to  pat¬ 
tern  large-area  samples  with  nanometer-scale  features.  IL  is 
very  promising  as  a  low-cost,  reliable,  and  scalable  technology 
for  establishing  sub-100  nm  features  across  large  areas.[13] 

A  rich  variety  of  methods  have  been  demonstrated  for 
depositing  colloid  particles  on  patterned  surfaces  into  highly 
ordered  one-dimensional  (ID),  two-dimensional  (2D),  or 
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three-dimensional  (3D)  arrays  for  many  applications. [8,10,14,15] 
Typical  approaches  include  gravity  sedimentation, convec¬ 
tive  self-assembly^10,16]  electrostatically  induced  crystalliza¬ 
tion,1 [8,17]  and  self-assembly  under  physical  confinement. [7,18,19] 
Immersion  methods  for  depositing  colloidal  particles  are  rela¬ 
tively  slow  and  require  a  large  volume  of  colloidal  solution. 
Spin-coating  is  widely  employed  in  semiconductor  manufac¬ 
turing  facilities,  particularly  in  lithography  processes  for  the 
deposition  of  uniform  photoresist  layers.  It  is  simple,  conveni¬ 
ent,  and  sophisticated  automated  tooling  is  available.  Spin¬ 
ning  requires  only  a  small  volume  of  solution,  offers  high 
reproducibility,  is  very  fast,  and  builds  upon  a  large  installed 
manufacturing  base. 

In  this  communication,  we  present  an  approach  that  makes 
use  of  traditional  spin-coating  to  assemble  silica  nanoparticles 
(<  100  nm  diameter)  in  nanometer-scale  grooves  and  cylindri¬ 
cal  holes.  This  work  is  an  extension  of  previous  reports  of 
spin-coating-directed  deposition  of  silica  particles  from  the 
micrometer  regime  well  into  the  nanometer  regime J2°]  Silica 
particles  were  chosen  for  this  study  because  of  their  chemical 
compatibility  with  the  silica  channels;  their  relatively  high 
mass  density  (compared  with  latex  spheres);  and  their  chemi¬ 
cal  and  thermal  stability.  In  future  studies,  functionalization  of 
the  silica  nanoparticles  will  be  explored. 

A  field-emission  scanning  electron  microscope  (FE-SEM) 
was  used  to  characterize  the  patterns.  The  influences  of  fac¬ 
tors  including  the  solution  concentration  and  pH  value,  and 
the  spin  speed,  etc.  were  investigated  in  order  to  optimize  the 
patterns  of  nanoparticles  in  grooves  on  Si  samples. 

The  procedure  for  the  fabrication  of  nanometer-scale 
groove  structures  on  Si  wafer  is  described  in  the  Experi¬ 
mental  section,  and  is  depicted  schematically  in  Figure  1. 
The  procedure  for  the  fabrication  of  nanometer-scale  cylin¬ 
drical  holes  in  a  silicon  dioxide  layer  on  a  Si  substrate  is 
similar  to  that  for  nanometer-scale  groove  structure  on  Si 
wafer  except  for  double  exposures  (rotated  by  90°)  at  the  li¬ 


thography  step.  Silica  nanoparticles  were  spun  onto  the  pat¬ 
terned  Si  samples. 

For  the  preliminary  investigation  of  spin-coating-directed 
self-assembly  of  silica  nanoparticles  into  nanometer-scale 
grooves,  the  influence  of  different  pH  values  on  the  final  pat¬ 
terns  was  observed.  The  spin  speed  was  3000  rpm  and 
~  80  nm  diameter  silica  particles  (ZL,  see  Experimental  sec¬ 
tion)  were  used.  For  a  strongly  acidic  solution,  the  ZL  parti¬ 
cles  agglomerate  in  solution  and  the  spin-coating  does  not 
produce  an  ordered  result.  At  medium  acidic  solutions 
(~pH4),  some  silica  particles  remain  on  the  mesa  surface,  al¬ 
though  the  selectivity  of  deposition  into  the  grooves  is  much 
improved.  This  behavior  is  likely  caused  by  interactions  and 
bonding  between  silica  particles  in  the  solution  before  spin¬ 
ning.  The  extent  of  inter-particle  bonding  depends  on  the  sili¬ 
ca  surface  chemistry,  which  varies  with  the  solution  pHJ21,22^ 
At  >pH7,  the  adhesion  and  interparticle  bonding  is  dramati¬ 
cally  decreased  so  that  the  silica  particles  stay  separate  in  the 
colloidal  suspension.[22]  The  surface  of  as-synthesized  silica 
particles  is  always  terminated  with  a  silanol  group  (-Si-OH), 
which  ionizes  to  give  a  negatively  charged  interface  at 
>pH7.[17,21]  Since  the  patterned  samples  used  in  our  experi¬ 
ments  were  treated  with  oxygen  plasma,  their  surface  should 
also  have  had  negative  charges  at  >  pH  7.  As  a  result,  both  the 
repulsive  interactions  between  particles  and  those  between 
the  particles  and  the  surface  prevent  clustering  in  the  liquid 
and  serve  to  maintain  the  particles  in  the  liquid  as  long  as  pos¬ 
sible  during  the  spin-coating  for  >pH7.[1?]  Our  experimental 
results  at  >  pH  7  show  that  particles  were  deposited  selectively 
into  the  grooves  during  the  spin-coating.  Despite  the  fact  that 
the  grooves  were  not  fully  filled  in  some  cases,  a  relatively 
uniform  distribution  into  these  grooves  was  nevertheless  ob¬ 
tained  over  the  entire  sample.  In  the  following  results,  a  ZL 
colloidal  solution  of  pH  8.8  was  used. 

The  concentration  of  silica  nanoparticles  also  has  an  effect 
on  the  formation  of  the  particle  pattern  found  on  the  grooves. 

If  an  excessively  high  concentration 


is  used,  a  thick  deposit  of  silica  par¬ 
ticles  that  fully  covers  the  wafer  sur¬ 
face  is  formed  without  any  impact 
of  the  lithographic  patterning.  At 
very  low  concentrations,  only  a  spar¬ 
sely  populated  distribution  of  parti¬ 
cles  forms  within  the  patterned  fea¬ 
tures.  Based  on  initial  screening 
experiments,  a  1  wt.-%  suspension 
was  found  to  be  the  most  suitable 
for  directed  deposition  of  dense, 
patterned  features  using  80  nm  and 
50  nm  silica  particles  at  spin  speeds 
of  between  2000  and  5000  rpm. 

The  forces  contributing  to  depos¬ 
iting  particles  inside  grooves  include 


Figure  1.  Schematic  illustration  of  the  fabrication  of  periodic  arrays  of  grooves  using  interference  lithog¬ 
raphy  (IL):  1)  expose  and  develop;  2)  deposit  Au;  3)  lift-off  and  etch  ARC;  4)  etch  Si;  5)  remove  ARC 
and  Au. 


electrostatics,  capillarity  and  surface 
tension,  physical  confinement,  spin¬ 
ning  forces,  gravitational  force,  and 
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particle-wall  and  particle-particle  binding. [18]  At  the  initial 
stages  of  directed  self-assembly  using  spin-coating,  the  silica 
particle  suspension  spreads  across  the  wafer  without  much  im¬ 
pact  from  the  patterning.  At  later  stages,  the  combination  of 
gravity- driven  sedimentation,  evaporation-induced  capillary 
forces  (dominant),  centripetal  forces,  and  blocking  forces 
from  physical  confinement  at  the  pattern  edges  leads  to  the 
directed  self-assembly  of  silica  particles  into  the  confined 
spaces.  From  an  experimental  perspective,  the  important  fac¬ 
tors  influencing  the  deposition  pattern  through  this  approach 
include:  spin  speed;  particle  properties  (mass  density,  concen¬ 
tration,  particle  shape,  etc.);  fluid  properties  (pH,  solvent, 
viscosity,  evaporation,  surface  tension,  etc.);  and  confinement 
geometry.  Deposition  by  spin-coating  is  a  complex  nonequili¬ 
brium  process. [23] 

The  influence  of  spin  speed  on  nanoparticle  deposition  into 
grooves  is  an  important  issue. [20]  For  these  experiments,  the 
grooves  were  about  200-220  nm  wide  and  150-200  nm  deep. 
ZL  silica  colloidal  solution  (pH  8.8)  was  used.  The  FE-SEM 
images  for  spin  speeds  of  between  2000,  2500,  and  3500  rpm 
are  shown  in  Figure  2.  There  are  no  silica  particles  on  the 
mesa  tops  for  any  of  these  cases.  At  2000  rpm  (Fig.  2A),  a  dis¬ 
continuous  column  of  particles  is  observed  on  the  right-hand 
edge  of  several  grooves.  The  rotation  direction  of  the  spinner 
determines  which  groove  edge  these  particles  settle  onto.  The 
centrifugal  force  for  upper  column  particles  at  2000  rpm  is  too 
low  to  overcome  the  physical  confinement  due  to  the  groove 
wall.  Inside  the  grooves,  below  these  edge  particles,  there  are 
two  close-packed  columns  of  particles.  As  the  spin  speed  was 
increased  to  2500  rpm  (Fig.  2B),  a  loosely  packed  chain  (one 
column)  or  a  zigzag  chain  (one  and  a  half  columns)  are  the 
main  particle-distribution  patterns.  At  this  higher  spin  speed, 
the  excess  particles  on  top  of  the  two  columns  of  particles 


were  removed  by  the  spinning  forces.  The  larger  forces  at 
higher  velocities  also  caused  the  particles  inside  the  grooves 
to  re-distribute  and  some  weakly  bound  particles  escaped 
leaving  gaps  in  the  particle  arrays.  However,  at  3500  rpm 
(Fig.  2C),  most  of  the  particles  were  condensed  into  two  par¬ 
allel  discontinuous  close-packed  columns. 

An  interesting  phenomenon  is  that  there  is  often  a  larger 
particle  at  the  end  of  the  deposition  region  followed  by  a  gap 
with  no  particles.  At  this  high  spin  speed,  the  redistribution  of 
particles  results  in  close-packed  columns  with  larger  particles 
terminating  a  region  of  smaller  particles.  This  phenomenon 
was  also  observed  for  smaller,  ~50  nm  diameter  silica  parti¬ 
cles  (OL)  in  small  grooves  (data  not  shown).  The  cross-sec¬ 
tioned  images  (Fig.  2D)  show  the  two  columns  of  particles 
and  the  dimensions  of  grooves  for  spin  speed  of  3500  rpm. 
One  layer  of  ZL  silica  particles  resides  inside  the  grooves  due 
to  the  relatively  deep  groove  profile.  The  discontinuous  close- 
packed  columns  at  this  high  spin  speed  are  also  likely  to  be 
impacted  by  the  cross-sectional  groove  profiles.  The  groove 
side-wall  slope  was  a  result  of  the  reactive-ion  etching  (RIE) 
of  Si.  The  relatively  deep  groove  structures  may  have  had  in¬ 
sufficient  physical  confinement  in  this  high  velocity  spin-coat¬ 
ing  process.  To  test  this,  shallow  groove  structures  (~  100  nm 
deep)  were  employed  for  depositing  ZL  at  4000  rpm.  A  large 
area  of  close-packed,  two-column  patterns  was  obtained  over 
the  entire  sample  (Fig.  2E).  The  magnified  inset  images  show 
that  almost  perfect  two  parallel  columns  of  ZL  silica  particles 
were  formed  into  grooves  except  for  rare  point  defects  and 
overfilled  defects  (indicated  by  the  white  arrows).  Linear  and 
zigzag  chains  of  particles  in  these  wide,  shallow  grooves  were 
also  observed  in  limited  areas  (Figs.  2F,G).  The  particle  pat¬ 
tern  in  Figure  2F  often  appeared  at  the  transition  region  be¬ 
tween  perfect  two-column-particle  region  (Fig.  2E)  and  pseu- 


Figure  2.  Field-emission  scanning  electron  microscope  (FE-SEM)  images  of  ~80  nm  diameter  silica  particles  on  grooved  samples  for  different  spin 
speeds:  A-D)  relatively  deep  groove  structure,  A)  2000  rpm;  B)  2500  rpm;  C)  3500  rpm.  The  inset  is  a  magnified  image  of  a  close-packed  pattern; 
D)  3500  rpm  edge  view;  E-G)  shallow  groove  structure,  E)  4000  rpm.  The  inset  is  a  magnified  image  of  close-packed  pattern  with  isolated  defects 
(white  arrows);  F)  4000  rpm  zigzag  chain  region;  G)  4000  rpm  pseudo-linear  chain  region. 
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do-one-column-particle  region  (Fig.  2G).  Typical  sample  sizes 
were  ~  2  cm2;  consistent  and  reproducible  deposition  was  ob¬ 
served  over  more  than  80  %  of  the  sample  area  exclusive  of 
edge  areas  where  the  fluid  flow  was  complicated  by  geometri¬ 
cal  factors.  Spin-coating  is  widely  used  in  industry  for  much 
larger  samples  (300  mm  diameter  wafers)  and  we  expect  that 
these  results  will  readily  scale  to  much  larger  areas. 

The  silica  particle  size,  groove  width,  groove  profile,  and 
roughness  also  have  an  influence  on  particle  distribution.  A 
spin  speed  of  4000  rpm  was  used  for  this  set  of  experiments.  If 
the  grooves  were  narrowed  to  ~  80-120  nm  wide,  only  a  single 
column  or  a  zigzag  column  (~1.5  particles  wide)  of  ZL  silica 
particles  was  formed.  Overfilled  particles  appear  in  some  re¬ 
gions.  Figures  3A,B  show  that  one  column  of  ZL  is  found  for 
80  nm  wide  grooves;  however,  zigzag-shaped  particle  arrange¬ 
ments  appear  for  the  overfilled  case.  Figure  3C  shows  an  im¬ 
age  of  an  overfilled  two-column  particle  pattern  for  - 100  nm 
wide  grooves.  It  is  likely  that  the  combination  of  spinning 
force  and  physical  confinement  at  the  groove  side  wall  lead  to 
the  formation  of  these  tightly  packed,  overfilled  particle  pat¬ 
terns.  When  a  smaller  OL  particle  suspension  (50  nm  diame¬ 
ter)  was  used  with  the  same  samples,  a  larger  number  of  paral¬ 
lel  columns  of  nanoparticles  were  found  in  the  same  grooves 
(Figs.  3D,E).  Two  or  three  columns  of  OL  silica  particles  were 
assembled  into  - 120  nm  wide  grooves  (Fig.  3D)  while  four  or 
five  particle  wide  columns  of  OL  silica  particles  were  depos¬ 
ited  into  -200  nm  wide  grooves  (Fig.  3E).  Isolated  OL  silica 
particles  were  sparsely  adhered  on  to  the  mesa  tops.  This  was 
possibly  as  a  result  of  the  relatively  low  pH  value  (-4.8)  of 
the  OL  solution.  Figures  3D,E  show  that  the  smaller  OL  silica 
particles  demonstrated  a  lower  order.  This  was  partially  be¬ 


cause  the  smaller  silica  nanoparticles  have  a  larger  polydisper- 
sity  and  potentially  because  they  were  more  comparable  in 
scale  to  the  groove  roughness;  additionally  this  may  be  due  to 
the  fact  that  the  effect  of  confinement  on  packing  order  is 
expected  to  become  weaker  with  an  increase  in  ratio  of  con¬ 
finement  size  (groove  width)  to  the  silica  diameter  J24^ 

The  spin-coating-directed  self-assembly  of  silica  nanoparti¬ 
cles  into  2D  nanometer-scale  patterns  was  investigated  as 
well.  ZL  silica  nanoparticles  were  spun  into  nanometer-scale 
cylindrical  holes  in  an  Si02  layer  on  top  of  a  Si  substrate  with 
a  spin  speed  of  3000  rpm.  Two  patterned  samples  with  differ¬ 
ent  hole  diameters  were  tested  for  holding  silica  particles.  Fig¬ 
ures  4A,B  show  the  images  of  the  large-hole  patterned  sam¬ 
ple  before  spin-coating.  The  diameter  of  the  holes  was  200- 
230  nm  with  good  pattern  coverage  on  almost  the  entire  sam¬ 
ple  area  (Fig.  4A).  The  hole  sidewalls  were  vertical  and  the 
depth  (thickness  of  the  Si02  layer)  was  -60  nm  (Fig.  4B). 
After  spin-coating,  the  large  holes  held  one,  two,  three,  or 
four  ZL  particles  (Fig.  4C-F).  The  typical  polygonal  aggre¬ 
gates  on  this  large-hole  patterned  sample  were  the  colloidal 
cluster  of  triangle  (trimer)  and  square  (regular  quatramer) 
(Fig.  4C-E).  This  confirms  the  predictions  from  the  calcula¬ 
tion  of  colloidal  cluster  as  a  function  of  ratio  (D/d)  between 
the  hole  dimensions  (D)  and  the  diameter  of  silica  particles 
(d).[17,18]  Holes  with  two  particles  were  seldom  observed,  and 
a  hole  with  only  one  particle  was  rare  (Fig.  4D).  However,  in 
some  holes,  the  four  or  five  particles  would  overfill  the  holes 
(Figs.  4C,E).  Figure  4F  shows  typical  images  of  self-assembled 
clusters  inside  holes  and  overfilled  cases.  The  yield  of  clusters 
of  particles  in  cylindrical  holes  was  above  90  %.  Several  kinds 
of  polygonal  aggregates  exist  on  the  same  sample  because  of 
the  large  variation  of  diameter  (d)  for  silica 
nanop articles If  the  more  uniform  nanopar¬ 
ticles  were  available,  we  believe  that  well-de¬ 
fined  aggregates  could  be  obtained  in  nanome¬ 
ter-scale  patterned  structures. 

As  the  hole  diameter  was  decreased  to  90- 
120  nm,  the  number  of  particles  in  the  cluster 
was  decreased  as  expected  (Fig.  5).  In  general, 
monomers  and  dimers  were  the  typical  clusters 
inside  small  holes  (Figs.  5A,B).  The  maximum 
number  was  three  with  overfilled  packing.  The 
monomer  dominates  among  these  three  types 
of  clusters  (Fig.  5C).  Figure  5D  shows  typical 
images  of  self-assembled  clusters  inside  holes 
and  overfilled  cases.  Some  small  islands  of 
close-packed  particles  associated  with  a  hole 
position  were  also  observed  in  some  parts  of 
surface  (Figs.  4E,  5B,C). 

In  summary,  we  have  presented  a  simple,  in¬ 
expensive,  rapid,  and  reliable  approach  to  the 
assembly  of  silica  nanoparticles  (<  100  nm  di¬ 
ameter)  into  nanometer-scale  Si  grooves  and 
cylindrical  holes  using  a  conventional  spin¬ 
coating  process.  The  nanometer-scale-pat¬ 
terned  samples  were  prepared  using  interfero- 


Figure  3.  The  FE-SEM  images  of  80  nm  (ZL)  and  50  nm  (OL)  silica  particles  on  narrow- 
grooved  samples:  A)  one  column  of  ZL  particles  into  80  nm  grooves;  B)  zigzag  chain  of 
overfilled  ZL  particles  in  80  nm  grooves;  C)  two  columns  of  overfilled  ZL  particles  in 
100  nm  grooves;  D)  OL  particle  in  120  nm  grooves;  E)  OL  particle  in  200  nm  grooves. 
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Figure  4.  The  FE-SEM  images  of  80  nm  silica  particles  on  large  cylindrical-hole  samples  (dia.  200-230  nm):  A)  hole  pattern  before  spin-coating.  The 
inset  image  is  a  view  of  a  large  area  of  the  pattern;  B)  side  view  of  hole  pattern  before  spin-coating;  C,D)  typical  regions  with  particle  clusters;  E)  low 
magnification  of  typical  region;  F)  typical  clusters  of  particles. 


Figure  5.  FE-SEM  images  of  80  nm  diameter  silica  particles  on  small  cylindrical  hole  samples  (dia.  90-120  nm):  A,B)  typical  regions  with  particle  clus¬ 
ters;  C)  typical  region;  D)  typical  particle  clusters. 


metric  lithography.  The  width  of  grooves,  diameter  of  cylindri¬ 
cal  holes,  and  the  silica  particle  size  could  be  less  than  100  nm 
over  large  areas.  We  showed  that  the  silica  nanoparticle  distri¬ 
butions  depended  on  the  spin  speed,  the  pH  value,  the  groove 
geometry  (width,  depth,  sidewall  slope)  and  geometry  of  cy¬ 
lindrical  holes  (diameter,  depth),  and  the  particle  size.  Linear 
(one  particle  wide),  zigzag  (1.5  particles  wide)  chains,  and  two 
columns  of  silica  nanoparticles  could  be  deposited  into 
grooves  by  adjusting  the  above  parameters.  We  also  have 
demonstrated  that  long-range  2D  arrays  of  silica  nanoparti¬ 
cles  including  colloidal  clusters  of  monomers,  dimers,  trimers, 
and  regular  quatramers  on  patterned  samples  could  be  con¬ 
trolled  by  adjustment  of  the  ratio  between  the  diameters  of 
holes  and  particles.  Well-defined  clusters  of  2D  array  of  nano¬ 
particles  inside  2D  patterns  could  be  obtained  using  this  ap¬ 
proach  if  the  uniform  nanoparticles  were  employed.  We  be¬ 
lieve  that  extensions  of  this  work  will  lead  to  directed  self- 


assembly  of  particles  in  grooves  and  2D  structures.  This  will 
be  useful  for  applications  in  photonics  and  nanoelectronics. [18] 
A  wide  range  of  potential  applications  exist  for  the  directed 
self-assembly  of  functional  nanoparticles,  especially  biological 
moieties,  into  designed  patterns  using  this  approach.  Directed 
self-assembly  is  an  important  method  for  large-scale  pat¬ 
terned  nanostructured  materials  with  the  hierarchical  range  of 
length  scales  required  to  translate  nanometer-scale  phenom¬ 
ena  into  macroscopic  effects. 

Experimental 

The  fabrication  of  periodic  arrays  of  nanometer-scale  groove  pat¬ 
terns  in  silicon  wafers  includes  using  interferometric  lithography  (IL), 
lift-off,  and  reactive-ion  etching  (RIE).  Initially,  the  silicon  sample 
was  put  into  piranha  solution  and  BOE  20:1  solution  in  order  to  clean 
the  sample  surface.  Following  this,  an  antireflective  (ARC)  polymer 
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layer  (Brewer  Science  XHRIC-16)  was  spun  on  the  sample  at 
4000  rpm  for  30  s  and  oven  prebaked  at  175  °C  for  3  min.  Photoresist 
(PR;  Shipley  510A)  was  spun  on  the  ARC  at  4000  rpm  and  oven  pre¬ 
baked  for  3  min  at  95  °C.  IL  with  355  nm  third-harmonic  YAG  laser 
beam  was  used  to  produce  periodic  nanopatterns  in  the  PR  [9].  The 
angle  between  the  two  beams  determined  the  520  nm  pattern  period 
while  the  exposure  dose  and  development  time  affected  the  detailed 
pattern  shape.  After  exposure,  the  resist  was  post  baked  for  2  min  at 
110  °C  and  developed  using  a  MF-702  developer  for  45  s  followed  by 
a  rinse  with  deionized  (DI)  water.  Pattern  transfer  was  effected  via 
lift-off  of  a  thin  gold  film  (~  30  nm)  that  was  deposited  by  electron- 
beam  evaporation  to  form  a  hard  mask  and  RIE  of  the  underlying 
ARC  and  the  silicon.  Acetone  was  then  used  to  liftoff  the  PR  leaving 
a  hard  Au  mask  atop  the  ARC  layer.  Oxygen  plasma  was  used  to  etch 
through  the  ARC  for  3  min  with  the  power  of  50  W  while  a  mixture 
of  02  and  CHF3  (50:130)  was  used  to  etch  Si  for  3-6  min  at  a  power 
of  150  W  for  an  etch  depth  ranging  from  30  to  200  nm.  Finally,  the 
ARC/Au  layer  was  removed  in  an  ultrasonic  bath  using  piranha.  The 
fabrication  of  periodic  arrays  of  nanometer-scale  cylindrical  hole  was 
performed  on  Si  wafer  coated  with  a  60  nm  Si02  layer.  The  Si02  was 
thermally  grown  in  a  dry-02  oxidation  furnace.  The  processing  for 
fabrication  2D  patterns  was  similar  to  that  for  groove  structures  ex¬ 
cept  for  double  exposures  and  inductively  coupled  plasma  (ICP)  etch¬ 
ing  of  the  Si02.  After  the  samples  were  exposed  with  a  first  ID  grat¬ 
ing,  they  were  rotated  90°  and  exposed  a  second  time  to  obtain  2D 
photoresist  patterns  (here  posts  for  positive  photoresist).  After  lift-off 
of  a  Au  hard  mask  as  above,  an  ICP  system  was  next  used  to  etch  the 
Si02  in  the  regions  where  there  was  no  Au  hard  mask  in  etching  step. 
A  mixture  of  Ar  and  CHF3  (3:1)  was  used  to  etch  Si02  for  2  min  at 
the  RF  power  of  500  W,  bias  of  150  W,  and  the  pressure  of  10  mtorr. 
The  ICP  etching  could  etch  selectively  Si02  with  the  mixture  of  Ar 
and  CHF3  in  this  case. 

Two  particle  sizes  were  investigated.  Both  are  from  Nissan  Chemi¬ 
cal  Industries.  Ltd.  The  larger  silica  particles  (1  wt.-%,  ~ pH 8.8)  were 
prepared  by  diluting  Snowtex  ZL  (40.9  wt.-%,  ~  pH  9.6,  78+16  nm  di¬ 
ameter  [25])  with  DI  water.  The  smaller  silica  nanoparticles  (1  wt.-%, 
~ pH 4.8)  were  prepared  by  diluting  Snowtex  OL  (20  wt.-%,  ~pH3, 
50  ±13  nm  diameter)  with  DI  water.  HC1  or  NH4OH  were  used  to 
adjust  the  pH  value  of  the  water-based  colloidal  solutions.  Before 
spin-coating,  the  treatment  of  an  oxygen  plasma  RIE  was  used  in  or¬ 
der  to  ensure  that  the  sample  surface  was  hydrophilic.  Next,  the  silica 
nanoparticle  colloidal  solution  was  spun  onto  the  patterned  samples. 
Images  were  obtained  using  field  emission  scanning  electron  micros¬ 
copy  (FE-SEM)  (tilted  at  45°)  at  30  kV  with  a  10  nm  thick  Au  film  to 
minimize  charging. 
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Nanostructured  materials  have  received  considerable  atten¬ 
tion  because  of  their  novel  electronic,  optical,  magnetic,  and 
catalytic  properties.  Supermolecules,  such  as  surfactants  and 
copolymers,  are  valuable  structure-directing  agents  for  pre¬ 
paring  these  novel  materials  with  diverse  framework  composi¬ 
tions  (including  silicas,  metals,  metal  oxides/phosphates/sul¬ 
fides,  and  organosilicates).[1,2]  However,  this  process 
encounters  limitations  when  the  formation  of  some  inorganic 
networks  cannot  be  efficiently  controlled.  Recently,  inorganic 
templates  (such  as  mesoporous  silica,  mesoporous  carbon, 
nanocrystals)  have  been  used  to  synthesize  mesoporous  mate¬ 
rials  based  on  a  nanocasting  process. [3_6] 

Tin-based  oxides  have  attracted  much  attention,  as  they  of¬ 
fer  a  promising  carbon  alternative  to  negative-electrode  mate- 
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Lithographically  directed  deposition  of  silica  nanoparticles  using 
spin  coating 
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Directed  self-assembly  is  an  important  direction  for  the  extension  of  patterning  to  the  nanoscale 
regime  and  below.  Here,  examples  are  given  of  both  individual  particle  placement  and  composite 
structure  formation  involving  assemblies  of  many  nanometer  scale  particles,  both  on  patterned 
surfaces  and  on  planar  surfaces  where  the  photoresist  is  used  as  a  sacrificial  layer  to  define  the 
pattern  and  is  subsequently  removed.  The  underlying  technologies  employed  in  these  experiments 
include  interferometric  lithography  to  define  large-area,  nanometer- scale  patterns  and  directed 
self-assembly  by  spin  coating  to  control  particle  placement.  Three  sizes  of  silica  nanoparticles 
(mean  diameters:  78,  50,  and  15  nm)  were  employed  for  spin-coating  processes.  Single  linear  silica 
particle  chain  patterns  and  isolated  two-dimensional  particle  patterns  were  easily  formed  on 
patterned  surfaces.  Silica  particle  rows,  cross  networks,  and  isolated  posts  with  controllable 
thickness  could  be  formed  on  flat  surfaces  using  this  approach.  Directed  self-assembly  using 
nanoscale  lithography  and  spin  coating  is  a  facile  approach  to  the  extension  of  lithographic 
techniques  to  the  nanoscale.  ©  2004  American  Vacuum  Society.  [DOI:  10.1116/1.1821582] 


I.  INTRODUCTION 

Organized  patterns  of  nanoscale  particles  have  a  wide 
range  of  potential  applications  in  photonic  crystals,1,2 
biosensors,3,4  magneto-electronics,5  nanofluidics,6  and  other 
emerging  fields.  For  example,  arrays  of  small  magnetic  ele¬ 
ments  have  potential  application  to  high-density  data  storage 
devices  and  magnetic  random  access  memory  devices.  Iso¬ 
lated  particles  in  periodic  arrays  can  be  initiators  for  the 
growth  of  colloidal  crystals  with  long-range  order  and  pat¬ 
terned  growth  for  some  special  materials  using  catalyst 
particles.8  Top-down,  lithographic  surface  patterning  has 
been  achieved  by  electron  beam  lithography,9  photolithogra¬ 
phy,  interferometric  lithography,10  and  soft  lithography  while 
directed  assembly  of  particles  into  the  lithographically  de¬ 
fined  patterns  has  been  accomplished  by  sedimentation,9  par¬ 
allel  plate  fluidic  confinement,1  spin-coating,2,11  convective 
deposition  (directed  evaporation  induced  self-assembly),12,13 
and  electrostatic  (layer-by-layer)  deposition.14  Most  of  this 
work  has  been  focused  on  particle  diameters  from  above 
100  nm  to  ~  1  jam  or  larger. 

While  significant  advances  continue  to  be  made  in  con¬ 
ventional,  top-down,  lithographic  capabilities,  it  is  unques¬ 
tionable  that  the  extension  of  traditional  techniques  to  the 
10  nm  scale  and  below  is  going  to  be  increasingly  difficult. 
At  the  same  time,  bottom-up  self-assembly  techniques  have 
been  very  successful  at  nanometer  scales  and  below,  but  tend 
to  have  increasing  difficulty  as  the  scale  of  the  pattern  is 
increased,  for  example  with  grain  boundaries  arising  from 
independent  nucleation  of  colloidal  crystals  at  uncorrelated 
positions.  The  combination  of  these  two  approaches,  directed 
self-assembly,  is  an  interesting  alternative  that  will  allow  us 
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to  maintain  the  exquisite  hierarchical  capabilities  of  tradi¬ 
tional  lithography  at  larger  scales  while  extending  to  sub¬ 
nanometer  regimes  with  the  precision  and  accuracy  that 
molecular  forces  provide.  Directed  self-assembly — an  inte¬ 
gration  of  lithography  and  self-assembly — is  destined  to  play 
an  important  role  across  a  wide  range  of  nanotechnology 
applications. 

Interferometric  lithography  (IL)  can  be  used  to  produce 
highly  ordered,  periodic  nanostructure  arrays  over  macro¬ 
scopic  surface  areas  with  easily  varied  feature  dimensions. 
This  process  is  fast  and  inexpensive.  In  this  work,  we  use  IL 
to  define  nanometer- scale  patterns.  Meanwhile,  spin  coating 
has  been  employed  to  rapidly  form  two-dimensional  (2D) 
colloidal  crystals.  Outstanding  advantages  of  the  spin¬ 
coating  process  include  its  rapidity,  the  low  volume  of  solute 
required,  and  the  availability  of  highly  capable  instrumenta¬ 
tion  as  a  result  of  its  widespread  use  in  the  semiconductor 
industry.  The  spin-coating  approach  is  employed  in  this  work 
to  deposit  silica  nanoparticles  on  patterned  surfaces  with 
both  hard-mask  (e.g.,  Si02)  and  soft-mask  (photoresist)  de¬ 
fined  patterns.  For  the  soft-mask  patterns,  a  “lost  wax”  pro¬ 
cess  is  used  to  remove  the  photoresist  after  the  spin  coating, 
leaving  the  particle  patterns  on  a  flat  surface.15  Continuous 
and  isolated  nanoparticle  patterns  are  generated  and  may 
prove  useful  as  templates  for  nanoscale  patterned  growth  and 
nanosphere  optics,  and  as  building  blocks  for  more  complex 
nanostructures. 

II.  EXPERIMENT 
A.  Interferometric  lithography  (IL) 

In  interferometric  lithography,  two  coherent  laser  beams 
with  wavelength  X  are  crossed  at  an  angle  26  to  produce  a 
periodic  interference  pattern  on  a  photoresist  (PR)  film  with 
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the  spatial  period  d=\/(2sin  0)}6  The  angle  between  the 
two  beams  determines  the  pattern  period  while  the  exposure 
time,  power  density,  and  development  time  impact  the  line- 
width  and  the  pattern  morphology.  With  an  ultraviolet  light 
source  (\  =  355  nm,  third-harmonic  from  a  Nd:YAG  laser), 
one  can  easily  obtain  periods  on  the  order  of  hundreds  of 
nanometers. 

B.  Silica  nanoparticle  suspensions 

Three  sizes  of  silica  nanoparticles  were  employed  in  the 
spin-coating  process.  They  are  commercial  products  of  Nis¬ 
san  Chemical  America  Corp.  (Houston,  TX).  The  larger 
silica  particles  were  Snowtex  ZL  with  78  nm  mean  diameter. 
The  medium  silica  nanoparticles  were  Snowtex  OL  with 
50  nm  mean  diameter.  The  smaller  silica  nanoparticles  were 
Snowtex  C  with  15  nm  mean  diameter.  The  desired  silica 
nanoparticle  concentrations  were  prepared  by  diluting  corre¬ 
sponding  Snowtex  products  with  deionized  (DI)  water. 

C.  Directed  deposition  of  silica  nanoparticles  on 
patterned  surfaces 

The  first  step  is  to  fabricate  the  periodically  patterned 
grooves  and  holes  on  a  silicon  wafer  with  a  thin  Si02  layer. 
Initially,  the  silicon  sample  was  put  into  piranha  solution 
(sulfuric  acid  and  hydrogen  peroxide  with  volume  ratio  4:1) 
for  8  min,  subsequently  into  buffered  oxide  etchant  (BOE) 
20: 1  solution  for  2  min  to  clean  the  sample  surface.  A  thin 
Si02  layer  (—60  nm)  was  prepared  with  dry  thermal  oxida¬ 
tion  at  900  to  1000  °C.  Then  an  antireflective  coating 
(ARC)  layer  (XHRiC-16,  Brewer  Science,  Inc.,  Rolla,  MO) 
was  spun  on  the  sample  at  4000  rpm  for  30  s  and  oven- 
baked  at  175  °C  for  3  min  yielding  a  final  thickness  of 
— 150  nm.  Diluted  photoresist  (SPR  510A,  Shipley,  Inc., 
Marlborough,  MA)  was  spun  atop  the  ARC  at  4000  rpm  fol¬ 
lowed  by  a  similar  oven  bake  for  3  min  at  95  °  C  for  a  final 
thickness  of  —250  nm. 

Subsequently,  IL  was  used  to  produce  the  PR  patterns. 
The  exposure  times  were  20-45  s  with  power  50  or  150  mJ. 
A  2  min  post-exposure  bake  at  110  °C  was  used.  The 
sample,  after  exposure  and  post-bake,  was  developed  (MF- 
702,  Shipley,  Inc.)  for  45  s  and  rinsed  with  DI  water.  After 
definition  of  the  PR  pattern,  an  e-beam  deposition  of  gold 
(Au)  was  carried  out  at  a  pressure  of  10-6  mTorr  with  a 
deposition  rate  of  —1  A/s  until  a  thickness  of  —300  A  was 
reached  to  provide  a  hard  mask.  Acetone  was  then  used  to 
dissolve  and  lift  off  the  remaining  regions  of  PR  with  Au 
capping  layers.  The  sample  was  then  put  in  the  vacuum 
chamber  of  a  reactive  ion  etching  (RIE)  system  to  etch  the 
ARC  and  silicon  oxide  layers  in  the  regions  where  there  was 
no  protecting  Au  hard  mask.  Oxygen  was  used  to  etch  ARC 
for  3  min  at  a  power  of  50  W,  a  pressure  of  50  mTorr  while 
a  mixture  of  02  and  CHF3  (pressure  ratio:  50:130)  was  used 
to  etch  Si02  for  3  to  —  6  min  at  a  power  of  150  W.  Finally, 
after  groove  patterns  were  produced  in  the  Si02  layer  by 
RIE,  piranha  was  used  in  an  ultrasonic  bath  to  remove  the 
ARC  layer  along  with  the  top  Au  capping  layer.  Hole 
samples  with  60-nm-thick  Si02  were  prepared  similarly  ex¬ 


cept  for  double-IE  exposures  with  crossed  ID  patterns  and 
inductively  coupled  plasma  (ICP)  etching  of  the  Si02.  After 
the  samples  were  exposed  with  a  first  ID  grating,  they  were 
rotated  90°  and  exposed  a  second  time  to  obtain  2D  positive 
photoresist  patterns.  After  lift  off  of  an  Au  hard  mask  as 
above,  an  ICP  system  was  next  used  to  etch  the  Si02  in  the 
regions  where  there  was  no  Au  hard  mask  in  etching  step.  A 
mixture  of  Ar  and  CHF3  (3:1)  was  used  to  etch  60  nm  Si02 
for  2  min  at  the  rf  power  of  500  W,  bias  of  150  W,  and  the 
pressure  of  10  mTorr.  The  ICP  process  selectively  etches 
Si02  with  vertical  sidewalls. 

In  the  deposition  step,  an  aqueous  silica  nanoparticle  sus¬ 
pension  (1  wt  %)  was  spun  on  the  patterned  sample.  Before 
spin  coating,  an  oxygen  plasma  RIE  (1  to  —2  min)  was 
used  to  ensure  that  the  Si02  surface  was  hydrophilic.  One 
drop  of  suspension  solution  was  applied  on  the  clean  surface 
of  the  sample.  The  spin  coating  was  30  s  at  3000  rpm  with 
an  acceleration  of  100  rpm/s.  After  spin  coating,  the  sample 
was  baked  at  90  °C  for  5  min  to  drive  off  any  residual  sol¬ 
vent.  The  resulting  structures  were  characterized  by  field- 
emission  scanning-electron  microscopy  (FE-SEM)  at  30  kV 
with  a  10  nm  thick  Au  film  sputtered  on  the  sample  surface. 
Cross-section  views  of  images  were  also  obtained  to  check 
the  structure  of  the  ordered  patterns. 

D.  Directed  deposition  of  silica  nanoparticles  on 
planar  surfaces  using  soft  PR  masks 

In  the  process  of  the  directed  deposition  of  silica  nanopar¬ 
ticles  on  planar  surfaces,  a  wet  developable  BARC  for  I-line 
lithography  (wet-i™  10-7,  Brewer  Science,  Inc.,  Rolla,  MO) 
was  spun  on  a  precleaned  sample  at  3000  rpm  for  30  s.  The 
baking  time  for  wet-i  was  90  s  at  200  °C.  A  positive-tone 
photoresist  (SPR510A,  Shipley,  Inc.)  diluted  to  produce  a 
— 200-nm-thick  film  was  used  in  these  experiments.  IL  was 
used  to  produce  the  periodic  patterns  on  this  PR/wet-i  film 
stack.  Parallel  PR/wet-i  lines  (ID)  were  formed  on  the  Si 
wafer  surface  after  exposing,  baking,  and  developing.  For  2D 
arrays  of  nanoscale  posts,  a  double  exposure  with  a  90°  ro¬ 
tation  of  the  sample  was  performed,  and  all  other  steps  were 
the  same  as  those  for  ID  patterns.  For  2D  arrays  of  nanoscale 
holes,  a  negative-tone  photoresist  (NR7-500P,  Futurrex,  Inc., 
Franklin,  NJ)  was  used  in  IL. 

After  the  PR  patterning,  silica  nanoparticle  colloidal  dis¬ 
persions  (5  wt  %)  were  spun  coated  onto  these  samples.  In 
general,  the  spin  coating  protocol  was  30  s  at  4000  rpm  with 
an  acceleration  of  100  rpm/s2.  Suspensions  of  small  silica 
nanoparticles  (OL:50nm,  C:15  nm)  were  utilized  for  di¬ 
rected  assembly  of  ID  and  2D  periodic  particle  structures 
supported  on  flat  surfaces.  The  suspension  was  agitated  for 
5  min  in  an  ultrasonic  bath  before  the  coating  step. 

The  final  step  was  to  strip  off  the  PR/wet-i  patterns  leav¬ 
ing  the  ID  or  2D  ordered  patterns  of  silica  nanoparticles  on 
the  flat  surface  of  the  Si  wafer.  Two  methods — immersion  in 
piranha  solution  and  calcination  at  high  temperature — were 
successful  in  completely  removing  the  PR/wet-i  while  not 
disturbing  the  silica  particle  arrays.  In  general,  we  soaked  the 
as-patterned  samples  in  1:1  piranha  solution  for  15  min  for 
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Fig.  1.  Field  emission  scanning  electron  microscope 
(FE-SEM)  images  (tilted  45°  view)  of  ~78-nm  diam¬ 
eter  silica  particles  on  grooves  (ID)  and  holes  (2D):  (a) 
~100-nm-wide  groove  sample;  (b)  —  80-nm-wide 
groove  sample;  (c)  —60-nm-wide  groove  sample;  (d) 
^50-nm-wide  groove  sample;  and  (e)-(f)  —100  nm 
hole  sample. 


the  immersion  method,  while  the  as-patterned  samples  were 
put  into  furnace  at  800  °C  for  1.5-2  h  for  the  calcination 
method.  In  the  calcination  approach,  the  mechanical  stability 
of  particle  patterns  on  flat  surfaces  was  enhanced  as  a  result 
of  partial  sintering  between  the  silica  particles  and  the  Si 
surface  at  these  elevated  temperatures. 

III.  RESULTS  AND  DISCUSSION 

A.  Directed  deposition  of  silica  nanoparticles  on 
patterned  surfaces 

Several  methods  for  deposition  of  silica  particles  on  pat¬ 
terned  surfaces  were  investigated  before  the  spin-coating  ap¬ 
proach  was  adopted.  These  included  convective  deposition 
and  dip  coating.  In  convective  deposition,  the  sample  was 
mounted  vertically  in  a  beaker  containing  diluted  aqueous 
solution  of  silica  particles.  The  beaker  was  kept  in  a 
temperature-controlled  oven.  The  temperature  was  always 
maintained  at  55-60  °C  and  the  fluid  was  gradually  evapo¬ 
rated.  In  reverse  dip  coating  (bottom-flow  deposition),  the 
sample  was  again  mounted  vertically  in  a  beaker  containing 
the  silica  solution.  A  drain  port  and  valve  at  the  bottom  of  the 
beaker  allowed  for  controlled  removal  of  the  liquid  leaving 
behind  the  deposited  particles  on  the  sample,  essentially  the 
inverse  process  of  lifting  the  sample  out  of  the  liquid.  Even 
though  in  some  cases  the  results  for  dip  coating  and  convec¬ 
tive  deposition  are  similar  to  those  for  spin  coating,  a  large 
amount  of  silica  particle  suspension  is  consumed  in  each 
experiment  and  the  deposition  time  is  considerably  longer 


than  for  spin  coating.  Moreover,  for  these  processes  the  dis¬ 
tribution  of  silica  particles  on  patterned  samples  was  rela¬ 
tively  nonuniform  across  the  sample,  and  alternating  bands 
of  dense  and  sparse  particle  deposition  were  observed.  This 
phenomenon  of  multiple  band  patterns  for  evaporative  depo¬ 
sitions  has  been  previously  observed.17  To  fabricate  a  uni¬ 
form  particle  distribution,  it  is  required  to  reduce  the  evapo¬ 
ration  rate  and/or  to  tilt  the  sample  at  an  angle  greater  than 
10°.  Based  on  these  issues  and  its  comparative  success  and 
ease  of  use,  spin  coating  was  adopted  as  the  principle  ap¬ 
proach  to  the  deposition  of  silica  particles  on  patterned 
samples. 

Several  factors  effect  the  spin-coating  deposition  of  silica 
nanoparticles  on  patterned  samples,  including  the  spin  speed, 
the  solution  pH  value,  and  the  surface  pattern  geometry.10 
We  investigated  the  influence  of  these  factors  in  preliminary 
experiments.  Silica  particles  were  deposited  selectively  into 
the  grooves  or  holes  during  spin  coating  using  silica  suspen¬ 
sions  with  pH ^7.  This  pH  ensured  that  the  particles  re¬ 
mained  negatively  charged  and  separated  in  the  solution.  For 
more  acidic  solutions,  the  particles  tended  to  agglomerate  in 
solution  and  deposit  in  large  clumps  independent  of  the  sur¬ 
face  patterning.  The  spinning  program  was  key  to  ensuring 
the  deposition  was  exclusively  in  the  grooves  or  holes  rather 
than  on  the  top  surface  of  wafer.  Our  results  show  that  a 
relatively  high  speed  of  spin  coating  was  helpful  for  deposi¬ 
tion  of  particle  on  patterned  surfaces.  The  speed  of  4000  rpm 
was  used  for  the  experiments  reported  here.  The  concentra- 
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Fig.  2.  FE-SEM  images  of  ID  particle  arrays  with  —50  nm  diameter  silica  particles  on  planar  surfaces:  (a)  large  area  image  (tilted  45°  view)  with  1000  nm 
period;  (b)  magnified  cross-section  image  with  1000  nm  period;  (c)  large  area  cross-section  image  with  500  nm  period  after  two  cycles  of  spin-coating;  and 
(d)  magnified  cross-section  image  with  500  nm  period  after  two  cycles  of  spin-coating. 


tion  of  silica  particles  also  has  an  effect  on  the  formation  of 
the  particle  pattern  found  on  the  patterned  surfaces.  If  an 
excessively  high  concentration  is  used,  a  thick  deposition  of 
silica  particles  that  fully  covers  the  wafer  surface  is  formed 
without  any  impact  of  the  lithographic  patterning.  At  the 
other  extreme  of  very  low  concentrations,  only  a  sparsely 
populated  distribution  of  particles  forms  within  the  patterned 
features.  Based  on  our  initial  screening  experiments,  a 
1  wt  %  suspension  was  found  to  be  most  suitable  for  di¬ 
rected  deposition  of  ZL  silica  particles  on  patterned  surfaces 
for  these  experiments.  Due  to  accelerated  evaporation  during 
spin  coating,  factors  such  as  humidity  and  ambient  room 
temperature  have  little  effect  on  the  particle  patterns. 

The  directed  deposition  of  silica  nanoparticles  on  narrow- 
groove  samples  was  studied.  These  ID  patterned  samples 
have  ^100-nm-wide  grooves  with  a  500  nm  period.  A  di¬ 
luted  suspension  of  78+16-nm  diameter  silica  nanoparticles 
(1  wt  % ,  pH —  8.8)  was  prepared  by  diluting  Snowtex  ZL 
(40.9%  wt  %,  — pH9.6)  with  DI  water.  The  typical  deposition 
patterns  are  shown  in  Fig.  1.  In  Fig.  1(a),  the  grooves  are 
relatively  wide  (—100  nm)  and  deep,  so  the  silica  particles 
are  almost  completely  contained  within  the  grooves.  Very 
few  silica  particles  settle  on  the  flat  surface  of  the  samples 
away  from  the  grooves,  while  some  voids  exist  along  the 
particle  chains.  Due  to  the  large  distribution  of  silica  particle 
sizes,  zigzag  chains  of  silica  particles  were  observed  in  some 
parts  of  the  grooves.  In  Figs.  1(b)  and  1(c),  the  grooves  are 
narrower  (—60-80  nm)  and  half-buried  particle  chains  are 
formed  along  the  parallel  grooves.  In  addition,  void  defects 
appear  more  often  and  exhibit  longer  gaps  on  these  samples. 


In  Fig.  1(d),  the  width  of  grooves  is  less  than  50  nm.  The 
bulk  of  the  silica  particles  protrude  above  the  surface  of  the 
sample.  A  much  higher  density  of  void  defects  exist  on  this 
sub-50  nm  grooved  sample. 

The  more  important  forces  in  this  deposition  process  are 
the  centripetal  force  associated  with  the  spinning,  the 
particle- wall  blocking  forces,  and  surface  tension  forces.  As 
the  width  of  grooves  decreases,  the  blocking  force  from  the 
sidewall  of  the  groove  becomes  weaker  compared  to  the  cen¬ 
tripetal  forces.  The  silica  particles  are  easier  to  dislodge  from 
the  narrow  grooves.  In  order  to  obtain  good  coverage  of 
particles  into  grooves,  it  is  important  to  match  the  particle 
size  to  groove  spacing. 

The  deposition  of  the  78-nm  diameter  silica  particles  on 
2D  patterned  samples  was  investigated  as  well.  The  diameter 
of  holes  in  the  2D  pattern  is  — 100  nm  at  a  period  of  500  nm. 
In  most  holes,  single  particles  (monomer)  or  particle  pairs 
(dimers)  are  found  [Figs.  1(e)  and  1(f)].  For  the  monomer 
case,  the  individual  particle  is  held  inside  a  hole.  Occasion¬ 
ally,  some  holes  are  empty  without  any  particles  while  some 
particles  appear  on  the  surfaces.  For  the  dimer  case,  the  two 
particles  overfill  the  holes.  In  rarer  cases,  trimers  are  found 
as  shown  in  Fig.  1(f).  Some  clusters  of  particles  also  appear 
on  some  parts  of  surfaces.  We  attribute  the  multiple  modes  of 
particle  clusters  to  the  wide  distribution  of  silica  particle 
sizes  and  also  to  the  variation  of  hole  size.  The  types  of 
particle  clusters  follow  the  predictions  from  the  calculation 
of  colloidal  cluster  coordination  as  function  of  ratio  (Did) 
between  the  hole  dimensions  (D)  and  the  diameter  of  silica 
particles  (J).18 
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Fig.  3.  FE-SEM  images  (tilted  45°  view)  of  2D,  50-nm-diameter  particle  arrays  on  planar  surfaces  with  —500  nm  periods:  (a)  PR/wet-i  posts  for  continuous 
cross  network  of  particles  before  deposition  of  silica  particles  with  spin-coating;  (b)  magnified  image  of  continuous  cross  network  with  —50  nm  silica  particle; 
and  (c)  large  area  image  of  continuous  cross  network  with  —50  nm  silica  particle. 


B.  Directed  deposition  of  silica  nanoparticles  on 
planar  surfaces 

Two  types  of  small  silica  nanoparticles  were  used  for  di¬ 
rected  deposition  using  soft  PR  masks  on  planar  surfaces. 
They  have  the  mean  diameters  of  50+13  nm  (OL)  and 
15  +  5  nm  (C),  respectively.  Approximately  5  wt  %  aqueous 
suspensions  of  OL  and  C  were  used.  The  pH  values  of  sus¬ 
pensions  have  less  influence  on  the  final,  multilayer,  close- 
packed  particle  patterns  on  planar  surfaces  using  small  silica 
nanoparticles.  Figure  2  shows  ID  50-nm-diameter  particle 
patterns  on  planar  surfaces  after  removal  of  the  PR  lines.  In 
Figs.  2(a)  and  2(b),  the  period  is  1000  nm  and  a  single  cycle 
of  spin  coating  was  used.  Parallel  silica  particle  rows  are 
observed  over  a  large  area  [Fig.  2(a)].  The  particle  deposit 
shows  a  nonuniformity  across  the  line  with  thicker  deposits 
at  both  edges  [Fig.  2(b)].  These  are  due  to  the  high  spin 
speed  and  relatively  small  ratio  between  particle  diameter 
and  width  of  the  lines.19  This  “edge-high”  effect  was  also 
observed  for  ID  patterns  with  a  500  nm  period.  Figures  2(c) 
and  2(d)  show  the  ID  patterns  with  a  500  nm  period  after 
two  cycles  of  spin  coating.  With  the  added  deposition,  both 
the  thickness  is  increased  and  the  bending  of  the  deposition 
at  the  edges  of  the  lines  is  reduced.  The  decreased  blocking 
force  from  the  relatively  low  PR  walls  in  the  second  cycle  of 
spin  coating  resulted  in  the  reduced  bending  of  the  deposited 
particle  film  at  the  line  edges. 

Typical  images  of  2D  patterns  of  silica  nanoparticles  de¬ 
posited  on  flat  surfaces  are  shown  in  Figs.  3  and  4.  The 
period  for  each  of  these  2D  samples  was  500  nm.  Figure  3(a) 
shows  the  PR/wet-i  posts  after  double  exposures  and  before 


deposition  of  silica  particles.  After  the  deposition  of  silica 
nanoparticles  and  removal  of  PR/wet-i  posts,  a  continuous 
cross  network  of  50-nm-diameter  silica  particles  with  peri¬ 
odic  voids  is  formed  over  a  large  area  [Fig.  3(c)].  Two  layers 
of  silica  particles  were  deposited  to  form  this  network  [Fig. 
3(b)].  Figure  4(a)  shows  the  PR/wet-i  holes  after  exposure 
and  development  and  before  the  silica  particle  deposition.  A 
negative  PR  with  a  thickness  of  500  nm  was  used.  Slight 
standing  wave  ridges  were  formed  along  hole  walls  even 
though  an  ARC  layer  was  used.  Before  spin-coating  deposi¬ 
tion  of  silica  particles,  the  PR/wet-i  layer  has  a  periodic  array 
of  — 200-nm-diameter  empty  holes.  After  a  spin-coating 
deposition  of  the  smaller  —  15-nm-diameter  silica  nanopar¬ 
ticles  C  and  removal  of  the  photoresist,  2D  isolated  posts  are 
formed  (—15  nm)  as  shown  in  Fig.  4(b).  (The  resolution  of 
the  FE-SEM  is  not  sufficient  to  distinguish  the  individual 
particles  at  this  small  size.)  The  posts,  formed  with  a  single 
deposition  step,  have  concave  top  surface  profiles,  similar  to 
the  ID  deposition  shown  in  Fig.  2(b),  due  to  the  high  spin 
speed  and  relatively  small  ratio  between  particle  and  post 
diameters.  In  contrast,  uniform  isolated  post  patterns  were 
not  successfully  obtained  using  the  larger,  50  nm  particles. 
We  attribute  this  to  the  difficulty  of  filling  these  deep  PR 
holes  with  comparatively  “large”  particles  and  the  possibility 
of  disturbing  the  isolated  particle  clusters  during  the 
immersion/calcination  treatments. 

From  the  above  experiments,  we  can  see  that  the  particle 
size,  pattern  feature,  and  spin  program  are  important  factors 
influencing  the  final  morphology  of  particle  patterns  on  pla¬ 
nar  surfaces.  On  the  one  hand,  it  is  easier  to  fill  empty  spaces 


Fig.  4.  FE-SEM  images  (tilted  45°  view)  of  2D  particle  arrays  with  —  15-nm-diameter  silica  particles  on  planar  surfaces  with  —500  nm  periods:  (a)  PR/wet-i 
holes  and  (b)  isolated  particle  posts. 
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in  PR  patterns  with  smaller  particles.  On  the  other  hand,  the 
smaller  the  ratio  of  particle  diameter  to  feature  pattern  size, 
the  more  pronounced  the  “edge-high”  effect.  Other  deposi¬ 
tion  methods  such  as  layer-by-layer  can  avoid  the  “edge- 
high”  effect  but  with  the  consumption  of  a  large  volume  of 
suspension  and  need  for  a  long  deposition  time.  Continuous 
particle  networks  are  easier  to  form  and  exhibit  fewer  defects 
compared  to  isolated  particle  patterns.  By  applying  more 
cycles  of  spin  coating,  the  “edge-high”  effect  can  be  mini¬ 
mized  and  complex  structures  can  be  obtained. 

IV.  CONCLUSIONS 

ID  and  2D  patterns  of  silica  nanoparticles  were  fabricated 
both  on  patterned  surfaces  and  on  planar  surfaces  using  in¬ 
terferometric  lithography  and  spin-coating.  Single  linear 
silica  particle  chain  patterns  and  isolated  2D  particle  patterns 
were  easily  formed  on  the  patterned  surfaces  with  feature 
sizes  around  100  nm.  Silica  particle  rows,  cross  networks, 
and  isolated  posts  with  controllable  thickness  could  be 
formed  on  flat  surface  with  the  smaller  silica  particles 
(^50  nm)  using  this  approach.  These  initial  experiments  il¬ 
lustrate  some  of  the  possibilities  of  extending  patterning  by 
combining  top-down  and  bottom-up  processes.  Future  work 
will  include  extensions  to  smaller  particles  and  smaller  litho¬ 
graphic  features,  as  well  as  the  extension  to  functionalized 
nanoparticles  such  as  metals  and  biological  species. 
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